2026-04-14T21:02:04,356 Created temporary directory: /tmp/pip-ephem-wheel-cache-dz1g2k3h 2026-04-14T21:02:04,358 Created temporary directory: /tmp/pip-build-tracker-vct4xkk1 2026-04-14T21:02:04,359 Initialized build tracking at /tmp/pip-build-tracker-vct4xkk1 2026-04-14T21:02:04,359 Created build tracker: /tmp/pip-build-tracker-vct4xkk1 2026-04-14T21:02:04,360 Entered build tracker: /tmp/pip-build-tracker-vct4xkk1 2026-04-14T21:02:04,360 Created temporary directory: /tmp/pip-wheel-wsede60p 2026-04-14T21:02:04,364 DEPRECATION: --no-binary currently disables reading from the cache of locally built wheels. In the future --no-binary will not influence the wheel cache. pip 23.1 will enforce this behaviour change. A possible replacement is to use the --no-cache-dir option. You can use the flag --use-feature=no-binary-enable-wheel-cache to test the upcoming behaviour. Discussion can be found at https://github.com/pypa/pip/issues/11453 2026-04-14T21:02:04,366 Created temporary directory: /tmp/pip-ephem-wheel-cache-r285t4ef 2026-04-14T21:02:04,388 Looking in indexes: https://pypi.org/simple, https://www.piwheels.org/simple 2026-04-14T21:02:04,391 2 location(s) to search for versions of flashinfer-python: 2026-04-14T21:02:04,391 * https://pypi.org/simple/flashinfer-python/ 2026-04-14T21:02:04,391 * https://www.piwheels.org/simple/flashinfer-python/ 2026-04-14T21:02:04,392 Fetching project page and analyzing links: https://pypi.org/simple/flashinfer-python/ 2026-04-14T21:02:04,393 Getting page https://pypi.org/simple/flashinfer-python/ 2026-04-14T21:02:04,394 Found index url https://pypi.org/simple 2026-04-14T21:02:04,547 Fetched page https://pypi.org/simple/flashinfer-python/ as application/vnd.pypi.simple.v1+json 2026-04-14T21:02:04,562 Found link https://files.pythonhosted.org/packages/6c/e9/5d6adcf888922a17c6fc52a0e5bed78785239af1219f41e1073b063a07ff/flashinfer_python-0.2.0.post1.tar.gz (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.8), version: 0.2.0.post1 2026-04-14T21:02:04,564 Found link https://files.pythonhosted.org/packages/c8/39/bac839234a3beaab4292e489b4d8941cc97ba4f76474aff0407d7b05a84f/flashinfer_python-0.2.0.post2.tar.gz (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.8), version: 0.2.0.post2 2026-04-14T21:02:04,565 Found link https://files.pythonhosted.org/packages/94/74/4dda2a7a7aa08bcfb8039faf2202bf0fea6b378d0d4968864737400fc329/flashinfer_python-0.2.1.tar.gz (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.8), version: 0.2.1 2026-04-14T21:02:04,566 Found link https://files.pythonhosted.org/packages/7f/3d/aab500609825108d3f6a4b440a7eeb6436d578d3e781e97ea015fd49a530/flashinfer_python-0.2.1.post1.tar.gz (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.8), version: 0.2.1.post1 2026-04-14T21:02:04,567 Found link https://files.pythonhosted.org/packages/30/ac/afd1d2c472857be8f83389eb506e1413a2ac3a603889bea3cf24d5ab5be5/flashinfer_python-0.2.1.post2.tar.gz (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.8), version: 0.2.1.post2 2026-04-14T21:02:04,569 Found link https://files.pythonhosted.org/packages/90/00/833dd50745bc15bb7a7451b77589d444ce963d48c0cb730b4760bfebffad/flashinfer_python-0.2.2.tar.gz (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.8), version: 0.2.2 2026-04-14T21:02:04,570 Found link https://files.pythonhosted.org/packages/02/cc/db9635c56653d3fa5a28f14ac858e0801de621aa33d3b528e4781aee906f/flashinfer_python-0.2.2.post1.tar.gz (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.8), version: 0.2.2.post1 2026-04-14T21:02:04,571 Found link https://files.pythonhosted.org/packages/b6/10/2a63f1d09c5b337705236005dc9ccce513dcc08b7fd037cb40426f1695b1/flashinfer_python-0.2.3.tar.gz (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.8), version: 0.2.3 2026-04-14T21:02:04,572 Found link https://files.pythonhosted.org/packages/a4/e5/8d193ccf65b92c009c4be50fdffa88fa0edc8fd6e6169bacaca6bab84d89/flashinfer_python-0.2.4.tar.gz (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.8), version: 0.2.4 2026-04-14T21:02:04,573 Found link https://files.pythonhosted.org/packages/b2/c4/9ec0f79e2480fc5c93307c4a1ac903e5cf33c551c0eaeb648196234b55af/flashinfer_python-0.2.5.tar.gz (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.8), version: 0.2.5 2026-04-14T21:02:04,575 Found link https://files.pythonhosted.org/packages/95/4a/a3109d57463d25a153b16c0d0f06495e4d18b727c81f8e08e42e97faaf45/flashinfer_python-0.2.6.tar.gz (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.9), version: 0.2.6 2026-04-14T21:02:04,576 Found link https://files.pythonhosted.org/packages/34/26/3c6f12ffaefbfa0c453030d6e15941269b3a4ffcd267daec32d1a10dda96/flashinfer_python-0.2.6.post1.tar.gz (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.9), version: 0.2.6.post1 2026-04-14T21:02:04,577 Found link https://files.pythonhosted.org/packages/f9/a0/5e700751f2393a504bc5eb2879e77d783a5b70778a254289711323126abc/flashinfer_python-0.2.7.tar.gz (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.9), version: 0.2.7 2026-04-14T21:02:04,578 Found link https://files.pythonhosted.org/packages/c0/10/43cf1ea7a03ca8e75a185190708e48286e1583d781e93d1de130e5d450ca/flashinfer_python-0.2.7.post1.tar.gz (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.9), version: 0.2.7.post1 2026-04-14T21:02:04,579 Found link https://files.pythonhosted.org/packages/f1/80/8dfae62d04af4597d7615b892f346ace68bcb07dfbef2a9e614219d96a8a/flashinfer_python-0.2.8rc1.tar.gz (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.9), version: 0.2.8rc1 2026-04-14T21:02:04,580 Found link https://files.pythonhosted.org/packages/72/0e/827624993516e80f62ba88dd368ad5e180c41324f063c00d27fa638a430e/flashinfer_python-0.2.8.tar.gz (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.9), version: 0.2.8 2026-04-14T21:02:04,581 Found link https://files.pythonhosted.org/packages/17/50/42afc9a81031939140fcbfd93e5a3652dc4995e338b4e6d007b0dda04f93/flashinfer_python-0.2.9rc1.tar.gz (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.9), version: 0.2.9rc1 2026-04-14T21:02:04,582 Found link https://files.pythonhosted.org/packages/ed/1a/9f30eda3178ed2f5f7e311ae0011d02c4542d087f84c9247e4b30668b767/flashinfer_python-0.2.9rc2.tar.gz (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.9), version: 0.2.9rc2 2026-04-14T21:02:04,583 Found link https://files.pythonhosted.org/packages/45/fc/4deff13f1420cc6e5871b7505a6c0d9031eb49cd09571ae576aec59bed61/flashinfer_python-0.2.9.tar.gz (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.9), version: 0.2.9 2026-04-14T21:02:04,584 Found link https://files.pythonhosted.org/packages/74/e4/2c6d6a19d13ed13d4863f6900febe72b502334e43292d5fe9a1ac2f6c5be/flashinfer_python-0.2.10.tar.gz (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.9), version: 0.2.10 2026-04-14T21:02:04,585 Found link https://files.pythonhosted.org/packages/72/8b/f315dda5993d1c018ca5ecfef0775c6a3c7a8f59ac426fabb7f3f6b93482/flashinfer_python-0.2.11.tar.gz (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.9), version: 0.2.11 2026-04-14T21:02:04,586 Found link https://files.pythonhosted.org/packages/37/e3/2e8e31f7f7ee26f39968264e4fcf74f9810d90e940859016d974106ed5c6/flashinfer_python-0.2.11.post1.tar.gz (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.9), version: 0.2.11.post1 2026-04-14T21:02:04,587 Found link https://files.pythonhosted.org/packages/b6/01/fa069f076cfe5bed34ddc3b7f772aa09c70e03e572dd9d3569ff887f33b1/flashinfer_python-0.2.11.post2.tar.gz (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.9), version: 0.2.11.post2 2026-04-14T21:02:04,588 Found link https://files.pythonhosted.org/packages/a3/09/5d89ef0bc2d19d3ebcf3b9fa621c945909f681818c9d55aa3181921db874/flashinfer_python-0.2.11.post3.tar.gz (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.9), version: 0.2.11.post3 2026-04-14T21:02:04,589 Found link https://files.pythonhosted.org/packages/b9/5a/7a839afb07af313549b9d9f1057b02aaf067f020267d5a9d128e50596bf4/flashinfer_python-0.2.12.tar.gz (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.9), version: 0.2.12 2026-04-14T21:02:04,590 Found link https://files.pythonhosted.org/packages/f2/20/e79142a9f26aab61b17e2c906a49e9a3d3c656d97608c8773785c3b13140/flashinfer_python-0.2.13.tar.gz (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.9), version: 0.2.13 2026-04-14T21:02:04,591 Found link https://files.pythonhosted.org/packages/ed/26/d1eac56b37d225cb3f84495bd897829dece21f62463487f3c1d9cafe78a0/flashinfer_python-0.2.14.tar.gz (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.9), version: 0.2.14 2026-04-14T21:02:04,592 Found link https://files.pythonhosted.org/packages/94/d4/4a2bf3d49f84b2d975925c1c024790b4e4768bdefbc5e27529d68368355a/flashinfer_python-0.2.14.post1.tar.gz (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.9), version: 0.2.14.post1 2026-04-14T21:02:04,593 Found link https://files.pythonhosted.org/packages/56/e3/7c0a4df2640a97ecfed45fe9110ecc6a67d4967278723abf8e6531b6bc1f/flashinfer_python-0.3.0rc1.tar.gz (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.9), version: 0.3.0rc1 2026-04-14T21:02:04,594 Found link https://files.pythonhosted.org/packages/1f/b4/5c4cbb0f3cbc5e8d4c19b3f163c048eed959a0ac0c603cfb3939a3079c52/flashinfer_python-0.3.0.tar.gz (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.9), version: 0.3.0 2026-04-14T21:02:04,595 Found link https://files.pythonhosted.org/packages/59/1b/83a9c58432b4a5d6ff04b97d4873bedfb5e35d38972ca8946b3acdbffeb4/flashinfer_python-0.3.0.post1.tar.gz (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.9), version: 0.3.0.post1 2026-04-14T21:02:04,596 Found link https://files.pythonhosted.org/packages/ba/71/dd3001b8be8174d90561764a5f3be4ca219517bde2841189ea6973a3873f/flashinfer_python-0.3.1.tar.gz (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.9), version: 0.3.1 2026-04-14T21:02:04,597 Found link https://files.pythonhosted.org/packages/49/a7/f5bd3878f94fc47e25ecc0828f910233022366f7e832dfa02f3617fad41f/flashinfer_python-0.3.1.post1.tar.gz (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.9), version: 0.3.1.post1 2026-04-14T21:02:04,598 Found link https://files.pythonhosted.org/packages/df/b4/f113bb950e5244d1c72c3d73c03fac0db939f085670e3a45a41fe92ffde0/flashinfer_python-0.4.0rc0.tar.gz (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.9), version: 0.4.0rc0 2026-04-14T21:02:04,599 Found link https://files.pythonhosted.org/packages/2e/a8/adceccda3aae01b7bdb5f99c68a2b401c58600f34a6386d9489ff736cdbc/flashinfer_python-0.4.0rc1.tar.gz (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.9), version: 0.4.0rc1 2026-04-14T21:02:04,600 Found link https://files.pythonhosted.org/packages/15/c0/5fb88fc273fed23dbf3b0ef0bffa7db26e2df24e016202df1b4e98b95879/flashinfer_python-0.4.0rc2.tar.gz (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.9), version: 0.4.0rc2 2026-04-14T21:02:04,602 Found link https://files.pythonhosted.org/packages/65/91/cf9e3a0a2626711bfab18ea4a4c739e0eb823e9513addc0e9e1b8f929538/flashinfer_python-0.4.0rc3.tar.gz (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.9), version: 0.4.0rc3 2026-04-14T21:02:04,603 Found link https://files.pythonhosted.org/packages/94/ec/bdcc0ec502994d544cbe69763d999458ae2deda67e58c1cb2d85867677c4/flashinfer_python-0.4.0rc4.tar.gz (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.9), version: 0.4.0rc4 2026-04-14T21:02:04,604 Found link https://files.pythonhosted.org/packages/08/29/f5609be182174e8c97124baeb90bb955fe05e2e1353776f48e226c153214/flashinfer_python-0.4.0.tar.gz (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.9), version: 0.4.0 2026-04-14T21:02:04,605 Found link https://files.pythonhosted.org/packages/64/cf/f82142abd7c819fb84a53f18fe1ac9e7cf1af8790b93c06dbf430001473b/flashinfer_python-0.4.1.tar.gz (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.9), version: 0.4.1 2026-04-14T21:02:04,606 Skipping link: No binaries permitted for flashinfer-python: https://files.pythonhosted.org/packages/c7/92/126dacc3476fab07478bdfc9944abd22aafa1000088d93bf86fb9ec78a29/flashinfer_python-0.5.0rc1-py3-none-any.whl (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.9) 2026-04-14T21:02:04,607 Found link https://files.pythonhosted.org/packages/53/47/a759f1ae9ef4ceb4e12895665b65dfacea2085494626e764627dd3548fa8/flashinfer_python-0.5.0rc1.tar.gz (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.9), version: 0.5.0rc1 2026-04-14T21:02:04,608 Skipping link: No binaries permitted for flashinfer-python: https://files.pythonhosted.org/packages/fb/aa/7b5d28c2aec11acfce18f2655d0b4614c7e34547fab218b4f2fd0d57bdce/flashinfer_python-0.5.0rc2-py3-none-any.whl (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.9) 2026-04-14T21:02:04,609 Found link https://files.pythonhosted.org/packages/3d/5a/58a7b60f79a1ac9c652b4055b06e88b5f57e8ef4c7dd4830ef48fa4cc265/flashinfer_python-0.5.0rc2.tar.gz (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.9), version: 0.5.0rc2 2026-04-14T21:02:04,609 Skipping link: No binaries permitted for flashinfer-python: https://files.pythonhosted.org/packages/5f/8f/7077cf0a44056a65045a793d6d55845d95818fb6455bfebb44ddea7e1f12/flashinfer_python-0.5.0rc3-py3-none-any.whl (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.9) 2026-04-14T21:02:04,610 Found link https://files.pythonhosted.org/packages/60/d1/8c90d6dfc95ab609028e9d541a6cdb3483f5c1475b07d97465ff3f0db14c/flashinfer_python-0.5.0rc3.tar.gz (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.9), version: 0.5.0rc3 2026-04-14T21:02:04,611 Skipping link: No binaries permitted for flashinfer-python: https://files.pythonhosted.org/packages/eb/8a/425b75b44ce5eeefe01dd61d4ee260b8e5f9dcf1a500d5f08d6cd4095d3a/flashinfer_python-0.5.0-py3-none-any.whl (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.9) 2026-04-14T21:02:04,612 Found link https://files.pythonhosted.org/packages/e3/1d/b82cd2606f4f0033e2fb28194dc3b04fd8101643e4ceb1d13fb1466cfd28/flashinfer_python-0.5.0.tar.gz (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.9), version: 0.5.0 2026-04-14T21:02:04,613 Skipping link: No binaries permitted for flashinfer-python: https://files.pythonhosted.org/packages/f4/f1/33dedad087a2bc3d66244126bd5d1c79721ea22d1f2124299f9e5bdaf3b1/flashinfer_python-0.5.1-py3-none-any.whl (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.9) 2026-04-14T21:02:04,614 Found link https://files.pythonhosted.org/packages/6c/bb/897c3b9d683dcf6490f70e468efb585eebcd673970b13a04ed947b491982/flashinfer_python-0.5.1.tar.gz (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.9), version: 0.5.1 2026-04-14T21:02:04,615 Skipping link: No binaries permitted for flashinfer-python: https://files.pythonhosted.org/packages/8d/0c/4a8ffbbc0d85e314f534cf5c32711f2af5d5e6e49225a5a414400a67b684/flashinfer_python-0.5.2-py3-none-any.whl (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.9) 2026-04-14T21:02:04,616 Found link https://files.pythonhosted.org/packages/d8/04/e357eaa50238e12c49e66fcf47f83e066e741ef19a117c136782b32eafbb/flashinfer_python-0.5.2.tar.gz (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.9), version: 0.5.2 2026-04-14T21:02:04,616 Skipping link: No binaries permitted for flashinfer-python: https://files.pythonhosted.org/packages/76/78/6dc7e7da8cb87c9965644ea0d2439457a1bc9256c45ceda0044595be4143/flashinfer_python-0.5.3-py3-none-any.whl (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.10) 2026-04-14T21:02:04,618 Found link https://files.pythonhosted.org/packages/b4/91/cca69baeff24bb3efd12c7479a026432c8717ee47193694010494c528b22/flashinfer_python-0.5.3.tar.gz (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.10), version: 0.5.3 2026-04-14T21:02:04,618 Skipping link: No binaries permitted for flashinfer-python: https://files.pythonhosted.org/packages/b2/0c/cb2d60eb86f0171451d676f17b90484ab66baf73c54cefe15c9a7c800739/flashinfer_python-0.6.0rc1-py3-none-any.whl (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.10) 2026-04-14T21:02:04,619 Found link https://files.pythonhosted.org/packages/53/2a/e855be4851ad6bfcebed929807fb541715f9a3a7d7b239b696e635b49d0e/flashinfer_python-0.6.0rc1.tar.gz (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.10), version: 0.6.0rc1 2026-04-14T21:02:04,620 Skipping link: No binaries permitted for flashinfer-python: https://files.pythonhosted.org/packages/05/22/9193f1da2468acec8ba99c4bee8aeacbda489777acf00b5871a73209acf7/flashinfer_python-0.6.0rc2-py3-none-any.whl (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.10) 2026-04-14T21:02:04,621 Found link https://files.pythonhosted.org/packages/1b/71/dd1bb86ea531e5c1a34f8ad851901bf2e2ce500618b5a4da19bd69f7de11/flashinfer_python-0.6.0rc2.tar.gz (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.10), version: 0.6.0rc2 2026-04-14T21:02:04,622 Skipping link: No binaries permitted for flashinfer-python: https://files.pythonhosted.org/packages/33/90/5834597488f5ea62b1cc874338125c79ce21c11d777ac6f7b47f12cf2bb3/flashinfer_python-0.6.0-py3-none-any.whl (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.10) 2026-04-14T21:02:04,622 Found link https://files.pythonhosted.org/packages/ad/8d/c7330f27f09b9110af2f6c44c6f68d7b536f525f8ac539210073bfcdb965/flashinfer_python-0.6.0.tar.gz (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.10), version: 0.6.0 2026-04-14T21:02:04,623 Skipping link: No binaries permitted for flashinfer-python: https://files.pythonhosted.org/packages/76/d5/bca632bb5781689415186421bbee2ad39ae8a39b0996d579c76901e5c66f/flashinfer_python-0.6.1-py3-none-any.whl (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.10) 2026-04-14T21:02:04,624 Found link https://files.pythonhosted.org/packages/68/81/5a84e14df7358d2c2903b18c6f2779bd4b4a6739076d01a847d4c18fb102/flashinfer_python-0.6.1.tar.gz (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.10), version: 0.6.1 2026-04-14T21:02:04,625 Skipping link: No binaries permitted for flashinfer-python: https://files.pythonhosted.org/packages/aa/c0/ee819d16f6b40e287727bb3db471f4eaa9e0372e233bf2f7343faaa3009f/flashinfer_python-0.6.2-py3-none-any.whl (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.10) 2026-04-14T21:02:04,626 Found link https://files.pythonhosted.org/packages/89/86/b25115177606ae3b6cec373d290798c28e185d033b66f6b80a89589e7786/flashinfer_python-0.6.2.tar.gz (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.10), version: 0.6.2 2026-04-14T21:02:04,627 Skipping link: No binaries permitted for flashinfer-python: https://files.pythonhosted.org/packages/33/13/2d95248101d8cb978db9000a4dceafb5b122484a694b53e84df1ac2a7b3d/flashinfer_python-0.6.3-py3-none-any.whl (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.10) 2026-04-14T21:02:04,628 Found link https://files.pythonhosted.org/packages/d6/aa/c564313b42dee7573da4ed0e441844f0c2bd827aecc9f29ea02c3838ffae/flashinfer_python-0.6.3.tar.gz (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.10), version: 0.6.3 2026-04-14T21:02:04,628 Skipping link: No binaries permitted for flashinfer-python: https://files.pythonhosted.org/packages/17/9a/d2bab76d2bb15062c6a2329614653e4f8bec9c78eec9069856ef0c7c0a79/flashinfer_python-0.6.4-py3-none-any.whl (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.10) 2026-04-14T21:02:04,629 Found link https://files.pythonhosted.org/packages/77/45/15645d2a4ee81d08206f3e132a77323e48312f510462415d7cd1122eba43/flashinfer_python-0.6.4.tar.gz (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.10), version: 0.6.4 2026-04-14T21:02:04,631 Skipping link: No binaries permitted for flashinfer-python: https://files.pythonhosted.org/packages/4f/83/eea2a74700b5fcae36ee2b748db9c3554a83a3f9e2dc4f3816369c5cb653/flashinfer_python-0.6.5-py3-none-any.whl (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.10) 2026-04-14T21:02:04,632 Found link https://files.pythonhosted.org/packages/e2/2f/5c52276af3cc40ac1f6eaf823ccd8e257f77e2fe5d465fa641ad3dba4d1b/flashinfer_python-0.6.5.tar.gz (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.10), version: 0.6.5 2026-04-14T21:02:04,632 Skipping link: No binaries permitted for flashinfer-python: https://files.pythonhosted.org/packages/e0/61/385d06755f3ab66333018285657adf0daf8a90a129448231fd09e315bd2e/flashinfer_python-0.6.6-py3-none-any.whl (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.10) 2026-04-14T21:02:04,633 Found link https://files.pythonhosted.org/packages/03/70/c5a235297351021f5d3d3233523a85f5a6468495587489ad2f257e8eafe2/flashinfer_python-0.6.6.tar.gz (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.10), version: 0.6.6 2026-04-14T21:02:04,634 Skipping link: No binaries permitted for flashinfer-python: https://files.pythonhosted.org/packages/f1/e8/91361a5f07667f36181cfd08e2d7d28be4cae2aa5a24016339174b308c38/flashinfer_python-0.6.7-py3-none-any.whl (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.10) 2026-04-14T21:02:04,635 Found link https://files.pythonhosted.org/packages/d9/2d/aa36fa1fee744c46fef99436baea5cda4a34244846c1df0fea97eaa9a856/flashinfer_python-0.6.7.tar.gz (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.10), version: 0.6.7 2026-04-14T21:02:04,635 Skipping link: No binaries permitted for flashinfer-python: https://files.pythonhosted.org/packages/16/92/516c79e5d8d1f0b41793e499c37a9299115ac8bc05171661b30d4a94beb8/flashinfer_python-0.6.7.post1-py3-none-any.whl (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.10) 2026-04-14T21:02:04,636 Found link https://files.pythonhosted.org/packages/60/6c/4b1a3d380c04306bde63412043e679d5a52d3da7feed91f1e9ba8ce8bc3f/flashinfer_python-0.6.7.post1.tar.gz (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.10), version: 0.6.7.post1 2026-04-14T21:02:04,637 Skipping link: No binaries permitted for flashinfer-python: https://files.pythonhosted.org/packages/62/9e/bf26a95bb219eb3d43cc6f3cd1dde6f560081fbcb50f846535c9f571a807/flashinfer_python-0.6.7.post2-py3-none-any.whl (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.10) 2026-04-14T21:02:04,638 Found link https://files.pythonhosted.org/packages/cc/95/81eafb78574312db79ef7144a4e77f2fee015343f413ef3000f279c8a118/flashinfer_python-0.6.7.post2.tar.gz (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.10), version: 0.6.7.post2 2026-04-14T21:02:04,639 Skipping link: No binaries permitted for flashinfer-python: https://files.pythonhosted.org/packages/01/6b/4117cd7cbeff07818ae7c6b8bf5a6d1ee3eed29356672b731b55af3d4453/flashinfer_python-0.6.7.post3-py3-none-any.whl (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.10) 2026-04-14T21:02:04,640 Found link https://files.pythonhosted.org/packages/12/b5/466778818d195b96a062467ee389d0fcfa51fdfecad4a831922916d4c48a/flashinfer_python-0.6.7.post3.tar.gz (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.10), version: 0.6.7.post3 2026-04-14T21:02:04,640 Skipping link: No binaries permitted for flashinfer-python: https://files.pythonhosted.org/packages/6a/0a/e8ae05fd59f800e74ec24fa6a58a04c6c0d9308917880c42f2b53cfe36bb/flashinfer_python-0.6.8rc1-py3-none-any.whl (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.10) 2026-04-14T21:02:04,642 Found link https://files.pythonhosted.org/packages/68/e1/67b0b5eb9f3ea23e05e7d454571ad7a186ede6a9c30fec55e51291bfa461/flashinfer_python-0.6.8rc1.tar.gz (from https://pypi.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.10), version: 0.6.8rc1 2026-04-14T21:02:04,643 Fetching project page and analyzing links: https://www.piwheels.org/simple/flashinfer-python/ 2026-04-14T21:02:04,643 Getting page https://www.piwheels.org/simple/flashinfer-python/ 2026-04-14T21:02:04,645 Found index url https://www.piwheels.org/simple 2026-04-14T21:02:04,815 Fetched page https://www.piwheels.org/simple/flashinfer-python/ as text/html 2026-04-14T21:02:04,825 Skipping link: No binaries permitted for flashinfer-python: https://archive1.piwheels.org/simple/flashinfer-python/flashinfer_python-0.6.7.post3-py3-none-any.whl#sha256=7a81720af5bdc04efcb67207f3867adb1b068f961d2e048e55baf32fb8e2cfc5 (from https://www.piwheels.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.10) 2026-04-14T21:02:04,825 Skipping link: No binaries permitted for flashinfer-python: https://archive1.piwheels.org/simple/flashinfer-python/flashinfer_python-0.6.7.post1-py3-none-any.whl#sha256=c9bf5183228f6636ddb26d7354f250af4b2385876527538a0ff7f94fd48207d2 (from https://www.piwheels.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.10) 2026-04-14T21:02:04,826 Skipping link: No binaries permitted for flashinfer-python: https://archive1.piwheels.org/simple/flashinfer-python/flashinfer_python-0.6.7-py3-none-any.whl#sha256=9b349825a2d26c3e4653c594d7a1d7b2126a43b29a4a70a6d48f3aaac23b96f3 (from https://www.piwheels.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.10) 2026-04-14T21:02:04,826 Skipping link: No binaries permitted for flashinfer-python: https://www.piwheels.org/simple/flashinfer-python/flashinfer_python-0.6.6-py3-none-any.whl#sha256=94791e01c31510c057b4decabff24cbc62466682667867e84214c62c45d9b343 (from https://www.piwheels.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.10) 2026-04-14T21:02:04,827 Skipping link: No binaries permitted for flashinfer-python: https://archive1.piwheels.org/simple/flashinfer-python/flashinfer_python-0.6.5-py3-none-any.whl#sha256=4b0a6c246959ca2dbc232fa1fe2f17ff857fd258de5dfacfa45033f21b6b7b93 (from https://www.piwheels.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.10) 2026-04-14T21:02:04,827 Skipping link: No binaries permitted for flashinfer-python: https://archive1.piwheels.org/simple/flashinfer-python/flashinfer_python-0.6.4-py3-none-any.whl#sha256=22ee7972266bb31ce1583330769efc0ecd001fb70371531ce4c77f2d6eda0d59 (from https://www.piwheels.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.10) 2026-04-14T21:02:04,828 Skipping link: No binaries permitted for flashinfer-python: https://archive1.piwheels.org/simple/flashinfer-python/flashinfer_python-0.6.3-py3-none-any.whl#sha256=ed3282188580afd663819924a772b2b531ac5bb88438bbe89d0baf67fe8c9fa5 (from https://www.piwheels.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.10) 2026-04-14T21:02:04,829 Skipping link: No binaries permitted for flashinfer-python: https://archive1.piwheels.org/simple/flashinfer-python/flashinfer_python-0.6.1-py3-none-any.whl#sha256=9e0e308062a81d4e4c462313bfe33edce7712309e8c89aed722065249e644833 (from https://www.piwheels.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.10) 2026-04-14T21:02:04,829 Skipping link: No binaries permitted for flashinfer-python: https://archive1.piwheels.org/simple/flashinfer-python/flashinfer_python-0.6.0-py3-none-any.whl#sha256=7ebc0582df714a933fc4c58ed4d12f4e61b4ad30b22b9155f290e96ee3eee3a0 (from https://www.piwheels.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.10) 2026-04-14T21:02:04,830 Skipping link: No binaries permitted for flashinfer-python: https://archive1.piwheels.org/simple/flashinfer-python/flashinfer_python-0.6.0rc2-py3-none-any.whl#sha256=63057b7ee43a4f6764c6ed8fe4c4c6de5a94da058fe0975bf279db0567c26204 (from https://www.piwheels.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.10) 2026-04-14T21:02:04,830 Skipping link: No binaries permitted for flashinfer-python: https://archive1.piwheels.org/simple/flashinfer-python/flashinfer_python-0.6.0rc1-py3-none-any.whl#sha256=e30a125bf89f8155f83aca80e5fb88a3d81224225485ce70f0f4c4c3a27da92c (from https://www.piwheels.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.10) 2026-04-14T21:02:04,831 Skipping link: No binaries permitted for flashinfer-python: https://archive1.piwheels.org/simple/flashinfer-python/flashinfer_python-0.5.3-py3-none-any.whl#sha256=1de562233dfbd8de835c2eb757275a7759eda034460093c1eb9ff3c7d5c0845d (from https://www.piwheels.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.10) 2026-04-14T21:02:04,831 Skipping link: No binaries permitted for flashinfer-python: https://archive1.piwheels.org/simple/flashinfer-python/flashinfer_python-0.5.2-py3-none-any.whl#sha256=bd3d206d1243bee523cf6cda27e0219e8fdf9026ade2e32045c8d9d4b7f7bf7a (from https://www.piwheels.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.9) 2026-04-14T21:02:04,832 Skipping link: No binaries permitted for flashinfer-python: https://archive1.piwheels.org/simple/flashinfer-python/flashinfer_python-0.5.1-py3-none-any.whl#sha256=8d73e4b66b7eb7fc4500f7f7e61aa194efebc769e7da1635a86506c97bf6fa0d (from https://www.piwheels.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.9) 2026-04-14T21:02:04,832 Skipping link: No binaries permitted for flashinfer-python: https://archive1.piwheels.org/simple/flashinfer-python/flashinfer_python-0.5.0-py3-none-any.whl#sha256=ac991d1911cff4a7453f02d88922803e7ca794a0af1dceaa920e33b81c78f5c8 (from https://www.piwheels.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.9) 2026-04-14T21:02:04,833 Skipping link: No binaries permitted for flashinfer-python: https://archive1.piwheels.org/simple/flashinfer-python/flashinfer_python-0.5.0rc3-py3-none-any.whl#sha256=8799f4a93afc14042ac6f521f6fb682e4d62d738dc18a1e8798b7a2ba5b2e4ec (from https://www.piwheels.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.9) 2026-04-14T21:02:04,833 Skipping link: No binaries permitted for flashinfer-python: https://archive1.piwheels.org/simple/flashinfer-python/flashinfer_python-0.5.0rc2-py3-none-any.whl#sha256=4ee4d438c8c7fdc242a917c3f97076562f3c44411dcaceb4f7d29082c41c0f8c (from https://www.piwheels.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.9) 2026-04-14T21:02:04,834 Skipping link: No binaries permitted for flashinfer-python: https://archive1.piwheels.org/simple/flashinfer-python/flashinfer_python-0.5.0rc1-py3-none-any.whl#sha256=a9d675075f3cb79ac1b5cba9e8430496d3983127609dc780a117b2b44bdb025d (from https://www.piwheels.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.9) 2026-04-14T21:02:04,834 Skipping link: No binaries permitted for flashinfer-python: https://archive1.piwheels.org/simple/flashinfer-python/flashinfer_python-0.4.1-py3-none-any.whl#sha256=8fc8fc3233781e384689c5f202124ae7d266cb8dee14055cbb3c90fca530bf7f (from https://www.piwheels.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.9) 2026-04-14T21:02:04,835 Skipping link: No binaries permitted for flashinfer-python: https://archive1.piwheels.org/simple/flashinfer-python/flashinfer_python-0.4.0-py3-none-any.whl#sha256=da0141b2163f9703e49972728eeb502d45eda60c25529a460d0d0d61963eedb2 (from https://www.piwheels.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.9) 2026-04-14T21:02:04,835 Skipping link: No binaries permitted for flashinfer-python: https://archive1.piwheels.org/simple/flashinfer-python/flashinfer_python-0.2.5-py3-none-any.whl#sha256=cb2a17c3ea5f47f8129f6410e2892f30051e15665f2ae54db540c8677c187d31 (from https://www.piwheels.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.8) 2026-04-14T21:02:04,836 Skipping link: No binaries permitted for flashinfer-python: https://archive1.piwheels.org/simple/flashinfer-python/flashinfer_python-0.2.4-py3-none-any.whl#sha256=4a85bd6ac785f106f0ad9fe213abf42f96ab84ccd04aec3ab9acf76d47d2aa3f (from https://www.piwheels.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.8) 2026-04-14T21:02:04,836 Skipping link: No binaries permitted for flashinfer-python: https://archive1.piwheels.org/simple/flashinfer-python/flashinfer_python-0.2.3-py3-none-any.whl#sha256=b8ead688a4857a2b360c992fb46ae2930fc4c43b50a092b7e42a13b40ee195da (from https://www.piwheels.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.8) 2026-04-14T21:02:04,837 Skipping link: No binaries permitted for flashinfer-python: https://archive1.piwheels.org/simple/flashinfer-python/flashinfer_python-0.2.2.post1-py3-none-any.whl#sha256=0097a08376ae147084ea6bd0848fc2ea1764f524c510a48755aa8c63259b4466 (from https://www.piwheels.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.8) 2026-04-14T21:02:04,837 Skipping link: No binaries permitted for flashinfer-python: https://archive1.piwheels.org/simple/flashinfer-python/flashinfer_python-0.2.2-py3-none-any.whl#sha256=c109a340b7e60cb57d8c9ccec2c10e303a36b82a56ba8dcaaa0efbee2a48b97f (from https://www.piwheels.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.8) 2026-04-14T21:02:04,838 Skipping link: No binaries permitted for flashinfer-python: https://archive1.piwheels.org/simple/flashinfer-python/flashinfer_python-0.2.1.post2-py3-none-any.whl#sha256=dc91f387ba09e4df899238705ec37bbe3648395d828240b77db84378d1b91e9e (from https://www.piwheels.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.8) 2026-04-14T21:02:04,838 Skipping link: No binaries permitted for flashinfer-python: https://archive1.piwheels.org/simple/flashinfer-python/flashinfer_python-0.2.1.post1-py3-none-any.whl#sha256=a44b9d872cf2ba6812d3c0750d98ad01b73e9ccbede933c7eade01b6c27b6232 (from https://www.piwheels.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.8) 2026-04-14T21:02:04,839 Skipping link: No binaries permitted for flashinfer-python: https://archive1.piwheels.org/simple/flashinfer-python/flashinfer_python-0.2.1-py3-none-any.whl#sha256=e07427d9eff1b8d091b5837c3ffc4fe7885dbf01d271d7225f7a89a2e3925f27 (from https://www.piwheels.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.8) 2026-04-14T21:02:04,839 Skipping link: No binaries permitted for flashinfer-python: https://archive1.piwheels.org/simple/flashinfer-python/flashinfer_python-0.2.0.post2-py3-none-any.whl#sha256=52c20b84ef1e848dd49c726ffc27801df8acccb4038aea61a2d73fa685bf75f8 (from https://www.piwheels.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.8) 2026-04-14T21:02:04,840 Skipping link: No binaries permitted for flashinfer-python: https://archive1.piwheels.org/simple/flashinfer-python/flashinfer_python-0.2.0.post1-py3-none-any.whl#sha256=783c1039e0a7db0478a579d5cc54894def70ae601b1e5b90a3c3de2209334bf3 (from https://www.piwheels.org/simple/flashinfer-python/) (requires-python:<4.0,>=3.8) 2026-04-14T21:02:04,840 Skipping link: not a file: https://www.piwheels.org/simple/flashinfer-python/ 2026-04-14T21:02:04,841 Skipping link: not a file: https://pypi.org/simple/flashinfer-python/ 2026-04-14T21:02:04,870 Given no hashes to check 1 links for project 'flashinfer-python': discarding no candidates 2026-04-14T21:02:04,889 Collecting flashinfer-python==0.6.8rc1 2026-04-14T21:02:04,892 Created temporary directory: /tmp/pip-unpack-7hnp8mej 2026-04-14T21:02:05,133 Downloading flashinfer_python-0.6.8rc1.tar.gz (6.7 MB) 2026-04-14T21:02:12,566 Added flashinfer-python==0.6.8rc1 from https://files.pythonhosted.org/packages/68/e1/67b0b5eb9f3ea23e05e7d454571ad7a186ede6a9c30fec55e51291bfa461/flashinfer_python-0.6.8rc1.tar.gz to build tracker '/tmp/pip-build-tracker-vct4xkk1' 2026-04-14T21:02:12,574 Created temporary directory: /tmp/pip-build-env-s1pbafqb 2026-04-14T21:02:12,579 Installing build dependencies: started 2026-04-14T21:02:12,580 Running command pip subprocess to install build dependencies 2026-04-14T21:02:13,709 Using pip 23.0.1 from /usr/lib/python3/dist-packages/pip (python 3.11) 2026-04-14T21:02:14,134 DEPRECATION: --no-binary currently disables reading from the cache of locally built wheels. In the future --no-binary will not influence the wheel cache. pip 23.1 will enforce this behaviour change. A possible replacement is to use the --no-cache-dir option. You can use the flag --use-feature=no-binary-enable-wheel-cache to test the upcoming behaviour. Discussion can be found at https://github.com/pypa/pip/issues/11453 2026-04-14T21:02:14,158 Looking in indexes: https://pypi.org/simple, https://www.piwheels.org/simple 2026-04-14T21:02:15,905 Collecting setuptools>=77 2026-04-14T21:02:16,049 Using cached https://www.piwheels.org/simple/setuptools/setuptools-82.0.1-py3-none-any.whl (1.0 MB) 2026-04-14T21:02:16,337 Collecting packaging>=24 2026-04-14T21:02:16,387 Using cached https://www.piwheels.org/simple/packaging/packaging-26.0-py3-none-any.whl (74 kB) 2026-04-14T21:02:17,127 Collecting apache-tvm-ffi!=0.1.8,!=0.1.8.post0,<0.2,>=0.1.6 2026-04-14T21:02:17,332 Downloading https://archive1.piwheels.org/simple/apache-tvm-ffi/apache_tvm_ffi-0.1.10-cp311-cp311-linux_armv7l.whl (2.6 MB) 2026-04-14T21:02:17,537 ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 2.6/2.6 MB 13.0 MB/s eta 0:00:00 2026-04-14T21:02:17,769 Collecting typing-extensions>=4.5 2026-04-14T21:02:17,788 Using cached https://www.piwheels.org/simple/typing-extensions/typing_extensions-4.15.0-py3-none-any.whl (44 kB) 2026-04-14T21:02:21,003 Installing collected packages: typing-extensions, setuptools, packaging, apache-tvm-ffi 2026-04-14T21:02:25,360 Creating /tmp/pip-build-env-s1pbafqb/overlay/local/bin 2026-04-14T21:02:25,362 changing mode of /tmp/pip-build-env-s1pbafqb/overlay/local/bin/tvm-ffi-config to 755 2026-04-14T21:02:25,364 changing mode of /tmp/pip-build-env-s1pbafqb/overlay/local/bin/tvm-ffi-stubgen to 755 2026-04-14T21:02:25,397 Successfully installed apache-tvm-ffi-0.1.10 packaging-26.0 setuptools-82.0.1 typing-extensions-4.15.0 2026-04-14T21:02:25,703 Installing build dependencies: finished with status 'done' 2026-04-14T21:02:25,710 Getting requirements to build wheel: started 2026-04-14T21:02:25,711 Running command Getting requirements to build wheel 2026-04-14T21:02:31,565 Build metadata file already exists (not in git repo), keeping it 2026-04-14T21:02:31,634 Getting requirements to build wheel: finished with status 'done' 2026-04-14T21:02:31,637 Created temporary directory: /tmp/pip-modern-metadata-uq2lly40 2026-04-14T21:02:31,640 Preparing metadata (pyproject.toml): started 2026-04-14T21:02:31,641 Running command Preparing metadata (pyproject.toml) 2026-04-14T21:02:38,342 /tmp/pip-build-env-s1pbafqb/overlay/local/lib/python3.11/dist-packages/setuptools/dist.py:483: SetuptoolsDeprecationWarning: Cannot find any files for the given pattern. 2026-04-14T21:02:38,342 !! 2026-04-14T21:02:38,344 ******************************************************************************** 2026-04-14T21:02:38,344 Pattern 'LICENSE*.txt' did not match any files. 2026-04-14T21:02:38,345 By 2027-Feb-18, you need to update your project and remove deprecated calls 2026-04-14T21:02:38,346 or your builds will no longer be supported. 2026-04-14T21:02:38,346 ******************************************************************************** 2026-04-14T21:02:38,347 !! 2026-04-14T21:02:38,348 for path in sorted(cls._find_pattern(pattern, enforce_match)) 2026-04-14T21:02:38,351 Build metadata file already exists (not in git repo), keeping it 2026-04-14T21:02:38,352 running dist_info 2026-04-14T21:02:38,364 creating /tmp/pip-modern-metadata-uq2lly40/flashinfer_python.egg-info 2026-04-14T21:02:38,365 writing /tmp/pip-modern-metadata-uq2lly40/flashinfer_python.egg-info/PKG-INFO 2026-04-14T21:02:38,370 writing dependency_links to /tmp/pip-modern-metadata-uq2lly40/flashinfer_python.egg-info/dependency_links.txt 2026-04-14T21:02:38,372 writing entry points to /tmp/pip-modern-metadata-uq2lly40/flashinfer_python.egg-info/entry_points.txt 2026-04-14T21:02:38,375 writing requirements to /tmp/pip-modern-metadata-uq2lly40/flashinfer_python.egg-info/requires.txt 2026-04-14T21:02:38,376 writing top-level names to /tmp/pip-modern-metadata-uq2lly40/flashinfer_python.egg-info/top_level.txt 2026-04-14T21:02:38,377 writing manifest file '/tmp/pip-modern-metadata-uq2lly40/flashinfer_python.egg-info/SOURCES.txt' 2026-04-14T21:02:39,228 reading manifest file '/tmp/pip-modern-metadata-uq2lly40/flashinfer_python.egg-info/SOURCES.txt' 2026-04-14T21:02:39,230 adding license file 'LICENSE' 2026-04-14T21:02:39,306 writing manifest file '/tmp/pip-modern-metadata-uq2lly40/flashinfer_python.egg-info/SOURCES.txt' 2026-04-14T21:02:39,311 creating '/tmp/pip-modern-metadata-uq2lly40/flashinfer_python-0.6.8rc1.dist-info' 2026-04-14T21:02:39,442 Preparing metadata (pyproject.toml): finished with status 'done' 2026-04-14T21:02:39,448 Source in /tmp/pip-wheel-wsede60p/flashinfer-python_f8f2f3ab42c149df84d896df3a06bb33 has version 0.6.8rc1, which satisfies requirement flashinfer-python==0.6.8rc1 from https://files.pythonhosted.org/packages/68/e1/67b0b5eb9f3ea23e05e7d454571ad7a186ede6a9c30fec55e51291bfa461/flashinfer_python-0.6.8rc1.tar.gz 2026-04-14T21:02:39,449 Removed flashinfer-python==0.6.8rc1 from https://files.pythonhosted.org/packages/68/e1/67b0b5eb9f3ea23e05e7d454571ad7a186ede6a9c30fec55e51291bfa461/flashinfer_python-0.6.8rc1.tar.gz from build tracker '/tmp/pip-build-tracker-vct4xkk1' 2026-04-14T21:02:39,455 Created temporary directory: /tmp/pip-unpack-ykbi4ur1 2026-04-14T21:02:39,456 Building wheels for collected packages: flashinfer-python 2026-04-14T21:02:39,461 Created temporary directory: /tmp/pip-wheel-wd01vk05 2026-04-14T21:02:39,461 Destination directory: /tmp/pip-wheel-wd01vk05 2026-04-14T21:02:39,464 Building wheel for flashinfer-python (pyproject.toml): started 2026-04-14T21:02:39,465 Running command Building wheel for flashinfer-python (pyproject.toml) 2026-04-14T21:02:45,402 /tmp/pip-build-env-s1pbafqb/overlay/local/lib/python3.11/dist-packages/setuptools/dist.py:483: SetuptoolsDeprecationWarning: Cannot find any files for the given pattern. 2026-04-14T21:02:45,403 !! 2026-04-14T21:02:45,404 ******************************************************************************** 2026-04-14T21:02:45,405 Pattern 'LICENSE*.txt' did not match any files. 2026-04-14T21:02:45,406 By 2027-Feb-18, you need to update your project and remove deprecated calls 2026-04-14T21:02:45,406 or your builds will no longer be supported. 2026-04-14T21:02:45,407 ******************************************************************************** 2026-04-14T21:02:45,408 !! 2026-04-14T21:02:45,409 for path in sorted(cls._find_pattern(pattern, enforce_match)) 2026-04-14T21:02:45,409 Build metadata file already exists (not in git repo), keeping it 2026-04-14T21:02:45,410 running bdist_wheel 2026-04-14T21:02:45,429 running build 2026-04-14T21:02:45,430 running build_py 2026-04-14T21:02:45,436 creating build/lib 2026-04-14T21:02:45,438 copying build_backend.py -> build/lib 2026-04-14T21:02:45,440 copying build_utils.py -> build/lib 2026-04-14T21:02:45,443 creating build/lib/flashinfer 2026-04-14T21:02:45,444 copying flashinfer/sampling.py -> build/lib/flashinfer 2026-04-14T21:02:45,448 copying flashinfer/compilation_context.py -> build/lib/flashinfer 2026-04-14T21:02:45,450 copying flashinfer/pod.py -> build/lib/flashinfer 2026-04-14T21:02:45,453 copying flashinfer/gdn_prefill.py -> build/lib/flashinfer 2026-04-14T21:02:45,455 copying flashinfer/__init__.py -> build/lib/flashinfer 2026-04-14T21:02:45,458 copying flashinfer/gdn_decode.py -> build/lib/flashinfer 2026-04-14T21:02:45,460 copying flashinfer/decode.py -> build/lib/flashinfer 2026-04-14T21:02:45,465 copying flashinfer/fp8_quantization.py -> build/lib/flashinfer 2026-04-14T21:02:45,467 copying flashinfer/version.py -> build/lib/flashinfer 2026-04-14T21:02:45,469 copying flashinfer/_build_meta.py -> build/lib/flashinfer 2026-04-14T21:02:45,470 copying flashinfer/deep_gemm.py -> build/lib/flashinfer 2026-04-14T21:02:45,473 copying flashinfer/sparse.py -> build/lib/flashinfer 2026-04-14T21:02:45,476 copying flashinfer/xqa.py -> build/lib/flashinfer 2026-04-14T21:02:45,478 copying flashinfer/tllm_enums.py -> build/lib/flashinfer 2026-04-14T21:02:45,481 copying flashinfer/trtllm_low_latency_gemm.py -> build/lib/flashinfer 2026-04-14T21:02:45,483 copying flashinfer/autotuner.py -> build/lib/flashinfer 2026-04-14T21:02:45,486 copying flashinfer/cuda_utils.py -> build/lib/flashinfer 2026-04-14T21:02:45,488 copying flashinfer/fp4_quantization.py -> build/lib/flashinfer 2026-04-14T21:02:45,490 copying flashinfer/prefill.py -> build/lib/flashinfer 2026-04-14T21:02:45,496 copying flashinfer/concat_ops.py -> build/lib/flashinfer 2026-04-14T21:02:45,498 copying flashinfer/page.py -> build/lib/flashinfer 2026-04-14T21:02:45,500 copying flashinfer/rope.py -> build/lib/flashinfer 2026-04-14T21:02:45,503 copying flashinfer/activation.py -> build/lib/flashinfer 2026-04-14T21:02:45,505 copying flashinfer/artifacts.py -> build/lib/flashinfer 2026-04-14T21:02:45,507 copying flashinfer/topk.py -> build/lib/flashinfer 2026-04-14T21:02:45,510 copying flashinfer/green_ctx.py -> build/lib/flashinfer 2026-04-14T21:02:45,512 copying flashinfer/cascade.py -> build/lib/flashinfer 2026-04-14T21:02:45,515 copying flashinfer/aot.py -> build/lib/flashinfer 2026-04-14T21:02:45,518 copying flashinfer/__main__.py -> build/lib/flashinfer 2026-04-14T21:02:45,520 copying flashinfer/api_logging.py -> build/lib/flashinfer 2026-04-14T21:02:45,523 copying flashinfer/attention.py -> build/lib/flashinfer 2026-04-14T21:02:45,525 copying flashinfer/tllm_utils.py -> build/lib/flashinfer 2026-04-14T21:02:45,527 copying flashinfer/utils.py -> build/lib/flashinfer 2026-04-14T21:02:45,530 creating build/lib/flashinfer/cudnn 2026-04-14T21:02:45,531 copying flashinfer/cudnn/__init__.py -> build/lib/flashinfer/cudnn 2026-04-14T21:02:45,534 copying flashinfer/cudnn/decode.py -> build/lib/flashinfer/cudnn 2026-04-14T21:02:45,536 copying flashinfer/cudnn/prefill.py -> build/lib/flashinfer/cudnn 2026-04-14T21:02:45,539 copying flashinfer/cudnn/utils.py -> build/lib/flashinfer/cudnn 2026-04-14T21:02:45,542 creating build/lib/flashinfer/norm 2026-04-14T21:02:45,543 copying flashinfer/norm/__init__.py -> build/lib/flashinfer/norm 2026-04-14T21:02:45,546 copying flashinfer/norm/utils.py -> build/lib/flashinfer/norm 2026-04-14T21:02:45,549 creating build/lib/flashinfer/logits_processor 2026-04-14T21:02:45,550 copying flashinfer/logits_processor/pipeline.py -> build/lib/flashinfer/logits_processor 2026-04-14T21:02:45,552 copying flashinfer/logits_processor/legalization.py -> build/lib/flashinfer/logits_processor 2026-04-14T21:02:45,554 copying flashinfer/logits_processor/fusion_rules.py -> build/lib/flashinfer/logits_processor 2026-04-14T21:02:45,556 copying flashinfer/logits_processor/__init__.py -> build/lib/flashinfer/logits_processor 2026-04-14T21:02:45,558 copying flashinfer/logits_processor/operators.py -> build/lib/flashinfer/logits_processor 2026-04-14T21:02:45,561 copying flashinfer/logits_processor/validators.py -> build/lib/flashinfer/logits_processor 2026-04-14T21:02:45,563 copying flashinfer/logits_processor/op.py -> build/lib/flashinfer/logits_processor 2026-04-14T21:02:45,565 copying flashinfer/logits_processor/compiler.py -> build/lib/flashinfer/logits_processor 2026-04-14T21:02:45,567 copying flashinfer/logits_processor/processors.py -> build/lib/flashinfer/logits_processor 2026-04-14T21:02:45,570 copying flashinfer/logits_processor/types.py -> build/lib/flashinfer/logits_processor 2026-04-14T21:02:45,572 creating build/lib/flashinfer/fused_moe 2026-04-14T21:02:45,573 copying flashinfer/fused_moe/__init__.py -> build/lib/flashinfer/fused_moe 2026-04-14T21:02:45,576 copying flashinfer/fused_moe/fused_routing_dsv3.py -> build/lib/flashinfer/fused_moe 2026-04-14T21:02:45,578 copying flashinfer/fused_moe/core.py -> build/lib/flashinfer/fused_moe 2026-04-14T21:02:45,582 copying flashinfer/fused_moe/utils.py -> build/lib/flashinfer/fused_moe 2026-04-14T21:02:45,585 creating build/lib/flashinfer/cute_dsl 2026-04-14T21:02:45,586 copying flashinfer/cute_dsl/__init__.py -> build/lib/flashinfer/cute_dsl 2026-04-14T21:02:45,588 copying flashinfer/cute_dsl/rmsnorm_fp4quant.py -> build/lib/flashinfer/cute_dsl 2026-04-14T21:02:45,591 copying flashinfer/cute_dsl/gemm_allreduce_two_shot.py -> build/lib/flashinfer/cute_dsl 2026-04-14T21:02:45,595 copying flashinfer/cute_dsl/blockscaled_gemm.py -> build/lib/flashinfer/cute_dsl 2026-04-14T21:02:45,597 copying flashinfer/cute_dsl/fp4_common.py -> build/lib/flashinfer/cute_dsl 2026-04-14T21:02:45,600 copying flashinfer/cute_dsl/add_rmsnorm_fp4quant.py -> build/lib/flashinfer/cute_dsl 2026-04-14T21:02:45,603 copying flashinfer/cute_dsl/utils.py -> build/lib/flashinfer/cute_dsl 2026-04-14T21:02:45,606 creating build/lib/flashinfer/gdn_kernels 2026-04-14T21:02:45,607 copying flashinfer/gdn_kernels/__init__.py -> build/lib/flashinfer/gdn_kernels 2026-04-14T21:02:45,610 copying flashinfer/gdn_kernels/gdn_decode_nontranspose.py -> build/lib/flashinfer/gdn_kernels 2026-04-14T21:02:45,613 copying flashinfer/gdn_kernels/gdn_decode_pretranspose.py -> build/lib/flashinfer/gdn_kernels 2026-04-14T21:02:45,616 copying flashinfer/gdn_kernels/gdn_decode_bf16_state.py -> build/lib/flashinfer/gdn_kernels 2026-04-14T21:02:45,620 copying flashinfer/gdn_kernels/gdn_decode_mtp.py -> build/lib/flashinfer/gdn_kernels 2026-04-14T21:02:45,626 creating build/lib/flashinfer/testing 2026-04-14T21:02:45,627 copying flashinfer/testing/__init__.py -> build/lib/flashinfer/testing 2026-04-14T21:02:45,629 copying flashinfer/testing/utils.py -> build/lib/flashinfer/testing 2026-04-14T21:02:45,633 creating build/lib/flashinfer/tuning_configs 2026-04-14T21:02:45,634 copying flashinfer/tuning_configs/v0_1_trtllm_fused_moe_NVIDIA_B200.py -> build/lib/flashinfer/tuning_configs 2026-04-14T21:02:45,637 copying flashinfer/tuning_configs/v0_1_trtllm_fused_moe_NVIDIA_GB200.py -> build/lib/flashinfer/tuning_configs 2026-04-14T21:02:45,640 creating build/lib/flashinfer/data 2026-04-14T21:02:45,641 copying ./build_backend.py -> build/lib/flashinfer/data 2026-04-14T21:02:45,643 copying ./build_utils.py -> build/lib/flashinfer/data 2026-04-14T21:02:45,646 creating build/lib/flashinfer/gemm 2026-04-14T21:02:45,647 copying flashinfer/gemm/__init__.py -> build/lib/flashinfer/gemm 2026-04-14T21:02:45,649 copying flashinfer/gemm/gemm_base.py -> build/lib/flashinfer/gemm 2026-04-14T21:02:45,656 copying flashinfer/gemm/routergemm.py -> build/lib/flashinfer/gemm 2026-04-14T21:02:45,659 creating build/lib/flashinfer/profiler 2026-04-14T21:02:45,660 copying flashinfer/profiler/__init__.py -> build/lib/flashinfer/profiler 2026-04-14T21:02:45,663 creating build/lib/flashinfer/comm 2026-04-14T21:02:45,664 copying flashinfer/comm/allreduce.py -> build/lib/flashinfer/comm 2026-04-14T21:02:45,667 copying flashinfer/comm/dlpack_utils.py -> build/lib/flashinfer/comm 2026-04-14T21:02:45,669 copying flashinfer/comm/mapping.py -> build/lib/flashinfer/comm 2026-04-14T21:02:45,672 copying flashinfer/comm/trtllm_moe_alltoall.py -> build/lib/flashinfer/comm 2026-04-14T21:02:45,675 copying flashinfer/comm/__init__.py -> build/lib/flashinfer/comm 2026-04-14T21:02:45,677 copying flashinfer/comm/vllm_ar.py -> build/lib/flashinfer/comm 2026-04-14T21:02:45,679 copying flashinfer/comm/workspace_base.py -> build/lib/flashinfer/comm 2026-04-14T21:02:45,681 copying flashinfer/comm/trtllm_ar.py -> build/lib/flashinfer/comm 2026-04-14T21:02:45,684 copying flashinfer/comm/nvshmem.py -> build/lib/flashinfer/comm 2026-04-14T21:02:45,686 copying flashinfer/comm/mnnvl.py -> build/lib/flashinfer/comm 2026-04-14T21:02:45,689 copying flashinfer/comm/nvshmem_allreduce.py -> build/lib/flashinfer/comm 2026-04-14T21:02:45,692 copying flashinfer/comm/trtllm_mnnvl_ar.py -> build/lib/flashinfer/comm 2026-04-14T21:02:45,695 copying flashinfer/comm/cuda_ipc.py -> build/lib/flashinfer/comm 2026-04-14T21:02:45,697 copying flashinfer/comm/trtllm_alltoall.py -> build/lib/flashinfer/comm 2026-04-14T21:02:45,700 creating build/lib/flashinfer/triton 2026-04-14T21:02:45,701 copying flashinfer/triton/norm.py -> build/lib/flashinfer/triton 2026-04-14T21:02:45,703 copying flashinfer/triton/__init__.py -> build/lib/flashinfer/triton 2026-04-14T21:02:45,705 copying flashinfer/triton/gemm.py -> build/lib/flashinfer/triton 2026-04-14T21:02:45,707 copying flashinfer/triton/sm_constraint_gemm.py -> build/lib/flashinfer/triton 2026-04-14T21:02:45,709 copying flashinfer/triton/page.py -> build/lib/flashinfer/triton 2026-04-14T21:02:45,711 copying flashinfer/triton/activation.py -> build/lib/flashinfer/triton 2026-04-14T21:02:45,713 copying flashinfer/triton/cascade.py -> build/lib/flashinfer/triton 2026-04-14T21:02:45,715 copying flashinfer/triton/utils.py -> build/lib/flashinfer/triton 2026-04-14T21:02:45,717 creating build/lib/flashinfer/mla 2026-04-14T21:02:45,718 copying flashinfer/mla/__init__.py -> build/lib/flashinfer/mla 2026-04-14T21:02:45,720 copying flashinfer/mla/_core.py -> build/lib/flashinfer/mla 2026-04-14T21:02:45,723 creating build/lib/flashinfer/dsv3_ops 2026-04-14T21:02:45,724 copying flashinfer/dsv3_ops/__init__.py -> build/lib/flashinfer/dsv3_ops 2026-04-14T21:02:45,727 creating build/lib/flashinfer/mamba 2026-04-14T21:02:45,728 copying flashinfer/mamba/__init__.py -> build/lib/flashinfer/mamba 2026-04-14T21:02:45,730 copying flashinfer/mamba/ssd_tile_scheduler.py -> build/lib/flashinfer/mamba 2026-04-14T21:02:45,732 copying flashinfer/mamba/ssd_kernel.py -> build/lib/flashinfer/mamba 2026-04-14T21:02:45,737 copying flashinfer/mamba/selective_state_update.py -> build/lib/flashinfer/mamba 2026-04-14T21:02:45,739 copying flashinfer/mamba/ssd_combined.py -> build/lib/flashinfer/mamba 2026-04-14T21:02:45,743 creating build/lib/flashinfer/jit 2026-04-14T21:02:45,744 copying flashinfer/jit/sampling.py -> build/lib/flashinfer/jit 2026-04-14T21:02:45,746 copying flashinfer/jit/norm.py -> build/lib/flashinfer/jit 2026-04-14T21:02:45,748 copying flashinfer/jit/comm.py -> build/lib/flashinfer/jit 2026-04-14T21:02:45,750 copying flashinfer/jit/fp4_kv_quantization.py -> build/lib/flashinfer/jit 2026-04-14T21:02:45,752 copying flashinfer/jit/__init__.py -> build/lib/flashinfer/jit 2026-04-14T21:02:45,754 copying flashinfer/jit/fused_moe.py -> build/lib/flashinfer/jit 2026-04-14T21:02:45,757 copying flashinfer/jit/fp8_quantization.py -> build/lib/flashinfer/jit 2026-04-14T21:02:45,759 copying flashinfer/jit/mla.py -> build/lib/flashinfer/jit 2026-04-14T21:02:45,761 copying flashinfer/jit/quantization.py -> build/lib/flashinfer/jit 2026-04-14T21:02:45,763 copying flashinfer/jit/xqa.py -> build/lib/flashinfer/jit 2026-04-14T21:02:45,765 copying flashinfer/jit/gdn.py -> build/lib/flashinfer/jit 2026-04-14T21:02:45,767 copying flashinfer/jit/spdlog.py -> build/lib/flashinfer/jit 2026-04-14T21:02:45,769 copying flashinfer/jit/moe_utils.py -> build/lib/flashinfer/jit 2026-04-14T21:02:45,771 copying flashinfer/jit/fp4_quantization.py -> build/lib/flashinfer/jit 2026-04-14T21:02:45,773 copying flashinfer/jit/cpp_ext.py -> build/lib/flashinfer/jit 2026-04-14T21:02:45,776 copying flashinfer/jit/page.py -> build/lib/flashinfer/jit 2026-04-14T21:02:45,778 copying flashinfer/jit/fp4_kv_dequantization.py -> build/lib/flashinfer/jit 2026-04-14T21:02:45,779 copying flashinfer/jit/rope.py -> build/lib/flashinfer/jit 2026-04-14T21:02:45,781 copying flashinfer/jit/activation.py -> build/lib/flashinfer/jit 2026-04-14T21:02:45,784 copying flashinfer/jit/env.py -> build/lib/flashinfer/jit 2026-04-14T21:02:45,786 copying flashinfer/jit/topk.py -> build/lib/flashinfer/jit 2026-04-14T21:02:45,788 copying flashinfer/jit/cascade.py -> build/lib/flashinfer/jit 2026-04-14T21:02:45,790 copying flashinfer/jit/tinygemm2.py -> build/lib/flashinfer/jit 2026-04-14T21:02:45,792 copying flashinfer/jit/core.py -> build/lib/flashinfer/jit 2026-04-14T21:02:45,795 copying flashinfer/jit/cubin_loader.py -> build/lib/flashinfer/jit 2026-04-14T21:02:45,797 copying flashinfer/jit/tllm_utils.py -> build/lib/flashinfer/jit 2026-04-14T21:02:45,799 copying flashinfer/jit/dsv3_optimizations.py -> build/lib/flashinfer/jit 2026-04-14T21:02:45,801 copying flashinfer/jit/rmsnorm_silu.py -> build/lib/flashinfer/jit 2026-04-14T21:02:45,804 copying flashinfer/jit/utils.py -> build/lib/flashinfer/jit 2026-04-14T21:02:45,807 creating build/lib/flashinfer/quantization 2026-04-14T21:02:45,808 copying flashinfer/quantization/__init__.py -> build/lib/flashinfer/quantization 2026-04-14T21:02:45,811 copying flashinfer/quantization/fp8_quantization.py -> build/lib/flashinfer/quantization 2026-04-14T21:02:45,813 copying flashinfer/quantization/fp4_quantization.py -> build/lib/flashinfer/quantization 2026-04-14T21:02:45,817 copying flashinfer/quantization/packbits.py -> build/lib/flashinfer/quantization 2026-04-14T21:02:45,820 copying flashinfer/quantization/quantization_cute_dsl_utils.py -> build/lib/flashinfer/quantization 2026-04-14T21:02:45,824 creating build/lib/flashinfer/norm/kernels 2026-04-14T21:02:45,825 copying flashinfer/norm/kernels/__init__.py -> build/lib/flashinfer/norm/kernels 2026-04-14T21:02:45,827 copying flashinfer/norm/kernels/layernorm.py -> build/lib/flashinfer/norm/kernels 2026-04-14T21:02:45,830 copying flashinfer/norm/kernels/fused_add_rmsnorm.py -> build/lib/flashinfer/norm/kernels 2026-04-14T21:02:45,834 copying flashinfer/norm/kernels/rmsnorm.py -> build/lib/flashinfer/norm/kernels 2026-04-14T21:02:45,838 creating build/lib/flashinfer/fused_moe/cute_dsl 2026-04-14T21:02:45,839 copying flashinfer/fused_moe/cute_dsl/blockscaled_contiguous_grouped_gemm_finalize_fusion.py -> build/lib/flashinfer/fused_moe/cute_dsl 2026-04-14T21:02:45,842 copying flashinfer/fused_moe/cute_dsl/__init__.py -> build/lib/flashinfer/fused_moe/cute_dsl 2026-04-14T21:02:45,844 copying flashinfer/fused_moe/cute_dsl/fused_moe.py -> build/lib/flashinfer/fused_moe/cute_dsl 2026-04-14T21:02:45,848 copying flashinfer/fused_moe/cute_dsl/blockscaled_contiguous_gather_grouped_gemm_swiglu_fusion.py -> build/lib/flashinfer/fused_moe/cute_dsl 2026-04-14T21:02:45,851 copying flashinfer/fused_moe/cute_dsl/tuner.py -> build/lib/flashinfer/fused_moe/cute_dsl 2026-04-14T21:02:45,854 copying flashinfer/fused_moe/cute_dsl/moe_utils.py -> build/lib/flashinfer/fused_moe/cute_dsl 2026-04-14T21:02:45,857 creating build/lib/flashinfer/fused_moe/cute_dsl/blackwell 2026-04-14T21:02:45,859 copying flashinfer/fused_moe/cute_dsl/blackwell/blockscaled_contiguous_grouped_gemm_finalize_fusion.py -> build/lib/flashinfer/fused_moe/cute_dsl/blackwell 2026-04-14T21:02:45,864 copying flashinfer/fused_moe/cute_dsl/blackwell/__init__.py -> build/lib/flashinfer/fused_moe/cute_dsl/blackwell 2026-04-14T21:02:45,866 copying flashinfer/fused_moe/cute_dsl/blackwell/blockscaled_contiguous_gather_grouped_gemm_swiglu_fusion.py -> build/lib/flashinfer/fused_moe/cute_dsl/blackwell 2026-04-14T21:02:45,872 copying flashinfer/fused_moe/cute_dsl/blackwell/custom_pipeline.py -> build/lib/flashinfer/fused_moe/cute_dsl/blackwell 2026-04-14T21:02:45,874 copying flashinfer/fused_moe/cute_dsl/blackwell/utils.py -> build/lib/flashinfer/fused_moe/cute_dsl/blackwell 2026-04-14T21:02:45,878 creating build/lib/flashinfer/cute_dsl/attention 2026-04-14T21:02:45,879 copying flashinfer/cute_dsl/attention/pipeline_topology.py -> build/lib/flashinfer/cute_dsl/attention 2026-04-14T21:02:45,882 copying flashinfer/cute_dsl/attention/__init__.py -> build/lib/flashinfer/cute_dsl/attention 2026-04-14T21:02:45,885 copying flashinfer/cute_dsl/attention/tmem_layout.py -> build/lib/flashinfer/cute_dsl/attention 2026-04-14T21:02:45,888 copying flashinfer/cute_dsl/attention/mla_decode_fp8.py -> build/lib/flashinfer/cute_dsl/attention 2026-04-14T21:02:45,891 copying flashinfer/cute_dsl/attention/config.py -> build/lib/flashinfer/cute_dsl/attention 2026-04-14T21:02:45,894 copying flashinfer/cute_dsl/attention/mla_warp_schedule.py -> build/lib/flashinfer/cute_dsl/attention 2026-04-14T21:02:45,896 copying flashinfer/cute_dsl/attention/warp_schedule.py -> build/lib/flashinfer/cute_dsl/attention 2026-04-14T21:02:45,899 copying flashinfer/cute_dsl/attention/collective_builder.py -> build/lib/flashinfer/cute_dsl/attention 2026-04-14T21:02:45,902 copying flashinfer/cute_dsl/attention/mla_config.py -> build/lib/flashinfer/cute_dsl/attention 2026-04-14T21:02:45,905 copying flashinfer/cute_dsl/attention/prefill.py -> build/lib/flashinfer/cute_dsl/attention 2026-04-14T21:02:45,908 copying flashinfer/cute_dsl/attention/compat.py -> build/lib/flashinfer/cute_dsl/attention 2026-04-14T21:02:45,910 copying flashinfer/cute_dsl/attention/mla_decode.py -> build/lib/flashinfer/cute_dsl/attention 2026-04-14T21:02:45,913 copying flashinfer/cute_dsl/attention/mainloop_spec.py -> build/lib/flashinfer/cute_dsl/attention 2026-04-14T21:02:45,917 creating build/lib/flashinfer/cute_dsl/attention/roles 2026-04-14T21:02:45,918 copying flashinfer/cute_dsl/attention/roles/__init__.py -> build/lib/flashinfer/cute_dsl/attention/roles 2026-04-14T21:02:45,920 copying flashinfer/cute_dsl/attention/roles/softmax.py -> build/lib/flashinfer/cute_dsl/attention/roles 2026-04-14T21:02:45,923 copying flashinfer/cute_dsl/attention/roles/correction.py -> build/lib/flashinfer/cute_dsl/attention/roles 2026-04-14T21:02:45,927 copying flashinfer/cute_dsl/attention/roles/mla_mma.py -> build/lib/flashinfer/cute_dsl/attention/roles 2026-04-14T21:02:45,929 copying flashinfer/cute_dsl/attention/roles/softmax_math.py -> build/lib/flashinfer/cute_dsl/attention/roles 2026-04-14T21:02:45,932 copying flashinfer/cute_dsl/attention/roles/loader_tma.py -> build/lib/flashinfer/cute_dsl/attention/roles 2026-04-14T21:02:45,935 copying flashinfer/cute_dsl/attention/roles/epilogue.py -> build/lib/flashinfer/cute_dsl/attention/roles 2026-04-14T21:02:45,937 copying flashinfer/cute_dsl/attention/roles/mla_loader.py -> build/lib/flashinfer/cute_dsl/attention/roles 2026-04-14T21:02:45,940 copying flashinfer/cute_dsl/attention/roles/mla_pt_loader.py -> build/lib/flashinfer/cute_dsl/attention/roles 2026-04-14T21:02:45,943 copying flashinfer/cute_dsl/attention/roles/mla_correction.py -> build/lib/flashinfer/cute_dsl/attention/roles 2026-04-14T21:02:45,946 copying flashinfer/cute_dsl/attention/roles/mla_mma_fp8.py -> build/lib/flashinfer/cute_dsl/attention/roles 2026-04-14T21:02:45,949 copying flashinfer/cute_dsl/attention/roles/mla_compute.py -> build/lib/flashinfer/cute_dsl/attention/roles 2026-04-14T21:02:45,952 copying flashinfer/cute_dsl/attention/roles/mma.py -> build/lib/flashinfer/cute_dsl/attention/roles 2026-04-14T21:02:45,955 copying flashinfer/cute_dsl/attention/roles/mla_loader_fp8.py -> build/lib/flashinfer/cute_dsl/attention/roles 2026-04-14T21:02:45,959 creating build/lib/flashinfer/cute_dsl/attention/fusion 2026-04-14T21:02:45,960 copying flashinfer/cute_dsl/attention/fusion/__init__.py -> build/lib/flashinfer/cute_dsl/attention/fusion 2026-04-14T21:02:45,962 copying flashinfer/cute_dsl/attention/fusion/mask.py -> build/lib/flashinfer/cute_dsl/attention/fusion 2026-04-14T21:02:45,965 copying flashinfer/cute_dsl/attention/fusion/variant.py -> build/lib/flashinfer/cute_dsl/attention/fusion 2026-04-14T21:02:45,969 creating build/lib/flashinfer/cute_dsl/attention/wrappers 2026-04-14T21:02:45,970 copying flashinfer/cute_dsl/attention/wrappers/batch_prefill.py -> build/lib/flashinfer/cute_dsl/attention/wrappers 2026-04-14T21:02:45,973 copying flashinfer/cute_dsl/attention/wrappers/__init__.py -> build/lib/flashinfer/cute_dsl/attention/wrappers 2026-04-14T21:02:45,976 copying flashinfer/cute_dsl/attention/wrappers/batch_mla.py -> build/lib/flashinfer/cute_dsl/attention/wrappers 2026-04-14T21:02:45,979 creating build/lib/flashinfer/cute_dsl/attention/scheduler 2026-04-14T21:02:45,980 copying flashinfer/cute_dsl/attention/scheduler/mla_persistent.py -> build/lib/flashinfer/cute_dsl/attention/scheduler 2026-04-14T21:02:45,983 copying flashinfer/cute_dsl/attention/scheduler/__init__.py -> build/lib/flashinfer/cute_dsl/attention/scheduler 2026-04-14T21:02:45,985 copying flashinfer/cute_dsl/attention/scheduler/persistent.py -> build/lib/flashinfer/cute_dsl/attention/scheduler 2026-04-14T21:02:45,988 creating build/lib/flashinfer/gdn_kernels/blackwell 2026-04-14T21:02:45,989 copying flashinfer/gdn_kernels/blackwell/gated_delta_net_chunked.py -> build/lib/flashinfer/gdn_kernels/blackwell 2026-04-14T21:02:45,995 copying flashinfer/gdn_kernels/blackwell/gdn_prefill.py -> build/lib/flashinfer/gdn_kernels/blackwell 2026-04-14T21:02:45,997 copying flashinfer/gdn_kernels/blackwell/__init__.py -> build/lib/flashinfer/gdn_kernels/blackwell 2026-04-14T21:02:46,000 copying flashinfer/gdn_kernels/blackwell/gated_delta_net_tile_scheduler.py -> build/lib/flashinfer/gdn_kernels/blackwell 2026-04-14T21:02:46,005 creating build/lib/flashinfer/data/spdlog/scripts 2026-04-14T21:02:46,007 copying 3rdparty/spdlog/scripts/extract_version.py -> build/lib/flashinfer/data/spdlog/scripts 2026-04-14T21:02:46,012 creating build/lib/flashinfer/data/cutlass/python 2026-04-14T21:02:46,014 copying 3rdparty/cutlass/python/setup_pycute.py -> build/lib/flashinfer/data/cutlass/python 2026-04-14T21:02:46,016 copying 3rdparty/cutlass/python/setup_library.py -> build/lib/flashinfer/data/cutlass/python 2026-04-14T21:02:46,018 copying 3rdparty/cutlass/python/setup_cutlass.py -> build/lib/flashinfer/data/cutlass/python 2026-04-14T21:02:46,021 creating build/lib/flashinfer/data/cutlass/test/utils 2026-04-14T21:02:46,022 copying 3rdparty/cutlass/test/utils/test_sharding.py -> build/lib/flashinfer/data/cutlass/test/utils 2026-04-14T21:02:46,026 creating build/lib/flashinfer/data/cutlass/test/python/pycute 2026-04-14T21:02:46,028 copying 3rdparty/cutlass/test/python/pycute/test_complement.py -> build/lib/flashinfer/data/cutlass/test/python/pycute 2026-04-14T21:02:46,031 copying 3rdparty/cutlass/test/python/pycute/test_int_tuple.py -> build/lib/flashinfer/data/cutlass/test/python/pycute 2026-04-14T21:02:46,032 copying 3rdparty/cutlass/test/python/pycute/test_composition.py -> build/lib/flashinfer/data/cutlass/test/python/pycute 2026-04-14T21:02:46,034 copying 3rdparty/cutlass/test/python/pycute/test_typing.py -> build/lib/flashinfer/data/cutlass/test/python/pycute 2026-04-14T21:02:46,036 copying 3rdparty/cutlass/test/python/pycute/run_all_tests.py -> build/lib/flashinfer/data/cutlass/test/python/pycute 2026-04-14T21:02:46,038 copying 3rdparty/cutlass/test/python/pycute/test_left_inverse.py -> build/lib/flashinfer/data/cutlass/test/python/pycute 2026-04-14T21:02:46,039 copying 3rdparty/cutlass/test/python/pycute/test_right_inverse.py -> build/lib/flashinfer/data/cutlass/test/python/pycute 2026-04-14T21:02:46,041 copying 3rdparty/cutlass/test/python/pycute/test_coalesce.py -> build/lib/flashinfer/data/cutlass/test/python/pycute 2026-04-14T21:02:46,043 creating build/lib/flashinfer/data/cutlass/test/python/cutlass 2026-04-14T21:02:46,044 copying 3rdparty/cutlass/test/python/cutlass/installation.py -> build/lib/flashinfer/data/cutlass/test/python/cutlass 2026-04-14T21:02:46,047 creating build/lib/flashinfer/data/cutlass/test/python/cutlass/gemm 2026-04-14T21:02:46,048 copying 3rdparty/cutlass/test/python/cutlass/gemm/gemm_f32_sm80.py -> build/lib/flashinfer/data/cutlass/test/python/cutlass/gemm 2026-04-14T21:02:46,050 copying 3rdparty/cutlass/test/python/cutlass/gemm/gemm_f64_sm90.py -> build/lib/flashinfer/data/cutlass/test/python/cutlass/gemm 2026-04-14T21:02:46,052 copying 3rdparty/cutlass/test/python/cutlass/gemm/gemm_s8_sm80.py -> build/lib/flashinfer/data/cutlass/test/python/cutlass/gemm 2026-04-14T21:02:46,054 copying 3rdparty/cutlass/test/python/cutlass/gemm/gemm_s8_sm90.py -> build/lib/flashinfer/data/cutlass/test/python/cutlass/gemm 2026-04-14T21:02:46,056 copying 3rdparty/cutlass/test/python/cutlass/gemm/run_all_tests.py -> build/lib/flashinfer/data/cutlass/test/python/cutlass/gemm 2026-04-14T21:02:46,058 copying 3rdparty/cutlass/test/python/cutlass/gemm/gemm_mixed_sm80.py -> build/lib/flashinfer/data/cutlass/test/python/cutlass/gemm 2026-04-14T21:02:46,059 copying 3rdparty/cutlass/test/python/cutlass/gemm/gemm_batched.py -> build/lib/flashinfer/data/cutlass/test/python/cutlass/gemm 2026-04-14T21:02:46,061 copying 3rdparty/cutlass/test/python/cutlass/gemm/gemm_f16_sm90.py -> build/lib/flashinfer/data/cutlass/test/python/cutlass/gemm 2026-04-14T21:02:46,063 copying 3rdparty/cutlass/test/python/cutlass/gemm/gemm_f64_sm80.py -> build/lib/flashinfer/data/cutlass/test/python/cutlass/gemm 2026-04-14T21:02:46,065 copying 3rdparty/cutlass/test/python/cutlass/gemm/gemm_testbed.py -> build/lib/flashinfer/data/cutlass/test/python/cutlass/gemm 2026-04-14T21:02:46,068 copying 3rdparty/cutlass/test/python/cutlass/gemm/gemm_f8_sm90.py -> build/lib/flashinfer/data/cutlass/test/python/cutlass/gemm 2026-04-14T21:02:46,070 copying 3rdparty/cutlass/test/python/cutlass/gemm/gemm_f16_sm80.py -> build/lib/flashinfer/data/cutlass/test/python/cutlass/gemm 2026-04-14T21:02:46,072 copying 3rdparty/cutlass/test/python/cutlass/gemm/utils.py -> build/lib/flashinfer/data/cutlass/test/python/cutlass/gemm 2026-04-14T21:02:46,075 creating build/lib/flashinfer/data/cutlass/test/python/cutlass/interface 2026-04-14T21:02:46,076 copying 3rdparty/cutlass/test/python/cutlass/interface/gemm_interface.py -> build/lib/flashinfer/data/cutlass/test/python/cutlass/interface 2026-04-14T21:02:46,078 copying 3rdparty/cutlass/test/python/cutlass/interface/conv2d_interface.py -> build/lib/flashinfer/data/cutlass/test/python/cutlass/interface 2026-04-14T21:02:46,080 copying 3rdparty/cutlass/test/python/cutlass/interface/evt_interface.py -> build/lib/flashinfer/data/cutlass/test/python/cutlass/interface 2026-04-14T21:02:46,083 copying 3rdparty/cutlass/test/python/cutlass/interface/utils.py -> build/lib/flashinfer/data/cutlass/test/python/cutlass/interface 2026-04-14T21:02:46,085 creating build/lib/flashinfer/data/cutlass/test/python/cutlass/evt 2026-04-14T21:02:46,086 copying 3rdparty/cutlass/test/python/cutlass/evt/run_all_tests.py -> build/lib/flashinfer/data/cutlass/test/python/cutlass/evt 2026-04-14T21:02:46,088 copying 3rdparty/cutlass/test/python/cutlass/evt/evt_load_sm80_90.py -> build/lib/flashinfer/data/cutlass/test/python/cutlass/evt 2026-04-14T21:02:46,090 copying 3rdparty/cutlass/test/python/cutlass/evt/evt_store_sm80_90.py -> build/lib/flashinfer/data/cutlass/test/python/cutlass/evt 2026-04-14T21:02:46,092 copying 3rdparty/cutlass/test/python/cutlass/evt/evt_layout_sm80_90.py -> build/lib/flashinfer/data/cutlass/test/python/cutlass/evt 2026-04-14T21:02:46,094 copying 3rdparty/cutlass/test/python/cutlass/evt/evt_mixed_sm80_90.py -> build/lib/flashinfer/data/cutlass/test/python/cutlass/evt 2026-04-14T21:02:46,097 copying 3rdparty/cutlass/test/python/cutlass/evt/evt_compute_sm80_90.py -> build/lib/flashinfer/data/cutlass/test/python/cutlass/evt 2026-04-14T21:02:46,099 creating build/lib/flashinfer/data/cutlass/test/python/cutlass/emit 2026-04-14T21:02:46,100 copying 3rdparty/cutlass/test/python/cutlass/emit/pytorch.py -> build/lib/flashinfer/data/cutlass/test/python/cutlass/emit 2026-04-14T21:02:46,104 creating build/lib/flashinfer/data/cutlass/test/python/cutlass/conv2d 2026-04-14T21:02:46,105 copying 3rdparty/cutlass/test/python/cutlass/conv2d/conv2d_test_utils.py -> build/lib/flashinfer/data/cutlass/test/python/cutlass/conv2d 2026-04-14T21:02:46,108 copying 3rdparty/cutlass/test/python/cutlass/conv2d/conv2d_sm80.py -> build/lib/flashinfer/data/cutlass/test/python/cutlass/conv2d 2026-04-14T21:02:46,111 copying 3rdparty/cutlass/test/python/cutlass/conv2d/run_all_tests.py -> build/lib/flashinfer/data/cutlass/test/python/cutlass/conv2d 2026-04-14T21:02:46,113 copying 3rdparty/cutlass/test/python/cutlass/conv2d/conv2d_problem_sizes.py -> build/lib/flashinfer/data/cutlass/test/python/cutlass/conv2d 2026-04-14T21:02:46,117 creating build/lib/flashinfer/data/cutlass/test/python/cutlass/evt/utils 2026-04-14T21:02:46,118 copying 3rdparty/cutlass/test/python/cutlass/evt/utils/evt_testbed.py -> build/lib/flashinfer/data/cutlass/test/python/cutlass/evt/utils 2026-04-14T21:02:46,122 creating build/lib/flashinfer/data/cutlass/test/examples/CuTeDSL 2026-04-14T21:02:46,124 copying 3rdparty/cutlass/test/examples/CuTeDSL/conftest.py -> build/lib/flashinfer/data/cutlass/test/examples/CuTeDSL 2026-04-14T21:02:46,127 creating build/lib/flashinfer/data/cutlass/test/examples/CuTeDSL/sm_100a 2026-04-14T21:02:46,129 copying 3rdparty/cutlass/test/examples/CuTeDSL/sm_100a/conftest.py -> build/lib/flashinfer/data/cutlass/test/examples/CuTeDSL/sm_100a 2026-04-14T21:02:46,131 copying 3rdparty/cutlass/test/examples/CuTeDSL/sm_100a/test_rmsnorm.py -> build/lib/flashinfer/data/cutlass/test/examples/CuTeDSL/sm_100a 2026-04-14T21:02:46,133 copying 3rdparty/cutlass/test/examples/CuTeDSL/sm_100a/test_dense_gemm_persistent_prefetch.py -> build/lib/flashinfer/data/cutlass/test/examples/CuTeDSL/sm_100a 2026-04-14T21:02:46,136 copying 3rdparty/cutlass/test/examples/CuTeDSL/sm_100a/test_dense_blockscaled_gemm_persistent_prefetch.py -> build/lib/flashinfer/data/cutlass/test/examples/CuTeDSL/sm_100a 2026-04-14T21:02:46,139 copying 3rdparty/cutlass/test/examples/CuTeDSL/sm_100a/test_tutorial_gemm.py -> build/lib/flashinfer/data/cutlass/test/examples/CuTeDSL/sm_100a 2026-04-14T21:02:46,142 creating build/lib/flashinfer/data/cutlass/test/unit/gemm/device 2026-04-14T21:02:46,144 copying 3rdparty/cutlass/test/unit/gemm/device/simt_sm50.py -> build/lib/flashinfer/data/cutlass/test/unit/gemm/device 2026-04-14T21:02:46,148 creating build/lib/flashinfer/data/cutlass/python/cutlass_cppgen 2026-04-14T21:02:46,149 copying 3rdparty/cutlass/python/cutlass_cppgen/__init__.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen 2026-04-14T21:02:46,152 copying 3rdparty/cutlass/python/cutlass_cppgen/swizzle.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen 2026-04-14T21:02:46,154 copying 3rdparty/cutlass/python/cutlass_cppgen/shape.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen 2026-04-14T21:02:46,157 copying 3rdparty/cutlass/python/cutlass_cppgen/library_defaults.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen 2026-04-14T21:02:46,160 creating build/lib/flashinfer/data/cutlass/python/pycute 2026-04-14T21:02:46,162 copying 3rdparty/cutlass/python/pycute/__init__.py -> build/lib/flashinfer/data/cutlass/python/pycute 2026-04-14T21:02:46,164 copying 3rdparty/cutlass/python/pycute/swizzle.py -> build/lib/flashinfer/data/cutlass/python/pycute 2026-04-14T21:02:46,166 copying 3rdparty/cutlass/python/pycute/layout.py -> build/lib/flashinfer/data/cutlass/python/pycute 2026-04-14T21:02:46,169 copying 3rdparty/cutlass/python/pycute/typing.py -> build/lib/flashinfer/data/cutlass/python/pycute 2026-04-14T21:02:46,172 copying 3rdparty/cutlass/python/pycute/int_tuple.py -> build/lib/flashinfer/data/cutlass/python/pycute 2026-04-14T21:02:46,175 creating build/lib/flashinfer/data/cutlass/python/cutlass_library 2026-04-14T21:02:46,177 copying 3rdparty/cutlass/python/cutlass_library/generator.py -> build/lib/flashinfer/data/cutlass/python/cutlass_library 2026-04-14T21:02:46,190 copying 3rdparty/cutlass/python/cutlass_library/rank_k_operation.py -> build/lib/flashinfer/data/cutlass/python/cutlass_library 2026-04-14T21:02:46,193 copying 3rdparty/cutlass/python/cutlass_library/trmm_operation.py -> build/lib/flashinfer/data/cutlass/python/cutlass_library 2026-04-14T21:02:46,196 copying 3rdparty/cutlass/python/cutlass_library/emit_kernel_listing.py -> build/lib/flashinfer/data/cutlass/python/cutlass_library 2026-04-14T21:02:46,200 copying 3rdparty/cutlass/python/cutlass_library/__init__.py -> build/lib/flashinfer/data/cutlass/python/cutlass_library 2026-04-14T21:02:46,202 copying 3rdparty/cutlass/python/cutlass_library/conv3x_emitter.py -> build/lib/flashinfer/data/cutlass/python/cutlass_library 2026-04-14T21:02:46,204 copying 3rdparty/cutlass/python/cutlass_library/conv2d_operation.py -> build/lib/flashinfer/data/cutlass/python/cutlass_library 2026-04-14T21:02:46,207 copying 3rdparty/cutlass/python/cutlass_library/manifest.py -> build/lib/flashinfer/data/cutlass/python/cutlass_library 2026-04-14T21:02:46,210 copying 3rdparty/cutlass/python/cutlass_library/sm100_shapes.py -> build/lib/flashinfer/data/cutlass/python/cutlass_library 2026-04-14T21:02:46,212 copying 3rdparty/cutlass/python/cutlass_library/library.py -> build/lib/flashinfer/data/cutlass/python/cutlass_library 2026-04-14T21:02:46,215 copying 3rdparty/cutlass/python/cutlass_library/sm90_utils.py -> build/lib/flashinfer/data/cutlass/python/cutlass_library 2026-04-14T21:02:46,218 copying 3rdparty/cutlass/python/cutlass_library/heuristics.py -> build/lib/flashinfer/data/cutlass/python/cutlass_library 2026-04-14T21:02:46,220 copying 3rdparty/cutlass/python/cutlass_library/sm100_utils.py -> build/lib/flashinfer/data/cutlass/python/cutlass_library 2026-04-14T21:02:46,223 copying 3rdparty/cutlass/python/cutlass_library/gemm_operation.py -> build/lib/flashinfer/data/cutlass/python/cutlass_library 2026-04-14T21:02:46,227 copying 3rdparty/cutlass/python/cutlass_library/sm90_shapes.py -> build/lib/flashinfer/data/cutlass/python/cutlass_library 2026-04-14T21:02:46,229 copying 3rdparty/cutlass/python/cutlass_library/heuristics_provider.py -> build/lib/flashinfer/data/cutlass/python/cutlass_library 2026-04-14T21:02:46,231 copying 3rdparty/cutlass/python/cutlass_library/symm_operation.py -> build/lib/flashinfer/data/cutlass/python/cutlass_library 2026-04-14T21:02:46,233 copying 3rdparty/cutlass/python/cutlass_library/rank_2k_operation.py -> build/lib/flashinfer/data/cutlass/python/cutlass_library 2026-04-14T21:02:46,235 copying 3rdparty/cutlass/python/cutlass_library/conv3d_operation.py -> build/lib/flashinfer/data/cutlass/python/cutlass_library 2026-04-14T21:02:46,238 creating build/lib/flashinfer/data/cutlass/python/CuTeDSL 2026-04-14T21:02:46,239 copying 3rdparty/cutlass/python/CuTeDSL/prep_editable_install.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL 2026-04-14T21:02:46,242 creating build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/epilogue 2026-04-14T21:02:46,243 copying 3rdparty/cutlass/python/cutlass_cppgen/epilogue/__init__.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/epilogue 2026-04-14T21:02:46,245 copying 3rdparty/cutlass/python/cutlass_cppgen/epilogue/evt_ops.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/epilogue 2026-04-14T21:02:46,247 copying 3rdparty/cutlass/python/cutlass_cppgen/epilogue/epilogue.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/epilogue 2026-04-14T21:02:46,250 creating build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/utils 2026-04-14T21:02:46,251 copying 3rdparty/cutlass/python/cutlass_cppgen/utils/__init__.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/utils 2026-04-14T21:02:46,253 copying 3rdparty/cutlass/python/cutlass_cppgen/utils/profiler.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/utils 2026-04-14T21:02:46,255 copying 3rdparty/cutlass/python/cutlass_cppgen/utils/check.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/utils 2026-04-14T21:02:46,257 copying 3rdparty/cutlass/python/cutlass_cppgen/utils/lazy_import.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/utils 2026-04-14T21:02:46,259 copying 3rdparty/cutlass/python/cutlass_cppgen/utils/datatypes.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/utils 2026-04-14T21:02:46,262 creating build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend 2026-04-14T21:02:46,263 copying 3rdparty/cutlass/python/cutlass_cppgen/backend/__init__.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend 2026-04-14T21:02:46,265 copying 3rdparty/cutlass/python/cutlass_cppgen/backend/memory_manager.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend 2026-04-14T21:02:46,267 copying 3rdparty/cutlass/python/cutlass_cppgen/backend/operation.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend 2026-04-14T21:02:46,269 copying 3rdparty/cutlass/python/cutlass_cppgen/backend/conv2d_operation.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend 2026-04-14T21:02:46,272 copying 3rdparty/cutlass/python/cutlass_cppgen/backend/library.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend 2026-04-14T21:02:46,274 copying 3rdparty/cutlass/python/cutlass_cppgen/backend/epilogue.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend 2026-04-14T21:02:46,277 copying 3rdparty/cutlass/python/cutlass_cppgen/backend/compiler.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend 2026-04-14T21:02:46,279 copying 3rdparty/cutlass/python/cutlass_cppgen/backend/gemm_operation.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend 2026-04-14T21:02:46,283 copying 3rdparty/cutlass/python/cutlass_cppgen/backend/frontend.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend 2026-04-14T21:02:46,285 copying 3rdparty/cutlass/python/cutlass_cppgen/backend/c_types.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend 2026-04-14T21:02:46,287 copying 3rdparty/cutlass/python/cutlass_cppgen/backend/type_hint.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend 2026-04-14T21:02:46,289 copying 3rdparty/cutlass/python/cutlass_cppgen/backend/reduction_operation.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend 2026-04-14T21:02:46,292 copying 3rdparty/cutlass/python/cutlass_cppgen/backend/arguments.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend 2026-04-14T21:02:46,294 creating build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/op 2026-04-14T21:02:46,295 copying 3rdparty/cutlass/python/cutlass_cppgen/op/__init__.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/op 2026-04-14T21:02:46,297 copying 3rdparty/cutlass/python/cutlass_cppgen/op/gemm.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/op 2026-04-14T21:02:46,300 copying 3rdparty/cutlass/python/cutlass_cppgen/op/gemm_grouped.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/op 2026-04-14T21:02:46,302 copying 3rdparty/cutlass/python/cutlass_cppgen/op/op.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/op 2026-04-14T21:02:46,305 copying 3rdparty/cutlass/python/cutlass_cppgen/op/conv.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/op 2026-04-14T21:02:46,308 creating build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/emit 2026-04-14T21:02:46,309 copying 3rdparty/cutlass/python/cutlass_cppgen/emit/__init__.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/emit 2026-04-14T21:02:46,311 copying 3rdparty/cutlass/python/cutlass_cppgen/emit/pytorch.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/emit 2026-04-14T21:02:46,314 copying 3rdparty/cutlass/python/cutlass_cppgen/emit/common.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/emit 2026-04-14T21:02:46,317 creating build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/utils 2026-04-14T21:02:46,318 copying 3rdparty/cutlass/python/cutlass_cppgen/backend/utils/__init__.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/utils 2026-04-14T21:02:46,321 copying 3rdparty/cutlass/python/cutlass_cppgen/backend/utils/device.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/utils 2026-04-14T21:02:46,324 creating build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt 2026-04-14T21:02:46,325 copying 3rdparty/cutlass/python/cutlass_cppgen/backend/evt/__init__.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt 2026-04-14T21:02:46,327 copying 3rdparty/cutlass/python/cutlass_cppgen/backend/evt/epilogue.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt 2026-04-14T21:02:46,330 creating build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/backend 2026-04-14T21:02:46,331 copying 3rdparty/cutlass/python/cutlass_cppgen/backend/evt/backend/__init__.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/backend 2026-04-14T21:02:46,333 copying 3rdparty/cutlass/python/cutlass_cppgen/backend/evt/backend/sm80_emitter.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/backend 2026-04-14T21:02:46,335 copying 3rdparty/cutlass/python/cutlass_cppgen/backend/evt/backend/sm100_emitter.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/backend 2026-04-14T21:02:46,337 copying 3rdparty/cutlass/python/cutlass_cppgen/backend/evt/backend/emitter_base.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/backend 2026-04-14T21:02:46,339 copying 3rdparty/cutlass/python/cutlass_cppgen/backend/evt/backend/sm90_emitter.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/backend 2026-04-14T21:02:46,341 copying 3rdparty/cutlass/python/cutlass_cppgen/backend/evt/backend/sm90_nodes.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/backend 2026-04-14T21:02:46,343 copying 3rdparty/cutlass/python/cutlass_cppgen/backend/evt/backend/sm80_nodes.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/backend 2026-04-14T21:02:46,346 copying 3rdparty/cutlass/python/cutlass_cppgen/backend/evt/backend/sm100_nodes.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/backend 2026-04-14T21:02:46,349 creating build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/passes 2026-04-14T21:02:46,350 copying 3rdparty/cutlass/python/cutlass_cppgen/backend/evt/passes/pass_shape_type_propagation.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/passes 2026-04-14T21:02:46,352 copying 3rdparty/cutlass/python/cutlass_cppgen/backend/evt/passes/pass_fix_element_d.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/passes 2026-04-14T21:02:46,354 copying 3rdparty/cutlass/python/cutlass_cppgen/backend/evt/passes/__init__.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/passes 2026-04-14T21:02:46,355 copying 3rdparty/cutlass/python/cutlass_cppgen/backend/evt/passes/util.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/passes 2026-04-14T21:02:46,357 copying 3rdparty/cutlass/python/cutlass_cppgen/backend/evt/passes/pass_argument_type.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/passes 2026-04-14T21:02:46,359 copying 3rdparty/cutlass/python/cutlass_cppgen/backend/evt/passes/smem_size_calculator.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/passes 2026-04-14T21:02:46,362 copying 3rdparty/cutlass/python/cutlass_cppgen/backend/evt/passes/pass_layout_elimination.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/passes 2026-04-14T21:02:46,364 copying 3rdparty/cutlass/python/cutlass_cppgen/backend/evt/passes/pass_get_impl.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/passes 2026-04-14T21:02:46,366 copying 3rdparty/cutlass/python/cutlass_cppgen/backend/evt/passes/graph_drawer.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/passes 2026-04-14T21:02:46,368 copying 3rdparty/cutlass/python/cutlass_cppgen/backend/evt/passes/pass_dag_2_tree.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/passes 2026-04-14T21:02:46,370 copying 3rdparty/cutlass/python/cutlass_cppgen/backend/evt/passes/pass_manager.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/passes 2026-04-14T21:02:46,372 copying 3rdparty/cutlass/python/cutlass_cppgen/backend/evt/passes/pass_no_op_elimination.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/passes 2026-04-14T21:02:46,374 copying 3rdparty/cutlass/python/cutlass_cppgen/backend/evt/passes/pass_preprocess_red.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/passes 2026-04-14T21:02:46,376 creating build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/ir 2026-04-14T21:02:46,378 copying 3rdparty/cutlass/python/cutlass_cppgen/backend/evt/ir/__init__.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/ir 2026-04-14T21:02:46,380 copying 3rdparty/cutlass/python/cutlass_cppgen/backend/evt/ir/load_nodes.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/ir 2026-04-14T21:02:46,382 copying 3rdparty/cutlass/python/cutlass_cppgen/backend/evt/ir/compute_nodes.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/ir 2026-04-14T21:02:46,383 copying 3rdparty/cutlass/python/cutlass_cppgen/backend/evt/ir/store_nodes.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/ir 2026-04-14T21:02:46,385 copying 3rdparty/cutlass/python/cutlass_cppgen/backend/evt/ir/node.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/ir 2026-04-14T21:02:46,388 copying 3rdparty/cutlass/python/cutlass_cppgen/backend/evt/ir/dag_ir.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/ir 2026-04-14T21:02:46,390 copying 3rdparty/cutlass/python/cutlass_cppgen/backend/evt/ir/layout_algorithm.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/ir 2026-04-14T21:02:46,392 copying 3rdparty/cutlass/python/cutlass_cppgen/backend/evt/ir/layout_nodes.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/ir 2026-04-14T21:02:46,394 copying 3rdparty/cutlass/python/cutlass_cppgen/backend/evt/ir/tensor.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/ir 2026-04-14T21:02:46,397 creating build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/frontend 2026-04-14T21:02:46,398 copying 3rdparty/cutlass/python/cutlass_cppgen/backend/evt/frontend/__init__.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/frontend 2026-04-14T21:02:46,400 copying 3rdparty/cutlass/python/cutlass_cppgen/backend/evt/frontend/frontend_base.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/frontend 2026-04-14T21:02:46,402 copying 3rdparty/cutlass/python/cutlass_cppgen/backend/evt/frontend/python_ast.py -> build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/frontend 2026-04-14T21:02:46,405 creating build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass 2026-04-14T21:02:46,405 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/__init__.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass 2026-04-14T21:02:46,407 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/torch.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass 2026-04-14T21:02:46,410 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/impl_utils.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass 2026-04-14T21:02:46,412 creating build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils 2026-04-14T21:02:46,413 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/utils/hopper_helpers.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils 2026-04-14T21:02:46,416 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/utils/__init__.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils 2026-04-14T21:02:46,418 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/utils/print_latex.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils 2026-04-14T21:02:46,420 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/utils/grouped_gemm_tile_scheduler_helper.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils 2026-04-14T21:02:46,423 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/utils/distributed.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils 2026-04-14T21:02:46,425 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/utils/tensormap_manager.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils 2026-04-14T21:02:46,427 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/utils/grouped_gemm_persistent_tile_scheduler.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils 2026-04-14T21:02:46,431 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/utils/layout.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils 2026-04-14T21:02:46,433 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/utils/blockscaled_layout.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils 2026-04-14T21:02:46,436 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/utils/tmem_allocator.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils 2026-04-14T21:02:46,439 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/utils/tensor_helpers.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils 2026-04-14T21:02:46,440 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/utils/hardware_info.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils 2026-04-14T21:02:46,442 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/utils/dynamic_persistent_tile_scheduler.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils 2026-04-14T21:02:46,445 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/utils/blackwell_helpers.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils 2026-04-14T21:02:46,447 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/utils/static_persistent_tile_scheduler.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils 2026-04-14T21:02:46,450 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/utils/mixed_input_helpers.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils 2026-04-14T21:02:46,453 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/utils/smem_allocator.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils 2026-04-14T21:02:46,457 creating build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl 2026-04-14T21:02:46,458 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/base_dsl/__init__.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl 2026-04-14T21:02:46,460 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/base_dsl/env_manager.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl 2026-04-14T21:02:46,463 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/base_dsl/arch.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl 2026-04-14T21:02:46,466 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/base_dsl/dsl.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl 2026-04-14T21:02:46,469 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/base_dsl/compiler.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl 2026-04-14T21:02:46,472 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/base_dsl/common.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl 2026-04-14T21:02:46,475 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/base_dsl/typing.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl 2026-04-14T21:02:46,478 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/base_dsl/ast_helpers.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl 2026-04-14T21:02:46,481 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/base_dsl/cache_helpers.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl 2026-04-14T21:02:46,484 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/base_dsl/version_info.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl 2026-04-14T21:02:46,486 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/base_dsl/jit_executor.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl 2026-04-14T21:02:46,489 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/base_dsl/ast_preprocessor.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl 2026-04-14T21:02:46,493 creating build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute 2026-04-14T21:02:46,494 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/cute/runtime.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute 2026-04-14T21:02:46,497 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/cute/ffi.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute 2026-04-14T21:02:46,500 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/cute/__init__.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute 2026-04-14T21:02:46,502 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/cute/testing.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute 2026-04-14T21:02:46,505 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/cute/_tvm_ffi_args_spec_converter.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute 2026-04-14T21:02:46,508 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/cute/tuple.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute 2026-04-14T21:02:46,510 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/cute/algorithm.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute 2026-04-14T21:02:46,513 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/cute/typing.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute 2026-04-14T21:02:46,515 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/cute/core.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute 2026-04-14T21:02:46,520 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/cute/tensor.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute 2026-04-14T21:02:46,523 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/cute/atom.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute 2026-04-14T21:02:46,526 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/cute/math.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute 2026-04-14T21:02:46,529 creating build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/jax 2026-04-14T21:02:46,530 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/jax/ffi.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/jax 2026-04-14T21:02:46,533 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/jax/__init__.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/jax 2026-04-14T21:02:46,535 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/jax/testing.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/jax 2026-04-14T21:02:46,537 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/jax/primitive.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/jax 2026-04-14T21:02:46,539 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/jax/compile.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/jax 2026-04-14T21:02:46,541 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/jax/types.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/jax 2026-04-14T21:02:46,544 creating build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/pipeline 2026-04-14T21:02:46,545 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/pipeline/sm100.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/pipeline 2026-04-14T21:02:46,548 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/pipeline/sm90.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/pipeline 2026-04-14T21:02:46,551 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/pipeline/__init__.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/pipeline 2026-04-14T21:02:46,553 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/pipeline/helpers.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/pipeline 2026-04-14T21:02:46,556 creating build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cutlass_dsl 2026-04-14T21:02:46,557 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/cutlass_dsl/cutlass.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cutlass_dsl 2026-04-14T21:02:46,561 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/cutlass_dsl/tvm_ffi_provider.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cutlass_dsl 2026-04-14T21:02:46,564 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/cutlass_dsl/__init__.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cutlass_dsl 2026-04-14T21:02:46,566 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/cutlass_dsl/cutlass_ast_decorators.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cutlass_dsl 2026-04-14T21:02:46,568 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/cutlass_dsl/cuda_stream_adapter.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cutlass_dsl 2026-04-14T21:02:46,570 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/cutlass_dsl/cuda_jit_executor.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cutlass_dsl 2026-04-14T21:02:46,573 creating build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils/gemm 2026-04-14T21:02:46,574 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/utils/gemm/sm100.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils/gemm 2026-04-14T21:02:46,577 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/utils/gemm/__init__.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils/gemm 2026-04-14T21:02:46,579 creating build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/utils 2026-04-14T21:02:46,580 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/base_dsl/utils/__init__.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/utils 2026-04-14T21:02:46,582 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/base_dsl/utils/tree_utils.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/utils 2026-04-14T21:02:46,584 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/base_dsl/utils/timer.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/utils 2026-04-14T21:02:46,586 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/base_dsl/utils/numpy.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/utils 2026-04-14T21:02:46,588 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/base_dsl/utils/logger.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/utils 2026-04-14T21:02:46,590 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/base_dsl/utils/stacktrace.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/utils 2026-04-14T21:02:46,593 creating build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/runtime 2026-04-14T21:02:46,594 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/base_dsl/runtime/cuda.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/runtime 2026-04-14T21:02:46,597 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/base_dsl/runtime/__init__.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/runtime 2026-04-14T21:02:46,598 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/base_dsl/runtime/tensor_descriptor.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/runtime 2026-04-14T21:02:46,601 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/base_dsl/runtime/device_tensor.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/runtime 2026-04-14T21:02:46,603 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/base_dsl/runtime/dlpack_types.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/runtime 2026-04-14T21:02:46,605 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/base_dsl/runtime/stream_adapter.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/runtime 2026-04-14T21:02:46,607 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/base_dsl/runtime/jit_arg_adapters.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/runtime 2026-04-14T21:02:46,609 creating build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/tvm_ffi_builder 2026-04-14T21:02:46,610 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/base_dsl/tvm_ffi_builder/call_provider.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/tvm_ffi_builder 2026-04-14T21:02:46,613 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/base_dsl/tvm_ffi_builder/mlir_builder.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/tvm_ffi_builder 2026-04-14T21:02:46,616 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/base_dsl/tvm_ffi_builder/__init__.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/tvm_ffi_builder 2026-04-14T21:02:46,618 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/base_dsl/tvm_ffi_builder/spec.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/tvm_ffi_builder 2026-04-14T21:02:46,621 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/base_dsl/tvm_ffi_builder/tvm_ffi_builder.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/tvm_ffi_builder 2026-04-14T21:02:46,625 creating build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/export 2026-04-14T21:02:46,626 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/base_dsl/export/__init__.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/export 2026-04-14T21:02:46,628 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/base_dsl/export/external_binary_module.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/export 2026-04-14T21:02:46,630 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/base_dsl/export/export.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/export 2026-04-14T21:02:46,632 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/base_dsl/export/c_header_generator.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/export 2026-04-14T21:02:46,635 creating build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/_mlir_helpers 2026-04-14T21:02:46,636 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/base_dsl/_mlir_helpers/__init__.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/_mlir_helpers 2026-04-14T21:02:46,639 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/base_dsl/_mlir_helpers/op.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/_mlir_helpers 2026-04-14T21:02:46,641 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/base_dsl/_mlir_helpers/lru_cache_ir.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/_mlir_helpers 2026-04-14T21:02:46,643 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/base_dsl/_mlir_helpers/gpu.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/_mlir_helpers 2026-04-14T21:02:46,645 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/base_dsl/_mlir_helpers/arith.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/_mlir_helpers 2026-04-14T21:02:46,648 creating build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/experimental 2026-04-14T21:02:46,650 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/cute/experimental/pipeline.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/experimental 2026-04-14T21:02:46,653 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/cute/experimental/__init__.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/experimental 2026-04-14T21:02:46,655 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/cute/experimental/memory.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/experimental 2026-04-14T21:02:46,658 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/cute/experimental/algorithm.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/experimental 2026-04-14T21:02:46,661 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/cute/experimental/core.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/experimental 2026-04-14T21:02:46,663 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/cute/experimental/math.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/experimental 2026-04-14T21:02:46,666 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/cute/experimental/utils.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/experimental 2026-04-14T21:02:46,669 creating build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/arch 2026-04-14T21:02:46,671 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/cute/arch/tmem.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/arch 2026-04-14T21:02:46,674 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/cute/arch/__init__.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/arch 2026-04-14T21:02:46,676 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/cute/arch/numeric_conversion.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/arch 2026-04-14T21:02:46,679 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/cute/arch/smem.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/arch 2026-04-14T21:02:46,682 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/cute/arch/nvvm_wrappers.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/arch 2026-04-14T21:02:46,686 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/cute/arch/mbar.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/arch 2026-04-14T21:02:46,689 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/cute/arch/elect.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/arch 2026-04-14T21:02:46,692 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/cute/arch/clc.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/arch 2026-04-14T21:02:46,695 creating build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu 2026-04-14T21:02:46,697 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/__init__.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu 2026-04-14T21:02:46,700 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/helpers.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu 2026-04-14T21:02:46,703 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/common.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu 2026-04-14T21:02:46,707 creating build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/export 2026-04-14T21:02:46,709 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/cute/export/__init__.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/export 2026-04-14T21:02:46,712 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/cute/export/load.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/export 2026-04-14T21:02:46,714 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/cute/export/export.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/export 2026-04-14T21:02:46,717 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/cute/export/aot_config.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/export 2026-04-14T21:02:46,720 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/cute/export/c_header_generator.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/export 2026-04-14T21:02:46,724 creating build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/warp 2026-04-14T21:02:46,726 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/warp/copy.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/warp 2026-04-14T21:02:46,730 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/warp/__init__.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/warp 2026-04-14T21:02:46,732 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/warp/mma.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/warp 2026-04-14T21:02:46,736 creating build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/tcgen05 2026-04-14T21:02:46,738 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/tcgen05/copy.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/tcgen05 2026-04-14T21:02:46,742 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/tcgen05/__init__.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/tcgen05 2026-04-14T21:02:46,745 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/tcgen05/helpers.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/tcgen05 2026-04-14T21:02:46,748 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/tcgen05/mma.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/tcgen05 2026-04-14T21:02:46,753 creating build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/cpasync 2026-04-14T21:02:46,755 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/cpasync/copy.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/cpasync 2026-04-14T21:02:46,759 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/cpasync/__init__.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/cpasync 2026-04-14T21:02:46,762 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/cpasync/helpers.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/cpasync 2026-04-14T21:02:46,765 creating build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/warpgroup 2026-04-14T21:02:46,767 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/warpgroup/__init__.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/warpgroup 2026-04-14T21:02:46,770 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/warpgroup/helpers.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/warpgroup 2026-04-14T21:02:46,773 copying 3rdparty/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/warpgroup/mma.py -> build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/warpgroup 2026-04-14T21:02:46,777 creating build/lib/flashinfer/data/cutlass/python/docs_src/source 2026-04-14T21:02:46,779 copying 3rdparty/cutlass/python/docs_src/source/conf.py -> build/lib/flashinfer/data/cutlass/python/docs_src/source 2026-04-14T21:02:46,784 creating build/lib/flashinfer/data/cutlass/tools/util/scripts 2026-04-14T21:02:46,787 copying 3rdparty/cutlass/tools/util/scripts/split_test_cmake.py -> build/lib/flashinfer/data/cutlass/tools/util/scripts 2026-04-14T21:02:46,797 creating build/lib/flashinfer/data/cutlass/examples/40_cutlass_py 2026-04-14T21:02:46,799 copying 3rdparty/cutlass/examples/40_cutlass_py/conv2d.py -> build/lib/flashinfer/data/cutlass/examples/40_cutlass_py 2026-04-14T21:02:46,803 copying 3rdparty/cutlass/examples/40_cutlass_py/gemm.py -> build/lib/flashinfer/data/cutlass/examples/40_cutlass_py 2026-04-14T21:02:46,806 copying 3rdparty/cutlass/examples/40_cutlass_py/gemm_grouped.py -> build/lib/flashinfer/data/cutlass/examples/40_cutlass_py 2026-04-14T21:02:46,810 creating build/lib/flashinfer/data/cutlass/examples/41_fused_multi_head_attention 2026-04-14T21:02:46,812 copying 3rdparty/cutlass/examples/41_fused_multi_head_attention/fmha_backward_test.py -> build/lib/flashinfer/data/cutlass/examples/41_fused_multi_head_attention 2026-04-14T21:02:46,815 copying 3rdparty/cutlass/examples/41_fused_multi_head_attention/piped_subprocess.py -> build/lib/flashinfer/data/cutlass/examples/41_fused_multi_head_attention 2026-04-14T21:02:46,819 creating build/lib/flashinfer/data/cutlass/examples/44_multi_gemm_ir_and_codegen/ir_gen 2026-04-14T21:02:46,822 copying 3rdparty/cutlass/examples/44_multi_gemm_ir_and_codegen/ir_gen/gen_sample.py -> build/lib/flashinfer/data/cutlass/examples/44_multi_gemm_ir_and_codegen/ir_gen 2026-04-14T21:02:46,825 copying 3rdparty/cutlass/examples/44_multi_gemm_ir_and_codegen/ir_gen/gen_device.py -> build/lib/flashinfer/data/cutlass/examples/44_multi_gemm_ir_and_codegen/ir_gen 2026-04-14T21:02:46,828 copying 3rdparty/cutlass/examples/44_multi_gemm_ir_and_codegen/ir_gen/gen_verify.py -> build/lib/flashinfer/data/cutlass/examples/44_multi_gemm_ir_and_codegen/ir_gen 2026-04-14T21:02:46,831 copying 3rdparty/cutlass/examples/44_multi_gemm_ir_and_codegen/ir_gen/gen_ir.py -> build/lib/flashinfer/data/cutlass/examples/44_multi_gemm_ir_and_codegen/ir_gen 2026-04-14T21:02:46,834 copying 3rdparty/cutlass/examples/44_multi_gemm_ir_and_codegen/ir_gen/gen_kernel.py -> build/lib/flashinfer/data/cutlass/examples/44_multi_gemm_ir_and_codegen/ir_gen 2026-04-14T21:02:46,837 copying 3rdparty/cutlass/examples/44_multi_gemm_ir_and_codegen/ir_gen/replace_fix_impl_header.py -> build/lib/flashinfer/data/cutlass/examples/44_multi_gemm_ir_and_codegen/ir_gen 2026-04-14T21:02:46,840 copying 3rdparty/cutlass/examples/44_multi_gemm_ir_and_codegen/ir_gen/gen_cmake.py -> build/lib/flashinfer/data/cutlass/examples/44_multi_gemm_ir_and_codegen/ir_gen 2026-04-14T21:02:46,842 copying 3rdparty/cutlass/examples/44_multi_gemm_ir_and_codegen/ir_gen/gen_all_code.py -> build/lib/flashinfer/data/cutlass/examples/44_multi_gemm_ir_and_codegen/ir_gen 2026-04-14T21:02:46,844 copying 3rdparty/cutlass/examples/44_multi_gemm_ir_and_codegen/ir_gen/gen_threadblock.py -> build/lib/flashinfer/data/cutlass/examples/44_multi_gemm_ir_and_codegen/ir_gen 2026-04-14T21:02:46,847 copying 3rdparty/cutlass/examples/44_multi_gemm_ir_and_codegen/ir_gen/gen_turing_and_volta.py -> build/lib/flashinfer/data/cutlass/examples/44_multi_gemm_ir_and_codegen/ir_gen 2026-04-14T21:02:46,850 copying 3rdparty/cutlass/examples/44_multi_gemm_ir_and_codegen/ir_gen/gen_customized_epilogue.py -> build/lib/flashinfer/data/cutlass/examples/44_multi_gemm_ir_and_codegen/ir_gen 2026-04-14T21:02:46,852 copying 3rdparty/cutlass/examples/44_multi_gemm_ir_and_codegen/ir_gen/helper.py -> build/lib/flashinfer/data/cutlass/examples/44_multi_gemm_ir_and_codegen/ir_gen 2026-04-14T21:02:46,855 creating build/lib/flashinfer/data/cutlass/examples/40_cutlass_py/customizable 2026-04-14T21:02:46,856 copying 3rdparty/cutlass/examples/40_cutlass_py/customizable/conv2d.py -> build/lib/flashinfer/data/cutlass/examples/40_cutlass_py/customizable 2026-04-14T21:02:46,859 copying 3rdparty/cutlass/examples/40_cutlass_py/customizable/gemm.py -> build/lib/flashinfer/data/cutlass/examples/40_cutlass_py/customizable 2026-04-14T21:02:46,861 copying 3rdparty/cutlass/examples/40_cutlass_py/customizable/gemm_grouped.py -> build/lib/flashinfer/data/cutlass/examples/40_cutlass_py/customizable 2026-04-14T21:02:46,866 creating build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/ampere 2026-04-14T21:02:46,868 copying 3rdparty/cutlass/examples/python/CuTeDSL/ampere/flash_attention_v2.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/ampere 2026-04-14T21:02:46,873 copying 3rdparty/cutlass/examples/python/CuTeDSL/ampere/call_from_jit.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/ampere 2026-04-14T21:02:46,875 copying 3rdparty/cutlass/examples/python/CuTeDSL/ampere/tensorop_gemm.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/ampere 2026-04-14T21:02:46,879 copying 3rdparty/cutlass/examples/python/CuTeDSL/ampere/cooperative_launch.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/ampere 2026-04-14T21:02:46,882 copying 3rdparty/cutlass/examples/python/CuTeDSL/ampere/elementwise_add.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/ampere 2026-04-14T21:02:46,885 copying 3rdparty/cutlass/examples/python/CuTeDSL/ampere/elementwise_add_autotune.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/ampere 2026-04-14T21:02:46,887 copying 3rdparty/cutlass/examples/python/CuTeDSL/ampere/sgemm.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/ampere 2026-04-14T21:02:46,891 copying 3rdparty/cutlass/examples/python/CuTeDSL/ampere/hstu_attention.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/ampere 2026-04-14T21:02:46,894 copying 3rdparty/cutlass/examples/python/CuTeDSL/ampere/elementwise_apply.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/ampere 2026-04-14T21:02:46,897 copying 3rdparty/cutlass/examples/python/CuTeDSL/ampere/dynamic_smem_size.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/ampere 2026-04-14T21:02:46,898 copying 3rdparty/cutlass/examples/python/CuTeDSL/ampere/call_bypass_dlpack.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/ampere 2026-04-14T21:02:46,901 copying 3rdparty/cutlass/examples/python/CuTeDSL/ampere/smem_allocator.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/ampere 2026-04-14T21:02:46,903 copying 3rdparty/cutlass/examples/python/CuTeDSL/ampere/inline_ptx.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/ampere 2026-04-14T21:02:46,906 creating build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/utils 2026-04-14T21:02:46,907 copying 3rdparty/cutlass/examples/python/CuTeDSL/utils/__init__.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/utils 2026-04-14T21:02:46,909 copying 3rdparty/cutlass/examples/python/CuTeDSL/utils/fmha_helpers.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/utils 2026-04-14T21:02:46,913 copying 3rdparty/cutlass/examples/python/CuTeDSL/utils/sparse_utils.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/utils 2026-04-14T21:02:46,916 copying 3rdparty/cutlass/examples/python/CuTeDSL/utils/test_sparse_utils.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/utils 2026-04-14T21:02:46,919 creating build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell_geforce 2026-04-14T21:02:46,920 copying 3rdparty/cutlass/examples/python/CuTeDSL/blackwell_geforce/dense_gemm.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell_geforce 2026-04-14T21:02:46,925 creating build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/cute 2026-04-14T21:02:46,926 copying 3rdparty/cutlass/examples/python/CuTeDSL/cute/print_latex.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/cute 2026-04-14T21:02:46,929 copying 3rdparty/cutlass/examples/python/CuTeDSL/cute/torch_fake_tensor.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/cute 2026-04-14T21:02:46,931 creating build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/hopper 2026-04-14T21:02:46,933 copying 3rdparty/cutlass/examples/python/CuTeDSL/hopper/cta_norm.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/hopper 2026-04-14T21:02:46,936 copying 3rdparty/cutlass/examples/python/CuTeDSL/hopper/fmha.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/hopper 2026-04-14T21:02:46,941 copying 3rdparty/cutlass/examples/python/CuTeDSL/hopper/dense_gemm_persistent.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/hopper 2026-04-14T21:02:46,945 copying 3rdparty/cutlass/examples/python/CuTeDSL/hopper/dense_gemm.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/hopper 2026-04-14T21:02:46,950 creating build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/distributed 2026-04-14T21:02:46,951 copying 3rdparty/cutlass/examples/python/CuTeDSL/distributed/all_reduce_simple.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/distributed 2026-04-14T21:02:46,954 copying 3rdparty/cutlass/examples/python/CuTeDSL/distributed/distributed_gemm_reduce_scatter_blackwell.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/distributed 2026-04-14T21:02:46,959 copying 3rdparty/cutlass/examples/python/CuTeDSL/distributed/distributed_gemm_all_reduce_blackwell.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/distributed 2026-04-14T21:02:46,965 copying 3rdparty/cutlass/examples/python/CuTeDSL/distributed/all_reduce_tma.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/distributed 2026-04-14T21:02:46,968 copying 3rdparty/cutlass/examples/python/CuTeDSL/distributed/all_reduce_one_shot_lamport.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/distributed 2026-04-14T21:02:46,971 copying 3rdparty/cutlass/examples/python/CuTeDSL/distributed/all_reduce_two_shot_multimem.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/distributed 2026-04-14T21:02:46,974 copying 3rdparty/cutlass/examples/python/CuTeDSL/distributed/distributed_all_gather_gemm_blackwell.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/distributed 2026-04-14T21:02:46,979 creating build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/jax 2026-04-14T21:02:46,980 copying 3rdparty/cutlass/examples/python/CuTeDSL/jax/cutlass_call_basic.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/jax 2026-04-14T21:02:46,983 copying 3rdparty/cutlass/examples/python/CuTeDSL/jax/elementwise_apply_example.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/jax 2026-04-14T21:02:46,986 copying 3rdparty/cutlass/examples/python/CuTeDSL/jax/cutlass_call_export.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/jax 2026-04-14T21:02:46,989 copying 3rdparty/cutlass/examples/python/CuTeDSL/jax/cutlass_call_sharding.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/jax 2026-04-14T21:02:46,992 creating build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell 2026-04-14T21:02:46,994 copying 3rdparty/cutlass/examples/python/CuTeDSL/blackwell/fmha_bwd.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell 2026-04-14T21:02:46,999 copying 3rdparty/cutlass/examples/python/CuTeDSL/blackwell/grouped_blockscaled_gemm.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell 2026-04-14T21:02:47,005 copying 3rdparty/cutlass/examples/python/CuTeDSL/blackwell/dense_gemm_software_pipeline.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell 2026-04-14T21:02:47,009 copying 3rdparty/cutlass/examples/python/CuTeDSL/blackwell/dense_blockscaled_gemm_persistent.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell 2026-04-14T21:02:47,015 copying 3rdparty/cutlass/examples/python/CuTeDSL/blackwell/mla.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell 2026-04-14T21:02:47,022 copying 3rdparty/cutlass/examples/python/CuTeDSL/blackwell/dense_gemm_alpha_beta_persistent.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell 2026-04-14T21:02:47,027 copying 3rdparty/cutlass/examples/python/CuTeDSL/blackwell/dense_blockscaled_gemm_persistent_amax.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell 2026-04-14T21:02:47,033 copying 3rdparty/cutlass/examples/python/CuTeDSL/blackwell/grouped_gemm.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell 2026-04-14T21:02:47,039 copying 3rdparty/cutlass/examples/python/CuTeDSL/blackwell/fmha.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell 2026-04-14T21:02:47,045 copying 3rdparty/cutlass/examples/python/CuTeDSL/blackwell/dense_blockscaled_gemm_persistent_prefetch.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell 2026-04-14T21:02:47,052 copying 3rdparty/cutlass/examples/python/CuTeDSL/blackwell/dense_gemm_persistent_prefetch.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell 2026-04-14T21:02:47,057 copying 3rdparty/cutlass/examples/python/CuTeDSL/blackwell/reduce.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell 2026-04-14T21:02:47,060 copying 3rdparty/cutlass/examples/python/CuTeDSL/blackwell/rmsnorm.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell 2026-04-14T21:02:47,063 copying 3rdparty/cutlass/examples/python/CuTeDSL/blackwell/sm103_dense_blockscaled_gemm_persistent.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell 2026-04-14T21:02:47,070 copying 3rdparty/cutlass/examples/python/CuTeDSL/blackwell/programmatic_dependent_launch.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell 2026-04-14T21:02:47,073 copying 3rdparty/cutlass/examples/python/CuTeDSL/blackwell/dense_gemm_persistent.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell 2026-04-14T21:02:47,078 copying 3rdparty/cutlass/examples/python/CuTeDSL/blackwell/dense_gemm_persistent_dynamic.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell 2026-04-14T21:02:47,082 copying 3rdparty/cutlass/examples/python/CuTeDSL/blackwell/dense_gemm.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell 2026-04-14T21:02:47,088 creating build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/helpers 2026-04-14T21:02:47,090 copying 3rdparty/cutlass/examples/python/CuTeDSL/helpers/__init__.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/helpers 2026-04-14T21:02:47,093 copying 3rdparty/cutlass/examples/python/CuTeDSL/helpers/fmha_helpers.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/helpers 2026-04-14T21:02:47,098 creating build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/experimental/ampere 2026-04-14T21:02:47,100 copying 3rdparty/cutlass/examples/python/CuTeDSL/experimental/ampere/memcpy_simt_universal_copy.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/experimental/ampere 2026-04-14T21:02:47,104 creating build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/experimental/blackwell 2026-04-14T21:02:47,106 copying 3rdparty/cutlass/examples/python/CuTeDSL/experimental/blackwell/dense_gemm_cute_pipeline.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/experimental/blackwell 2026-04-14T21:02:47,111 copying 3rdparty/cutlass/examples/python/CuTeDSL/experimental/blackwell/dense_gemm_2sm.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/experimental/blackwell 2026-04-14T21:02:47,115 copying 3rdparty/cutlass/examples/python/CuTeDSL/experimental/blackwell/dense_block_scaled_gemm.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/experimental/blackwell 2026-04-14T21:02:47,120 copying 3rdparty/cutlass/examples/python/CuTeDSL/experimental/blackwell/dense_gemm_ptr_array.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/experimental/blackwell 2026-04-14T21:02:47,124 copying 3rdparty/cutlass/examples/python/CuTeDSL/experimental/blackwell/dense_gemm.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/experimental/blackwell 2026-04-14T21:02:47,131 creating build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/cute/tvm_ffi 2026-04-14T21:02:47,133 copying 3rdparty/cutlass/examples/python/CuTeDSL/cute/tvm_ffi/aot_use_in_torch.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/cute/tvm_ffi 2026-04-14T21:02:47,136 copying 3rdparty/cutlass/examples/python/CuTeDSL/cute/tvm_ffi/error_reporting.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/cute/tvm_ffi 2026-04-14T21:02:47,139 copying 3rdparty/cutlass/examples/python/CuTeDSL/cute/tvm_ffi/compile_with_fake_tensor.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/cute/tvm_ffi 2026-04-14T21:02:47,142 copying 3rdparty/cutlass/examples/python/CuTeDSL/cute/tvm_ffi/ampere_gemm_with_fake_tensor.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/cute/tvm_ffi 2026-04-14T21:02:47,146 copying 3rdparty/cutlass/examples/python/CuTeDSL/cute/tvm_ffi/jit_and_use_in_jax.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/cute/tvm_ffi 2026-04-14T21:02:47,149 copying 3rdparty/cutlass/examples/python/CuTeDSL/cute/tvm_ffi/aot_export.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/cute/tvm_ffi 2026-04-14T21:02:47,152 copying 3rdparty/cutlass/examples/python/CuTeDSL/cute/tvm_ffi/jit_and_use_in_torch.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/cute/tvm_ffi 2026-04-14T21:02:47,155 copying 3rdparty/cutlass/examples/python/CuTeDSL/cute/tvm_ffi/aot_use_in_jax.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/cute/tvm_ffi 2026-04-14T21:02:47,158 creating build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/cute/ffi 2026-04-14T21:02:47,160 copying 3rdparty/cutlass/examples/python/CuTeDSL/cute/ffi/jit_argument.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/cute/ffi 2026-04-14T21:02:47,164 creating build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/cute/export 2026-04-14T21:02:47,166 copying 3rdparty/cutlass/examples/python/CuTeDSL/cute/export/load_in_python.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/cute/export 2026-04-14T21:02:47,169 copying 3rdparty/cutlass/examples/python/CuTeDSL/cute/export/export_to_c.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/cute/export 2026-04-14T21:02:47,173 creating build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/mixed_input_gemm 2026-04-14T21:02:47,175 copying 3rdparty/cutlass/examples/python/CuTeDSL/blackwell/mixed_input_gemm/mixed_input_host_utils.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/mixed_input_gemm 2026-04-14T21:02:47,178 copying 3rdparty/cutlass/examples/python/CuTeDSL/blackwell/mixed_input_gemm/grouped_mixed_input_gemm_acc_scale.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/mixed_input_gemm 2026-04-14T21:02:47,183 copying 3rdparty/cutlass/examples/python/CuTeDSL/blackwell/mixed_input_gemm/mixed_input_gemm.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/mixed_input_gemm 2026-04-14T21:02:47,188 copying 3rdparty/cutlass/examples/python/CuTeDSL/blackwell/mixed_input_gemm/grouped_mixed_input_gemm.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/mixed_input_gemm 2026-04-14T21:02:47,194 creating build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/epilogue 2026-04-14T21:02:47,195 copying 3rdparty/cutlass/examples/python/CuTeDSL/blackwell/epilogue/activation_custom_epilogue_dense_gemm.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/epilogue 2026-04-14T21:02:47,198 copying 3rdparty/cutlass/examples/python/CuTeDSL/blackwell/epilogue/common_efc.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/epilogue 2026-04-14T21:02:47,202 copying 3rdparty/cutlass/examples/python/CuTeDSL/blackwell/epilogue/custom_epilogue_dense_gemm.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/epilogue 2026-04-14T21:02:47,206 copying 3rdparty/cutlass/examples/python/CuTeDSL/blackwell/epilogue/synthetic_custom_epilogue_dense_gemm.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/epilogue 2026-04-14T21:02:47,209 copying 3rdparty/cutlass/examples/python/CuTeDSL/blackwell/epilogue/common_dense_gemm_efc.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/epilogue 2026-04-14T21:02:47,214 creating build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/tutorial_gemm 2026-04-14T21:02:47,215 copying 3rdparty/cutlass/examples/python/CuTeDSL/blackwell/tutorial_gemm/fp16_gemm_0.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/tutorial_gemm 2026-04-14T21:02:47,218 copying 3rdparty/cutlass/examples/python/CuTeDSL/blackwell/tutorial_gemm/nvfp4_gemm_1.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/tutorial_gemm 2026-04-14T21:02:47,221 copying 3rdparty/cutlass/examples/python/CuTeDSL/blackwell/tutorial_gemm/fp16_gemm_1.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/tutorial_gemm 2026-04-14T21:02:47,223 copying 3rdparty/cutlass/examples/python/CuTeDSL/blackwell/tutorial_gemm/nvfp4_gemm_0.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/tutorial_gemm 2026-04-14T21:02:47,226 copying 3rdparty/cutlass/examples/python/CuTeDSL/blackwell/tutorial_gemm/utils.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/tutorial_gemm 2026-04-14T21:02:47,229 creating build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/blockwise_gemm 2026-04-14T21:02:47,230 copying 3rdparty/cutlass/examples/python/CuTeDSL/blackwell/blockwise_gemm/masked_grouped_gemm.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/blockwise_gemm 2026-04-14T21:02:47,234 copying 3rdparty/cutlass/examples/python/CuTeDSL/blackwell/blockwise_gemm/contiguous_grouped_gemm.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/blockwise_gemm 2026-04-14T21:02:47,239 copying 3rdparty/cutlass/examples/python/CuTeDSL/blackwell/blockwise_gemm/blockwise_gemm.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/blockwise_gemm 2026-04-14T21:02:47,244 creating build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/mla 2026-04-14T21:02:47,246 copying 3rdparty/cutlass/examples/python/CuTeDSL/blackwell/mla/mla_decode_fp16.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/mla 2026-04-14T21:02:47,253 copying 3rdparty/cutlass/examples/python/CuTeDSL/blackwell/mla/mla_decode_fp8.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/mla 2026-04-14T21:02:47,258 copying 3rdparty/cutlass/examples/python/CuTeDSL/blackwell/mla/mla_helpers.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/mla 2026-04-14T21:02:47,261 creating build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/mamba2_ssd 2026-04-14T21:02:47,262 copying 3rdparty/cutlass/examples/python/CuTeDSL/blackwell/mamba2_ssd/mamba2_ssd.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/mamba2_ssd 2026-04-14T21:02:47,267 copying 3rdparty/cutlass/examples/python/CuTeDSL/blackwell/mamba2_ssd/mamba2_ssd_tile_scheduler.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/mamba2_ssd 2026-04-14T21:02:47,269 copying 3rdparty/cutlass/examples/python/CuTeDSL/blackwell/mamba2_ssd/mamba2_ssd_reference.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/mamba2_ssd 2026-04-14T21:02:47,272 creating build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/mixed_input_fmha 2026-04-14T21:02:47,273 copying 3rdparty/cutlass/examples/python/CuTeDSL/blackwell/mixed_input_fmha/prefill_helpers.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/mixed_input_fmha 2026-04-14T21:02:47,276 copying 3rdparty/cutlass/examples/python/CuTeDSL/blackwell/mixed_input_fmha/mixed_input_fmha_prefill_d512.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/mixed_input_fmha 2026-04-14T21:02:47,281 copying 3rdparty/cutlass/examples/python/CuTeDSL/blackwell/mixed_input_fmha/mixed_input_fmha_decode.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/mixed_input_fmha 2026-04-14T21:02:47,285 copying 3rdparty/cutlass/examples/python/CuTeDSL/blackwell/mixed_input_fmha/mixed_input_fmha_prefill_d256.py -> build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/mixed_input_fmha 2026-04-14T21:02:47,345 creating build/lib/flashinfer/gemm/kernels 2026-04-14T21:02:47,347 copying flashinfer/gemm/kernels/grouped_gemm_masked_blackwell.py -> build/lib/flashinfer/gemm/kernels 2026-04-14T21:02:47,352 copying flashinfer/gemm/kernels/__init__.py -> build/lib/flashinfer/gemm/kernels 2026-04-14T21:02:47,354 copying flashinfer/gemm/kernels/dense_blockscaled_gemm_sm100.py -> build/lib/flashinfer/gemm/kernels 2026-04-14T21:02:47,359 copying flashinfer/gemm/kernels/dense_blockscaled_gemm_sm103.py -> build/lib/flashinfer/gemm/kernels 2026-04-14T21:02:47,364 creating build/lib/flashinfer/triton/kernels 2026-04-14T21:02:47,366 copying flashinfer/triton/kernels/norm.py -> build/lib/flashinfer/triton/kernels 2026-04-14T21:02:47,369 copying flashinfer/triton/kernels/ssd_chunk_state.py -> build/lib/flashinfer/triton/kernels 2026-04-14T21:02:47,372 copying flashinfer/triton/kernels/__init__.py -> build/lib/flashinfer/triton/kernels 2026-04-14T21:02:47,373 copying flashinfer/triton/kernels/sm_constraint_gemm.py -> build/lib/flashinfer/triton/kernels 2026-04-14T21:02:47,376 copying flashinfer/triton/kernels/activation.py -> build/lib/flashinfer/triton/kernels 2026-04-14T21:02:47,378 copying flashinfer/triton/kernels/quant.py -> build/lib/flashinfer/triton/kernels 2026-04-14T21:02:47,380 copying flashinfer/triton/kernels/cascade.py -> build/lib/flashinfer/triton/kernels 2026-04-14T21:02:47,383 creating build/lib/flashinfer/jit/gemm 2026-04-14T21:02:47,385 copying flashinfer/jit/gemm/fp8_blockscale.py -> build/lib/flashinfer/jit/gemm 2026-04-14T21:02:47,387 copying flashinfer/jit/gemm/__init__.py -> build/lib/flashinfer/jit/gemm 2026-04-14T21:02:47,389 copying flashinfer/jit/gemm/deepgemm.py -> build/lib/flashinfer/jit/gemm 2026-04-14T21:02:47,391 copying flashinfer/jit/gemm/core.py -> build/lib/flashinfer/jit/gemm 2026-04-14T21:02:47,394 creating build/lib/flashinfer/jit/mamba 2026-04-14T21:02:47,396 copying flashinfer/jit/mamba/__init__.py -> build/lib/flashinfer/jit/mamba 2026-04-14T21:02:47,398 copying flashinfer/jit/mamba/seq_chunk_cumsum.py -> build/lib/flashinfer/jit/mamba 2026-04-14T21:02:47,400 copying flashinfer/jit/mamba/selective_state_update.py -> build/lib/flashinfer/jit/mamba 2026-04-14T21:02:47,403 creating build/lib/flashinfer/jit/attention 2026-04-14T21:02:47,404 copying flashinfer/jit/attention/__init__.py -> build/lib/flashinfer/jit/attention 2026-04-14T21:02:47,406 copying flashinfer/jit/attention/modules.py -> build/lib/flashinfer/jit/attention 2026-04-14T21:02:47,410 copying flashinfer/jit/attention/variants.py -> build/lib/flashinfer/jit/attention 2026-04-14T21:02:47,412 copying flashinfer/jit/attention/utils.py -> build/lib/flashinfer/jit/attention 2026-04-14T21:02:47,415 creating build/lib/flashinfer/jit/gemm/cutlass 2026-04-14T21:02:47,416 copying flashinfer/jit/gemm/cutlass/cutlass_library.py -> build/lib/flashinfer/jit/gemm/cutlass 2026-04-14T21:02:47,419 copying flashinfer/jit/gemm/cutlass/__init__.py -> build/lib/flashinfer/jit/gemm/cutlass 2026-04-14T21:02:47,421 copying flashinfer/jit/gemm/cutlass/generate_kernels.py -> build/lib/flashinfer/jit/gemm/cutlass 2026-04-14T21:02:47,424 creating build/lib/flashinfer/jit/attention/fmha_v2 2026-04-14T21:02:47,426 copying flashinfer/jit/attention/fmha_v2/generate_kernels.py -> build/lib/flashinfer/jit/attention/fmha_v2 2026-04-14T21:02:47,428 copying flashinfer/jit/attention/fmha_v2/fmha_library.py -> build/lib/flashinfer/jit/attention/fmha_v2 2026-04-14T21:02:47,431 copying flashinfer/jit/attention/fmha_v2/generator_utils.py -> build/lib/flashinfer/jit/attention/fmha_v2 2026-04-14T21:02:47,438 copying flashinfer/jit/attention/fmha_v2/utils.py -> build/lib/flashinfer/jit/attention/fmha_v2 2026-04-14T21:02:47,441 creating build/lib/flashinfer/quantization/kernels 2026-04-14T21:02:47,443 copying flashinfer/quantization/kernels/mxfp4_quantize.py -> build/lib/flashinfer/quantization/kernels 2026-04-14T21:02:47,446 copying flashinfer/quantization/kernels/__init__.py -> build/lib/flashinfer/quantization/kernels 2026-04-14T21:02:47,448 copying flashinfer/quantization/kernels/nvfp4_quantize.py -> build/lib/flashinfer/quantization/kernels 2026-04-14T21:02:47,451 copying flashinfer/quantization/kernels/mxfp8_quantize.py -> build/lib/flashinfer/quantization/kernels 2026-04-14T21:02:48,017 copying flashinfer/py.typed -> build/lib/flashinfer 2026-04-14T21:02:48,019 creating build/lib/flashinfer/data/csrc 2026-04-14T21:02:48,020 copying ./csrc/trtllm_batched_gemm_runner.cu -> build/lib/flashinfer/data/csrc 2026-04-14T21:02:48,023 copying ./csrc/norm.cu -> build/lib/flashinfer/data/csrc 2026-04-14T21:02:48,026 copying ./csrc/single_decode_jit_binding.cu -> build/lib/flashinfer/data/csrc 2026-04-14T21:02:48,028 copying ./csrc/fp4_gemm_cutlass_sm120.jinja -> build/lib/flashinfer/data/csrc 2026-04-14T21:02:48,030 copying ./csrc/single_prefill_sm90_kernel_inst.jinja -> build/lib/flashinfer/data/csrc 2026-04-14T21:02:48,032 copying ./csrc/fmha_v2_run.cu -> build/lib/flashinfer/data/csrc 2026-04-14T21:02:48,034 copying ./csrc/batch_prefill_sm90_jit_binding.cu -> build/lib/flashinfer/data/csrc 2026-04-14T21:02:48,036 copying ./csrc/pod_kernel_inst.jinja -> build/lib/flashinfer/data/csrc 2026-04-14T21:02:48,038 copying ./csrc/tgv_gemm.cu -> build/lib/flashinfer/data/csrc 2026-04-14T21:02:48,041 copying ./csrc/batch_prefill_paged_sm90_kernel_inst.jinja -> build/lib/flashinfer/data/csrc 2026-04-14T21:02:48,043 copying ./csrc/flashinfer_cascade_binding.cu -> build/lib/flashinfer/data/csrc 2026-04-14T21:02:48,045 copying ./csrc/trtllm_alltoall_prepare.cu -> build/lib/flashinfer/data/csrc 2026-04-14T21:02:48,048 copying ./csrc/single_prefill_customize_config.jinja -> build/lib/flashinfer/data/csrc 2026-04-14T21:02:48,050 copying ./csrc/seq_chunk_cumsum.cu -> build/lib/flashinfer/data/csrc 2026-04-14T21:02:48,052 copying ./csrc/vllm_custom_all_reduce.cu -> build/lib/flashinfer/data/csrc 2026-04-14T21:02:48,054 copying ./csrc/batch_prefill_fp8_paged_sm90_kernel_inst.jinja -> build/lib/flashinfer/data/csrc 2026-04-14T21:02:48,056 copying ./csrc/batch_mla_sm90_run.cu -> build/lib/flashinfer/data/csrc 2026-04-14T21:02:48,058 copying ./csrc/flashinfer_gemm_binding.cu -> build/lib/flashinfer/data/csrc 2026-04-14T21:02:48,060 copying ./csrc/rope.cu -> build/lib/flashinfer/data/csrc 2026-04-14T21:02:48,063 copying ./csrc/batch_prefill_jit_binding.cu -> build/lib/flashinfer/data/csrc 2026-04-14T21:02:48,065 copying ./csrc/batch_attention_paged_kernel_inst.jinja -> build/lib/flashinfer/data/csrc 2026-04-14T21:02:48,067 copying ./csrc/gemm_sm120_binding.cu -> build/lib/flashinfer/data/csrc 2026-04-14T21:02:48,069 copying ./csrc/sampling_utils.h -> build/lib/flashinfer/data/csrc 2026-04-14T21:02:48,071 copying ./csrc/single_decode_customize_config.jinja -> build/lib/flashinfer/data/csrc 2026-04-14T21:02:48,073 copying ./csrc/group_gemm_sm100_binding.cu -> build/lib/flashinfer/data/csrc 2026-04-14T21:02:48,075 copying ./csrc/tvm_ffi_utils.h -> build/lib/flashinfer/data/csrc 2026-04-14T21:02:48,078 copying ./csrc/trtllm_low_latency_gemm_runner.cu -> build/lib/flashinfer/data/csrc 2026-04-14T21:02:48,080 copying ./csrc/selective_state_update_kernel_inst.cu -> build/lib/flashinfer/data/csrc 2026-04-14T21:02:48,083 copying ./csrc/batch_mla_run.cu -> build/lib/flashinfer/data/csrc 2026-04-14T21:02:48,085 copying ./csrc/batch_mla_sm90_plan.cu -> build/lib/flashinfer/data/csrc 2026-04-14T21:02:48,087 copying ./csrc/batch_pod_kernel_inst.jinja -> build/lib/flashinfer/data/csrc 2026-04-14T21:02:48,089 copying ./csrc/flashinfer_rope_binding.cu -> build/lib/flashinfer/data/csrc 2026-04-14T21:02:48,091 copying ./csrc/batch_prefill_fp8_ragged_sm90_kernel_inst.jinja -> build/lib/flashinfer/data/csrc 2026-04-14T21:02:48,093 copying ./csrc/gemm_sm100_binding.cu -> build/lib/flashinfer/data/csrc 2026-04-14T21:02:48,095 copying ./csrc/group_gemm_sm90.cu -> build/lib/flashinfer/data/csrc 2026-04-14T21:02:48,097 copying ./csrc/blackwell_fmha_plan.cu -> build/lib/flashinfer/data/csrc 2026-04-14T21:02:48,099 copying ./csrc/batch_decode_mla_config.jinja -> build/lib/flashinfer/data/csrc 2026-04-14T21:02:48,101 copying ./csrc/selective_state_update.cu -> build/lib/flashinfer/data/csrc 2026-04-14T21:02:48,104 copying ./csrc/fmhaReduction.cu -> build/lib/flashinfer/data/csrc 2026-04-14T21:02:48,107 copying ./csrc/batch_decode_mla_cute_sm80.cu -> build/lib/flashinfer/data/csrc 2026-04-14T21:02:48,109 copying ./csrc/rmsnorm_silu.cu -> build/lib/flashinfer/data/csrc 2026-04-14T21:02:48,111 creating build/lib/flashinfer/data/csrc/fused_moe 2026-04-14T21:02:48,112 copying ./csrc/fused_moe/noAuxTcKernels.cu -> build/lib/flashinfer/data/csrc/fused_moe 2026-04-14T21:02:48,115 creating build/lib/flashinfer/data/csrc/fused_moe/cutlass_backend 2026-04-14T21:02:48,116 copying ./csrc/fused_moe/cutlass_backend/flashinfer_cutlass_fused_moe_binding.cu -> build/lib/flashinfer/data/csrc/fused_moe/cutlass_backend 2026-04-14T21:02:48,120 copying ./csrc/fused_moe/cutlass_backend/deepgemm_jit_setup.cu -> build/lib/flashinfer/data/csrc/fused_moe/cutlass_backend 2026-04-14T21:02:48,122 copying ./csrc/fused_moe/cutlass_backend/cutlass_fused_moe_kernels.cuh -> build/lib/flashinfer/data/csrc/fused_moe/cutlass_backend 2026-04-14T21:02:48,129 copying ./csrc/fused_moe/cutlass_backend/cutlass_fused_moe_instantiation.cu -> build/lib/flashinfer/data/csrc/fused_moe/cutlass_backend 2026-04-14T21:02:48,131 creating build/lib/flashinfer/data/csrc/fused_moe/trtllm_backend 2026-04-14T21:02:48,132 copying ./csrc/fused_moe/trtllm_backend/trtllm_fused_moe_routing_llama4.cu -> build/lib/flashinfer/data/csrc/fused_moe/trtllm_backend 2026-04-14T21:02:48,136 copying ./csrc/fused_moe/trtllm_backend/trtllm_fused_moe_routing_deepseek.cu -> build/lib/flashinfer/data/csrc/fused_moe/trtllm_backend 2026-04-14T21:02:48,139 copying ./csrc/fused_moe/trtllm_backend/trtllm_fused_moe_dev_kernel.cu -> build/lib/flashinfer/data/csrc/fused_moe/trtllm_backend 2026-04-14T21:02:48,142 copying ./csrc/fused_moe/trtllm_backend/trtllm_fused_moe_routing_common.cu -> build/lib/flashinfer/data/csrc/fused_moe/trtllm_backend 2026-04-14T21:02:48,144 copying ./csrc/fused_moe/trtllm_backend/trtllm_fused_moe_routing_custom.cu -> build/lib/flashinfer/data/csrc/fused_moe/trtllm_backend 2026-04-14T21:02:48,147 copying ./csrc/fused_moe/moeTopKFuncs.cuh -> build/lib/flashinfer/data/csrc/fused_moe 2026-04-14T21:02:48,150 copying ./csrc/renorm.cu -> build/lib/flashinfer/data/csrc 2026-04-14T21:02:48,152 copying ./csrc/mxfp8_gemm_cutlass.cu -> build/lib/flashinfer/data/csrc 2026-04-14T21:02:48,155 copying ./csrc/cutlass_mla.cu -> build/lib/flashinfer/data/csrc 2026-04-14T21:02:48,157 copying ./csrc/page.cu -> build/lib/flashinfer/data/csrc 2026-04-14T21:02:48,159 copying ./csrc/single_prefill.cu -> build/lib/flashinfer/data/csrc 2026-04-14T21:02:48,162 copying ./csrc/fp8_blockscale_gemm_sm90_binding.cu -> build/lib/flashinfer/data/csrc 2026-04-14T21:02:48,164 copying ./csrc/batch_attention_jit_binding.cu -> build/lib/flashinfer/data/csrc 2026-04-14T21:02:48,166 copying ./csrc/runtime_utils.h -> build/lib/flashinfer/data/csrc 2026-04-14T21:02:48,168 copying ./csrc/single_prefill_fp8_sm90_kernel_inst.jinja -> build/lib/flashinfer/data/csrc 2026-04-14T21:02:48,170 copying ./csrc/group_gemm_sm120_binding.cu -> build/lib/flashinfer/data/csrc 2026-04-14T21:02:48,172 copying ./csrc/single_prefill_sm90_customize_config.jinja -> build/lib/flashinfer/data/csrc 2026-04-14T21:02:48,174 copying ./csrc/flashinfer_xqa_binding.cu -> build/lib/flashinfer/data/csrc 2026-04-14T21:02:48,176 copying ./csrc/flashinfer_quantization_binding.cu -> build/lib/flashinfer/data/csrc 2026-04-14T21:02:48,178 copying ./csrc/batch_pod_customize_config.jinja -> build/lib/flashinfer/data/csrc 2026-04-14T21:02:48,181 copying ./csrc/selective_state_update_dtype_inst.jinja -> build/lib/flashinfer/data/csrc 2026-04-14T21:02:48,183 copying ./csrc/trtllm_moe_alltoall.cu -> build/lib/flashinfer/data/csrc 2026-04-14T21:02:48,186 copying ./csrc/gdn_prefill_launcher.cu -> build/lib/flashinfer/data/csrc 2026-04-14T21:02:48,188 copying ./csrc/bf16_gemm_cutlass.cu -> build/lib/flashinfer/data/csrc 2026-04-14T21:02:48,190 copying ./csrc/batch_pod.cu -> build/lib/flashinfer/data/csrc 2026-04-14T21:02:48,193 copying ./csrc/group_gemm_mxfp4_groupwise_sm120.cu -> build/lib/flashinfer/data/csrc 2026-04-14T21:02:48,195 copying ./csrc/batch_prefill_ragged_sm90_kernel_inst.jinja -> build/lib/flashinfer/data/csrc 2026-04-14T21:02:48,197 copying ./csrc/moe_utils_binding.cu -> build/lib/flashinfer/data/csrc 2026-04-14T21:02:48,200 copying ./csrc/concat_mla.cu -> build/lib/flashinfer/data/csrc 2026-04-14T21:02:48,202 copying ./csrc/flashinfer_rmsnorm_silu_binding.cu -> build/lib/flashinfer/data/csrc 2026-04-14T21:02:48,204 copying ./csrc/batch_decode_kernel_inst.jinja -> build/lib/flashinfer/data/csrc 2026-04-14T21:02:48,206 copying ./csrc/group_gemm_sm90_kernel_inst.jinja -> build/lib/flashinfer/data/csrc 2026-04-14T21:02:48,208 copying ./csrc/group_gemm_nvfp4_groupwise_sm120_kernel_inst.jinja -> build/lib/flashinfer/data/csrc 2026-04-14T21:02:48,211 copying ./csrc/flashinfer_page_binding.cu -> build/lib/flashinfer/data/csrc 2026-04-14T21:02:48,213 copying ./csrc/batch_pod_jit_binding.cu -> build/lib/flashinfer/data/csrc 2026-04-14T21:02:48,215 copying ./csrc/group_gemm_mxfp4_groupwise_sm100.cu -> build/lib/flashinfer/data/csrc 2026-04-14T21:02:48,218 copying ./csrc/trtllm_fused_moe_kernel_launcher.cu -> build/lib/flashinfer/data/csrc 2026-04-14T21:02:48,222 copying ./csrc/flashinfer_topk_binding.cu -> build/lib/flashinfer/data/csrc 2026-04-14T21:02:48,224 copying ./csrc/mxfp8_gemm_cutlass_sm120.cu -> build/lib/flashinfer/data/csrc 2026-04-14T21:02:48,227 copying ./csrc/prefill_kernel_delta_rule_sm90.cu -> build/lib/flashinfer/data/csrc 2026-04-14T21:02:48,229 creating build/lib/flashinfer/data/csrc/xqa 2026-04-14T21:02:48,230 copying ./csrc/xqa/barriers.cuh -> build/lib/flashinfer/data/csrc/xqa 2026-04-14T21:02:48,233 copying ./csrc/xqa/utils.h -> build/lib/flashinfer/data/csrc/xqa 2026-04-14T21:02:48,235 copying ./csrc/xqa/tensorMap.h -> build/lib/flashinfer/data/csrc/xqa 2026-04-14T21:02:48,237 copying ./csrc/xqa/mla_sm120.cu -> build/lib/flashinfer/data/csrc/xqa 2026-04-14T21:02:48,240 copying ./csrc/xqa/defines.h -> build/lib/flashinfer/data/csrc/xqa 2026-04-14T21:02:48,243 copying ./csrc/xqa/platform.h -> build/lib/flashinfer/data/csrc/xqa 2026-04-14T21:02:48,245 copying ./csrc/xqa/utils.cuh -> build/lib/flashinfer/data/csrc/xqa 2026-04-14T21:02:48,248 copying ./csrc/xqa/mhaUtils.cuh -> build/lib/flashinfer/data/csrc/xqa 2026-04-14T21:02:48,251 copying ./csrc/xqa/gmma.cuh -> build/lib/flashinfer/data/csrc/xqa 2026-04-14T21:02:48,254 copying ./csrc/xqa/mha.h -> build/lib/flashinfer/data/csrc/xqa 2026-04-14T21:02:48,257 copying ./csrc/xqa/mha_components.cuh -> build/lib/flashinfer/data/csrc/xqa 2026-04-14T21:02:48,259 copying ./csrc/xqa/ldgsts.cuh -> build/lib/flashinfer/data/csrc/xqa 2026-04-14T21:02:48,262 copying ./csrc/xqa/mha.cu -> build/lib/flashinfer/data/csrc/xqa 2026-04-14T21:02:48,266 copying ./csrc/xqa/tensorMap.cpp -> build/lib/flashinfer/data/csrc/xqa 2026-04-14T21:02:48,268 copying ./csrc/xqa/mha_stdheaders.cuh -> build/lib/flashinfer/data/csrc/xqa 2026-04-14T21:02:48,271 copying ./csrc/xqa/mha_sm90.cu -> build/lib/flashinfer/data/csrc/xqa 2026-04-14T21:02:48,276 copying ./csrc/xqa/mla_sm120.cuh -> build/lib/flashinfer/data/csrc/xqa 2026-04-14T21:02:48,278 copying ./csrc/xqa/mma.cuh -> build/lib/flashinfer/data/csrc/xqa 2026-04-14T21:02:48,280 copying ./csrc/xqa/xqa_wrapper.cu -> build/lib/flashinfer/data/csrc/xqa 2026-04-14T21:02:48,282 copying ./csrc/xqa/tma.h -> build/lib/flashinfer/data/csrc/xqa 2026-04-14T21:02:48,285 copying ./csrc/xqa/hostUtils.h -> build/lib/flashinfer/data/csrc/xqa 2026-04-14T21:02:48,287 copying ./csrc/xqa/cuda_hint.cuh -> build/lib/flashinfer/data/csrc/xqa 2026-04-14T21:02:48,289 copying ./csrc/xqa/specDec.h -> build/lib/flashinfer/data/csrc/xqa 2026-04-14T21:02:48,291 copying ./csrc/xqa/gmma_impl.cuh -> build/lib/flashinfer/data/csrc/xqa 2026-04-14T21:02:48,299 copying ./csrc/bmm_fp8.cu -> build/lib/flashinfer/data/csrc 2026-04-14T21:02:48,301 copying ./csrc/batch_decode_mla_plan.cu -> build/lib/flashinfer/data/csrc 2026-04-14T21:02:48,303 copying ./csrc/batch_mla_config.jinja -> build/lib/flashinfer/data/csrc 2026-04-14T21:02:48,305 copying ./csrc/pod.cu -> build/lib/flashinfer/data/csrc 2026-04-14T21:02:48,308 copying ./csrc/trtllm_fused_moe_runner.cu -> build/lib/flashinfer/data/csrc 2026-04-14T21:02:48,310 copying ./csrc/batch_mla_sm90_binding.cu -> build/lib/flashinfer/data/csrc 2026-04-14T21:02:48,312 copying ./csrc/batch_prefill_customize_config.jinja -> build/lib/flashinfer/data/csrc 2026-04-14T21:02:48,315 copying ./csrc/batch_decode_customize_config.jinja -> build/lib/flashinfer/data/csrc 2026-04-14T21:02:48,317 creating build/lib/flashinfer/data/csrc/nv_internal/cpp/common 2026-04-14T21:02:48,319 copying ./csrc/nv_internal/cpp/common/memoryUtils.cu -> build/lib/flashinfer/data/csrc/nv_internal/cpp/common 2026-04-14T21:02:48,322 copying ./csrc/nv_internal/cpp/common/envUtils.cpp -> build/lib/flashinfer/data/csrc/nv_internal/cpp/common 2026-04-14T21:02:48,325 copying ./csrc/nv_internal/cpp/common/tllmException.cpp -> build/lib/flashinfer/data/csrc/nv_internal/cpp/common 2026-04-14T21:02:48,327 copying ./csrc/nv_internal/cpp/common/logger.cpp -> build/lib/flashinfer/data/csrc/nv_internal/cpp/common 2026-04-14T21:02:48,329 copying ./csrc/nv_internal/cpp/common/stringUtils.cpp -> build/lib/flashinfer/data/csrc/nv_internal/cpp/common 2026-04-14T21:02:48,331 creating build/lib/flashinfer/data/csrc/nv_internal/cpp/kernels 2026-04-14T21:02:48,332 copying ./csrc/nv_internal/cpp/kernels/quantization.cu -> build/lib/flashinfer/data/csrc/nv_internal/cpp/kernels 2026-04-14T21:02:48,336 creating build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions 2026-04-14T21:02:48,338 copying ./csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/interleaved_numeric_conversion.h -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions 2026-04-14T21:02:48,341 creating build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/epilogue/thread 2026-04-14T21:02:48,342 copying ./csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/epilogue/thread/fused_activations.h -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/epilogue/thread 2026-04-14T21:02:48,345 creating build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/epilogue/collective 2026-04-14T21:02:48,346 copying ./csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/epilogue/collective/epilogue_moe_finalize.hpp -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/epilogue/collective 2026-04-14T21:02:48,349 creating build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/epilogue/fusion 2026-04-14T21:02:48,350 copying ./csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/epilogue/fusion/sm90_visitor_allreduce_tma_warpspecialized.hpp -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/epilogue/fusion 2026-04-14T21:02:48,353 copying ./csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/epilogue/fusion/sm90_visitor_scatter.hpp -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/epilogue/fusion 2026-04-14T21:02:48,356 copying ./csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/tile_interleaved_layout.h -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions 2026-04-14T21:02:48,358 creating build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/arch 2026-04-14T21:02:48,359 copying ./csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/arch/copy_traits_sm90_multimem.hpp -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/arch 2026-04-14T21:02:48,361 copying ./csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/arch/mma.h -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/arch 2026-04-14T21:02:48,363 copying ./csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/arch/copy_red_global.hpp -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/arch 2026-04-14T21:02:48,366 copying ./csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/arch/grid_dependency_control.h -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/arch 2026-04-14T21:02:48,368 copying ./csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/arch/copy_sm90_multimem.hpp -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/arch 2026-04-14T21:02:48,370 copying ./csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/system_barrier.h -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions 2026-04-14T21:02:48,373 copying ./csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm_configs.h -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions 2026-04-14T21:02:48,376 creating build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/communication/collective 2026-04-14T21:02:48,377 copying ./csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/communication/collective/sm90_allreduce_nvls_warpspecialized.hpp -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/communication/collective 2026-04-14T21:02:48,380 copying ./csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/compute_occupancy.h -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions 2026-04-14T21:02:48,382 creating build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/threadblock 2026-04-14T21:02:48,384 copying ./csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/threadblock/default_dq_mma_pipelined.h -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/threadblock 2026-04-14T21:02:48,386 copying ./csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/threadblock/default_mma_bf16.h -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/threadblock 2026-04-14T21:02:48,389 copying ./csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/threadblock/dq_mma_multistage.h -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/threadblock 2026-04-14T21:02:48,391 copying ./csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/threadblock/dq_mma_pipelined_percol.h -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/threadblock 2026-04-14T21:02:48,394 copying ./csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/threadblock/dq_mma_base.h -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/threadblock 2026-04-14T21:02:48,397 copying ./csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/threadblock/dq_mma_multistage_percol.h -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/threadblock 2026-04-14T21:02:48,399 copying ./csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/threadblock/dq_mma_pipelined_finegrained.h -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/threadblock 2026-04-14T21:02:48,402 copying ./csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/threadblock/dq_mma_multistage_finegrained.h -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/threadblock 2026-04-14T21:02:48,405 copying ./csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/threadblock/dq_mma_pipelined.h -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/threadblock 2026-04-14T21:02:48,407 copying ./csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/threadblock/default_dq_mma.h -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/threadblock 2026-04-14T21:02:48,410 copying ./csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/threadblock/default_dq_mma_multistage.h -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/threadblock 2026-04-14T21:02:48,413 copying ./csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/threadblock/default_mma.h -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/threadblock 2026-04-14T21:02:48,416 creating build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/warp 2026-04-14T21:02:48,417 copying ./csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/warp/default_mma_tensor_op.h -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/warp 2026-04-14T21:02:48,419 copying ./csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/warp/mma_tensorop_dequantizer.h -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/warp 2026-04-14T21:02:48,422 copying ./csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/warp/mma_tensorop_compute_B_with_f16.h -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/warp 2026-04-14T21:02:48,424 creating build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective 2026-04-14T21:02:48,425 copying ./csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective/sm90_mma_gated_tma_gmma_ss_warpspecialized_fp8.hpp -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective 2026-04-14T21:02:48,429 copying ./csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective/sm90_mma_gated_tma_gmma_ss_warpspecialized.hpp -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective 2026-04-14T21:02:48,432 copying ./csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective/sm90_mma_array_tma_gmma_rs_warpspecialized_mixed_input_.hpp -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective 2026-04-14T21:02:48,436 copying ./csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective/collective_builder_gated.hpp -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective 2026-04-14T21:02:48,438 copying ./csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective/collective_builder_mixed_input.hpp -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective 2026-04-14T21:02:48,440 creating build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective/builders 2026-04-14T21:02:48,441 copying ./csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective/builders/sm90_gmma_builder_interleaved.inl -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective/builders 2026-04-14T21:02:48,444 copying ./csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective/builders/sm90_gmma_builder_mixed_input.inl -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective/builders 2026-04-14T21:02:48,447 copying ./csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective/builders/sm90_gmma_builder_gated.inl -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective/builders 2026-04-14T21:02:48,450 copying ./csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective/sm90_mma_interleaved_tma_gmma_rs_warpspecialized_mixed_input.hpp -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective 2026-04-14T21:02:48,454 copying ./csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective/collective_builder_interleaved.hpp -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective 2026-04-14T21:02:48,456 copying ./csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective/collective_mma_interleaved.hpp -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective 2026-04-14T21:02:48,459 copying ./csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective/collective_mma_array_mixed_input.hpp -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective 2026-04-14T21:02:48,461 copying ./csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective/collective_mma_gated.hpp -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective 2026-04-14T21:02:48,463 creating build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/kernel 2026-04-14T21:02:48,464 copying ./csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/kernel/sm90_gemm_allreduce_tma_warpspecialized.hpp -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/kernel 2026-04-14T21:02:48,467 copying ./csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/kernel/sm90_gemm_allreduce_tma_warpspecialized_pingpong.hpp -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/kernel 2026-04-14T21:02:48,470 copying ./csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/kernel/moe_cutlass_kernel.h -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/kernel 2026-04-14T21:02:48,473 copying ./csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/kernel/default_fpA_intB_traits.h -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/kernel 2026-04-14T21:02:48,476 copying ./csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/kernel/moe_cute_util.cuh -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/kernel 2026-04-14T21:02:48,478 copying ./csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/kernel/mixed_gemm_B_layout.h -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/kernel 2026-04-14T21:02:48,481 copying ./csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/kernel/fused_moe_kernel_traits.cuh -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/kernel 2026-04-14T21:02:48,483 copying ./csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/kernel/gemm_moe_problem_visitor.h -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/kernel 2026-04-14T21:02:48,485 copying ./csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/kernel/fused_moe_kernel_routine.cuh -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/kernel 2026-04-14T21:02:48,488 copying ./csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/kernel/gemm_universal_allreduce.hpp -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/kernel 2026-04-14T21:02:48,491 copying ./csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/kernel/fused_moe_kernel.cuh -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/kernel 2026-04-14T21:02:48,493 copying ./csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/kernel/moe_problem_visitor.h -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/kernel 2026-04-14T21:02:48,496 creating build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/transform/threadblock 2026-04-14T21:02:48,498 copying ./csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/transform/threadblock/fine_grained_scale_zero_iterator.h -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/transform/threadblock 2026-04-14T21:02:48,500 creating build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/detail/collective 2026-04-14T21:02:48,502 copying ./csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/detail/collective/mixed_input_utils.hpp -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/detail/collective 2026-04-14T21:02:48,504 copying ./csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/weight_only_quant_op.h -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions 2026-04-14T21:02:48,507 copying ./csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/epilogue_helpers.h -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions 2026-04-14T21:02:48,509 creating build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/util 2026-04-14T21:02:48,510 copying ./csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/util/gather_tensor.hpp -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/util 2026-04-14T21:02:48,513 creating build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/common 2026-04-14T21:02:48,514 copying ./csrc/nv_internal/tensorrt_llm/common/cudaDriverWrapper.h -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/common 2026-04-14T21:02:48,517 copying ./csrc/nv_internal/tensorrt_llm/common/envUtils.h -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/common 2026-04-14T21:02:48,519 copying ./csrc/nv_internal/tensorrt_llm/common/cudaBf16Fallbacks.cuh -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/common 2026-04-14T21:02:48,521 copying ./csrc/nv_internal/tensorrt_llm/common/reduceKernelUtils.cuh -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/common 2026-04-14T21:02:48,524 copying ./csrc/nv_internal/tensorrt_llm/common/cublasMMWrapper.h -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/common 2026-04-14T21:02:48,527 copying ./csrc/nv_internal/tensorrt_llm/common/quantTypeUtils.cuh -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/common 2026-04-14T21:02:48,529 copying ./csrc/nv_internal/tensorrt_llm/common/memoryUtils.h -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/common 2026-04-14T21:02:48,531 copying ./csrc/nv_internal/tensorrt_llm/common/workspace.h -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/common 2026-04-14T21:02:48,534 copying ./csrc/nv_internal/tensorrt_llm/common/cudaTypeUtils.cuh -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/common 2026-04-14T21:02:48,537 creating build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cuteDslKernels 2026-04-14T21:02:48,539 copying ./csrc/nv_internal/tensorrt_llm/kernels/cuteDslKernels/moeUtils.cu -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cuteDslKernels 2026-04-14T21:02:48,542 copying ./csrc/nv_internal/tensorrt_llm/kernels/cuteDslKernels/moeUtils.h -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cuteDslKernels 2026-04-14T21:02:48,545 creating build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/lora 2026-04-14T21:02:48,546 copying ./csrc/nv_internal/tensorrt_llm/kernels/lora/lora.h -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/lora 2026-04-14T21:02:48,548 copying ./csrc/nv_internal/tensorrt_llm/kernels/lora/lora.cpp -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/lora 2026-04-14T21:02:48,551 copying ./csrc/nv_internal/tensorrt_llm/kernels/preQuantScaleKernel.cu -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels 2026-04-14T21:02:48,553 copying ./csrc/nv_internal/tensorrt_llm/kernels/delayStream.h -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels 2026-04-14T21:02:48,555 copying ./csrc/nv_internal/tensorrt_llm/kernels/quantization.cuh -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels 2026-04-14T21:02:48,558 copying ./csrc/nv_internal/tensorrt_llm/kernels/quantization.h -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels 2026-04-14T21:02:48,560 creating build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fp8_blockscale_gemm 2026-04-14T21:02:48,562 copying ./csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fp8_blockscale_gemm/fp8_blockscale_mma_utils.cuh -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fp8_blockscale_gemm 2026-04-14T21:02:48,565 creating build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fp8_blockscale_gemm/ada_blockwise_gemm 2026-04-14T21:02:48,566 copying ./csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fp8_blockscale_gemm/ada_blockwise_gemm/sm89_fp8_gemm_1d1d.cuh -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fp8_blockscale_gemm/ada_blockwise_gemm 2026-04-14T21:02:48,569 copying ./csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fp8_blockscale_gemm/ada_blockwise_gemm/sm89_utils.cuh -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fp8_blockscale_gemm/ada_blockwise_gemm 2026-04-14T21:02:48,572 copying ./csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fp8_blockscale_gemm/fp8_blockscale_gemm.h -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fp8_blockscale_gemm 2026-04-14T21:02:48,574 copying ./csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fp8_blockscale_gemm/fp8_blockscale_gemm_kernel.cuh -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fp8_blockscale_gemm 2026-04-14T21:02:48,578 copying ./csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fp8_blockscale_gemm/fp8_blockscale_tma_utils.cuh -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fp8_blockscale_gemm 2026-04-14T21:02:48,581 copying ./csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fp8_blockscale_gemm/fp8_blockscale_gemm.cu -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fp8_blockscale_gemm 2026-04-14T21:02:48,584 copying ./csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/cutlass_type_conversion.h -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels 2026-04-14T21:02:48,586 copying ./csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/cutlass_heuristic.h -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels 2026-04-14T21:02:48,589 creating build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm 2026-04-14T21:02:48,590 copying ./csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/fp16_int4_gemm_per_col.cu -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm 2026-04-14T21:02:48,592 copying ./csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/e4m3_int4_gemm_fg_scalebias_f16_out_f16.cu -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm 2026-04-14T21:02:48,595 copying ./csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/fpA_intB_gemm_template.h -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm 2026-04-14T21:02:48,598 copying ./csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/fp16_int8_gemm_fg_scaleonly.cu -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm 2026-04-14T21:02:48,600 copying ./csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/bf16_int4_gemm_fg_scalebias.cu -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm 2026-04-14T21:02:48,602 copying ./csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/e4m3_int4_gemm_fg_scaleonly_bf16_out_bf16.cu -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm 2026-04-14T21:02:48,604 copying ./csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/fp16_int4_gemm_fg_scaleonly.cu -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm 2026-04-14T21:02:48,607 copying ./csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/e4m3_int4_gemm_fg_scalebias_bf16_out_bf16.cu -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm 2026-04-14T21:02:48,609 copying ./csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/fp16_int8_gemm_fg_scalebias.cu -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm 2026-04-14T21:02:48,611 copying ./csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/bf16_int8_gemm_fg_scaleonly.cu -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm 2026-04-14T21:02:48,613 copying ./csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/bf16_int4_gemm_fg_scaleonly.cu -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm 2026-04-14T21:02:48,615 copying ./csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/fp16_int4_gemm_fg_scalebias.cu -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm 2026-04-14T21:02:48,618 copying ./csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/bf16_int4_gemm_per_col.cu -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm 2026-04-14T21:02:48,620 copying ./csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/bf16_int8_gemm_fg_scalebias.cu -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm 2026-04-14T21:02:48,623 copying ./csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/fp16_int8_gemm_per_col.cu -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm 2026-04-14T21:02:48,626 copying ./csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/fpA_intB_gemm_template_sm90.h -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm 2026-04-14T21:02:48,629 copying ./csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/fpA_intB_gemm.h -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm 2026-04-14T21:02:48,633 copying ./csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/e4m3_int4_gemm_per_col_f16_out_f16.cu -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm 2026-04-14T21:02:48,636 copying ./csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/e4m3_int4_gemm_fg_scaleonly_f16_out_f16.cu -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm 2026-04-14T21:02:48,639 copying ./csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/bf16_int8_gemm_per_col.cu -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm 2026-04-14T21:02:48,641 creating build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/launchers 2026-04-14T21:02:48,643 copying ./csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/launchers/fpA_intB_launcher_sm90.inl -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/launchers 2026-04-14T21:02:48,646 copying ./csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/launchers/fpA_intB_launcher_sm90.h -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/launchers 2026-04-14T21:02:48,649 copying ./csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/cutlass_heuristic.cpp -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels 2026-04-14T21:02:48,653 creating build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/include 2026-04-14T21:02:48,654 copying ./csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/include/moe_kernels.h -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/include 2026-04-14T21:02:48,658 copying ./csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/include/moe_gemm_kernels.h -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/include 2026-04-14T21:02:48,662 copying ./csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/include/common.h -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/include 2026-04-14T21:02:48,664 copying ./csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/include/cutlass_kernel_selector.h -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/include 2026-04-14T21:02:48,667 copying ./csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/include/moe_util_kernels.h -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/include 2026-04-14T21:02:48,669 creating build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm 2026-04-14T21:02:48,671 copying ./csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/moe_gemm_template_dispatch_tma_ws.h -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm 2026-04-14T21:02:48,675 copying ./csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/moe_gemm_template_dispatch.h -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm 2026-04-14T21:02:48,679 copying ./csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/moe_gemm_kernels_fp16_uint4.cu -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm 2026-04-14T21:02:48,681 copying ./csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/moe_gemm_kernels_bf16_bf16.cu -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm 2026-04-14T21:02:48,684 copying ./csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/moe_gemm_kernels_bf16_fp8.cu -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm 2026-04-14T21:02:48,687 copying ./csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/moe_gemm_kernels_fp16_uint8.cu -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm 2026-04-14T21:02:48,690 copying ./csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/moe_gemm_kernels_fp8_fp8.cu -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm 2026-04-14T21:02:48,693 copying ./csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/moe_gemm_template_dispatch_tma_ws_mixed_dtype.h -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm 2026-04-14T21:02:48,697 copying ./csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/moe_kernels.cuh -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm 2026-04-14T21:02:48,700 copying ./csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/moe_gemm_kernels_fp8_fp4.cu -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm 2026-04-14T21:02:48,703 copying ./csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/moe_gemm_kernels_fp16_fp4.cu -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm 2026-04-14T21:02:48,706 copying ./csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/moe_gemm_kernels_fp8_uint4.cu -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm 2026-04-14T21:02:48,710 copying ./csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/moe_tma_warp_specialized_traits.h -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm 2026-04-14T21:02:48,713 copying ./csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/moe_gemm_tma_warp_specialized_input.cu -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm 2026-04-14T21:02:48,717 copying ./csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/moe_gemm_kernels_fp32_fp32.cu -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm 2026-04-14T21:02:48,720 copying ./csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/moe_gemm_kernels_fp16_fp16.cu -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm 2026-04-14T21:02:48,723 copying ./csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/moe_gemm_kernels_bf16_uint8.cu -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm 2026-04-14T21:02:48,726 copying ./csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/moe_gemm_kernels_bf16_uint4.cu -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm 2026-04-14T21:02:48,729 copying ./csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/moe_gemm_kernels_bf16_fp4.cu -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm 2026-04-14T21:02:48,732 copying ./csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/moe_gemm_kernels_fp4_fp4.cu -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm 2026-04-14T21:02:48,736 creating build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/launchers 2026-04-14T21:02:48,737 copying ./csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/launchers/fused_moe_gemm_launcher_sm80.h -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/launchers 2026-04-14T21:02:48,741 copying ./csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/launchers/fused_moe_gemm_launcher_sm80.inl -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/launchers 2026-04-14T21:02:48,745 copying ./csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/launchers/moe_gemm_tma_ws_launcher.h -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/launchers 2026-04-14T21:02:48,748 copying ./csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/launchers/moe_gemm_tma_ws_mixed_input_launcher.inl -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/launchers 2026-04-14T21:02:48,752 copying ./csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/launchers/moe_gemm_tma_ws_launcher.inl -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/launchers 2026-04-14T21:02:48,761 copying ./csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/launchers/moe_gemm_tma_ws_mixed_input_launcher.h -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/launchers 2026-04-14T21:02:48,765 creating build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/communicationKernels 2026-04-14T21:02:48,766 copying ./csrc/nv_internal/tensorrt_llm/kernels/communicationKernels/moeAlltoAllKernels.h -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/communicationKernels 2026-04-14T21:02:48,770 copying ./csrc/nv_internal/tensorrt_llm/kernels/communicationKernels/moeAlltoAllKernels.cu -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/communicationKernels 2026-04-14T21:02:48,775 copying ./csrc/nv_internal/tensorrt_llm/kernels/preQuantScaleKernel.h -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels 2026-04-14T21:02:48,778 copying ./csrc/nv_internal/tensorrt_llm/kernels/delayStream.cu -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels 2026-04-14T21:02:48,781 copying ./csrc/nv_internal/tensorrt_llm/kernels/quantization_utils.cuh -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels 2026-04-14T21:02:48,785 creating build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/deep_gemm 2026-04-14T21:02:48,787 copying ./csrc/nv_internal/tensorrt_llm/deep_gemm/nvrtc_cutlass.cuh -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/deep_gemm 2026-04-14T21:02:48,794 copying ./csrc/nv_internal/tensorrt_llm/deep_gemm/runtime.cuh -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/deep_gemm 2026-04-14T21:02:48,797 copying ./csrc/nv_internal/tensorrt_llm/deep_gemm/tma_utils.cuh -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/deep_gemm 2026-04-14T21:02:48,800 copying ./csrc/nv_internal/tensorrt_llm/deep_gemm/fp8_gemm.cuh -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/deep_gemm 2026-04-14T21:02:48,805 copying ./csrc/nv_internal/tensorrt_llm/deep_gemm/utils.cuh -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/deep_gemm 2026-04-14T21:02:48,808 copying ./csrc/nv_internal/tensorrt_llm/deep_gemm/nvrtc_std.cuh -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/deep_gemm 2026-04-14T21:02:48,811 copying ./csrc/nv_internal/tensorrt_llm/deep_gemm/jit_utils.cuh -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/deep_gemm 2026-04-14T21:02:48,814 copying ./csrc/nv_internal/tensorrt_llm/deep_gemm/mma_utils.cuh -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/deep_gemm 2026-04-14T21:02:48,819 copying ./csrc/nv_internal/tensorrt_llm/deep_gemm/scheduler.cuh -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/deep_gemm 2026-04-14T21:02:48,822 copying ./csrc/nv_internal/tensorrt_llm/deep_gemm/fp8_gemm_impl.cuh -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/deep_gemm 2026-04-14T21:02:48,826 copying ./csrc/nv_internal/tensorrt_llm/deep_gemm/compiler.cuh -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/deep_gemm 2026-04-14T21:02:48,829 creating build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/thop 2026-04-14T21:02:48,830 copying ./csrc/nv_internal/tensorrt_llm/thop/utils.h -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/thop 2026-04-14T21:02:48,833 copying ./csrc/nv_internal/tensorrt_llm/thop/fp8Quantize.h -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/thop 2026-04-14T21:02:48,835 copying ./csrc/nv_internal/tensorrt_llm/thop/fp8Quantize.cpp -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/thop 2026-04-14T21:02:48,838 copying ./csrc/nv_internal/tensorrt_llm/thop/fp4Quantize.h -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/thop 2026-04-14T21:02:48,840 copying ./csrc/nv_internal/tensorrt_llm/thop/fp4Op.cpp -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/thop 2026-04-14T21:02:48,843 copying ./csrc/nv_internal/tensorrt_llm/thop/fp4Quantize.cpp -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/thop 2026-04-14T21:02:48,846 copying ./csrc/nv_internal/tensorrt_llm/thop/moeAlltoAllMeta.h -> build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/thop 2026-04-14T21:02:48,848 creating build/lib/flashinfer/data/csrc/nv_internal/include/tensorrt_llm/common 2026-04-14T21:02:48,850 copying ./csrc/nv_internal/include/tensorrt_llm/common/cudaUtils.h -> build/lib/flashinfer/data/csrc/nv_internal/include/tensorrt_llm/common 2026-04-14T21:02:48,854 copying ./csrc/nv_internal/include/tensorrt_llm/common/config.h -> build/lib/flashinfer/data/csrc/nv_internal/include/tensorrt_llm/common 2026-04-14T21:02:48,856 copying ./csrc/nv_internal/include/tensorrt_llm/common/logger.h -> build/lib/flashinfer/data/csrc/nv_internal/include/tensorrt_llm/common 2026-04-14T21:02:48,858 copying ./csrc/nv_internal/include/tensorrt_llm/common/quantization.h -> build/lib/flashinfer/data/csrc/nv_internal/include/tensorrt_llm/common 2026-04-14T21:02:48,861 copying ./csrc/nv_internal/include/tensorrt_llm/common/cudaBf16Wrapper.h -> build/lib/flashinfer/data/csrc/nv_internal/include/tensorrt_llm/common 2026-04-14T21:02:48,863 copying ./csrc/nv_internal/include/tensorrt_llm/common/NvInferRuntime.h -> build/lib/flashinfer/data/csrc/nv_internal/include/tensorrt_llm/common 2026-04-14T21:02:48,865 copying ./csrc/nv_internal/include/tensorrt_llm/common/cudaFp8Utils.h -> build/lib/flashinfer/data/csrc/nv_internal/include/tensorrt_llm/common 2026-04-14T21:02:48,867 copying ./csrc/nv_internal/include/tensorrt_llm/common/assert.h -> build/lib/flashinfer/data/csrc/nv_internal/include/tensorrt_llm/common 2026-04-14T21:02:48,869 copying ./csrc/nv_internal/include/tensorrt_llm/common/stringUtils.h -> build/lib/flashinfer/data/csrc/nv_internal/include/tensorrt_llm/common 2026-04-14T21:02:48,872 copying ./csrc/nv_internal/include/tensorrt_llm/common/dataType.h -> build/lib/flashinfer/data/csrc/nv_internal/include/tensorrt_llm/common 2026-04-14T21:02:48,874 copying ./csrc/nv_internal/include/tensorrt_llm/common/tllmException.h -> build/lib/flashinfer/data/csrc/nv_internal/include/tensorrt_llm/common 2026-04-14T21:02:48,876 copying ./csrc/cascade.cu -> build/lib/flashinfer/data/csrc 2026-04-14T21:02:48,878 copying ./csrc/single_prefill_fp8_sm90.cu -> build/lib/flashinfer/data/csrc 2026-04-14T21:02:48,880 copying ./csrc/topk.cu -> build/lib/flashinfer/data/csrc 2026-04-14T21:02:48,883 copying ./csrc/single_decode_kernel_inst.jinja -> build/lib/flashinfer/data/csrc 2026-04-14T21:02:48,885 copying ./csrc/single_prefill_jit_binding.cu -> build/lib/flashinfer/data/csrc 2026-04-14T21:02:48,887 copying ./csrc/group_gemm_fp8_groupwise_sm120.cu -> build/lib/flashinfer/data/csrc 2026-04-14T21:02:48,889 copying ./csrc/fp8_gemm_cutlass.jinja -> build/lib/flashinfer/data/csrc 2026-04-14T21:02:48,891 copying ./csrc/batch_mla_binding.cu -> build/lib/flashinfer/data/csrc 2026-04-14T21:02:48,893 copying ./csrc/bf16_gemm_cutlass.jinja -> build/lib/flashinfer/data/csrc 2026-04-14T21:02:48,895 copying ./csrc/gdn_prefill_sm90_kernel_inst.jinja -> build/lib/flashinfer/data/csrc 2026-04-14T21:02:48,897 copying ./csrc/trtllm_moe_allreduce_fusion.cu -> build/lib/flashinfer/data/csrc 2026-04-14T21:02:48,899 copying ./csrc/batch_prefill_sm90.cu -> build/lib/flashinfer/data/csrc 2026-04-14T21:02:48,902 copying ./csrc/seq_chunk_cumsum_jit_binding.cu -> build/lib/flashinfer/data/csrc 2026-04-14T21:02:48,904 copying ./csrc/dsv3_router_gemm.cu -> build/lib/flashinfer/data/csrc 2026-04-14T21:02:48,906 copying ./csrc/flat_prefill_kernel_delta_rule_sm90_extern.inc -> build/lib/flashinfer/data/csrc 2026-04-14T21:02:48,908 copying ./csrc/fp4_gemm_cutlass_sm103.cu -> build/lib/flashinfer/data/csrc 2026-04-14T21:02:48,911 copying ./csrc/mxfp8_gemm_cutlass.jinja -> build/lib/flashinfer/data/csrc 2026-04-14T21:02:48,912 copying ./csrc/single_decode.cu -> build/lib/flashinfer/data/csrc 2026-04-14T21:02:48,915 copying ./csrc/flashinfer_sampling_binding.cu -> build/lib/flashinfer/data/csrc 2026-04-14T21:02:48,917 copying ./csrc/group_gemm_fp8_groupwise_sm100_kernel_inst.jinja -> build/lib/flashinfer/data/csrc 2026-04-14T21:02:48,919 copying ./csrc/sampling.cu -> build/lib/flashinfer/data/csrc 2026-04-14T21:02:48,922 copying ./csrc/group_gemm_mxfp4_groupwise_sm100_kernel_inst.jinja -> build/lib/flashinfer/data/csrc 2026-04-14T21:02:48,924 copying ./csrc/batch_prefill.cu -> build/lib/flashinfer/data/csrc 2026-04-14T21:02:48,926 copying ./csrc/fp4_gemm_cutlass_sm103.jinja -> build/lib/flashinfer/data/csrc 2026-04-14T21:02:48,928 copying ./csrc/group_gemm_nvfp4_groupwise_sm120.cu -> build/lib/flashinfer/data/csrc 2026-04-14T21:02:48,930 copying ./csrc/flashinfer_gemm_sm90_binding.cu -> build/lib/flashinfer/data/csrc 2026-04-14T21:02:48,932 copying ./csrc/batch_decode.cu -> build/lib/flashinfer/data/csrc 2026-04-14T21:02:48,935 copying ./csrc/batch_prefill_ragged_kernel_inst.jinja -> build/lib/flashinfer/data/csrc 2026-04-14T21:02:48,937 copying ./csrc/trtllm_allreduce.cu -> build/lib/flashinfer/data/csrc 2026-04-14T21:02:48,939 copying ./csrc/batch_attention_customize_config.jinja -> build/lib/flashinfer/data/csrc 2026-04-14T21:02:48,941 copying ./csrc/gemm_groupwise_sm120.cu -> build/lib/flashinfer/data/csrc 2026-04-14T21:02:48,943 copying ./csrc/flashinfer_mla_binding.cu -> build/lib/flashinfer/data/csrc 2026-04-14T21:02:48,945 copying ./csrc/fp4_gemm_cutlass_sm120.cu -> build/lib/flashinfer/data/csrc 2026-04-14T21:02:48,948 copying ./csrc/trtllm_gemm_runner.cu -> build/lib/flashinfer/data/csrc 2026-04-14T21:02:48,950 copying ./csrc/single_prefill_sm90_jit_binding.cu -> build/lib/flashinfer/data/csrc 2026-04-14T21:02:48,952 copying ./csrc/flashinfer_norm_binding.cu -> build/lib/flashinfer/data/csrc 2026-04-14T21:02:48,954 copying ./csrc/fp4_kv_dequantization.cu -> build/lib/flashinfer/data/csrc 2026-04-14T21:02:48,956 copying ./csrc/tinygemm2.cu -> build/lib/flashinfer/data/csrc 2026-04-14T21:02:48,959 copying ./csrc/gemm_groupwise_sm120_kernel_inst.jinja -> build/lib/flashinfer/data/csrc 2026-04-14T21:02:48,961 copying ./csrc/fp8_gemm_cutlass.cu -> build/lib/flashinfer/data/csrc 2026-04-14T21:02:48,963 copying ./csrc/group_gemm_mxfp4_groupwise_sm120_kernel_inst.jinja -> build/lib/flashinfer/data/csrc 2026-04-14T21:02:48,965 copying ./csrc/logging.cc -> build/lib/flashinfer/data/csrc 2026-04-14T21:02:48,967 copying ./csrc/group_gemm.cu -> build/lib/flashinfer/data/csrc 2026-04-14T21:02:48,969 copying ./csrc/trtllm_fmha_v2_binding.cu -> build/lib/flashinfer/data/csrc 2026-04-14T21:02:48,972 copying ./csrc/batch_attention.cu -> build/lib/flashinfer/data/csrc 2026-04-14T21:02:48,974 copying ./csrc/trtllm_alltoall.cu -> build/lib/flashinfer/data/csrc 2026-04-14T21:02:48,977 copying ./csrc/gemm_groupwise_sm100_kernel_inst.jinja -> build/lib/flashinfer/data/csrc 2026-04-14T21:02:48,979 copying ./csrc/fmha_v2_jit_binding.cu -> build/lib/flashinfer/data/csrc 2026-04-14T21:02:48,981 creating build/lib/flashinfer/data/csrc/fmha_v2 2026-04-14T21:02:48,982 copying ./csrc/fmha_v2/fused_multihead_flash_attention_kernel.h -> build/lib/flashinfer/data/csrc/fmha_v2 2026-04-14T21:02:48,985 copying ./csrc/fmha_v2/fused_multihead_attention_kernel_4xN_hopper_noloop.h -> build/lib/flashinfer/data/csrc/fmha_v2 2026-04-14T21:02:48,987 copying ./csrc/fmha_v2/fused_multihead_attention_kernel_4x1_hopper.h -> build/lib/flashinfer/data/csrc/fmha_v2 2026-04-14T21:02:48,990 copying ./csrc/fmha_v2/fused_multihead_attention_kernel_4xN_hopper.h -> build/lib/flashinfer/data/csrc/fmha_v2 2026-04-14T21:02:48,992 copying ./csrc/fmha_v2/fused_multihead_attention_utils.h -> build/lib/flashinfer/data/csrc/fmha_v2 2026-04-14T21:02:48,995 copying ./csrc/fmha_v2/fused_multihead_cross_attention.h -> build/lib/flashinfer/data/csrc/fmha_v2 2026-04-14T21:02:48,997 copying ./csrc/fmha_v2/fused_multihead_flash_attention_kernel_noloop_tiled.h -> build/lib/flashinfer/data/csrc/fmha_v2 2026-04-14T21:02:49,000 copying ./csrc/fmha_v2/fused_multihead_attention_kernel_1xN_multi_cta.h -> build/lib/flashinfer/data/csrc/fmha_v2 2026-04-14T21:02:49,003 copying ./csrc/fmha_v2/fused_multihead_flash_attention_kernel_noloop.h -> build/lib/flashinfer/data/csrc/fmha_v2 2026-04-14T21:02:49,006 copying ./csrc/fmha_v2/fused_multihead_cross_attention_kernel_1xN_noloop.h -> build/lib/flashinfer/data/csrc/fmha_v2 2026-04-14T21:02:49,008 copying ./csrc/fmha_v2/fused_multihead_attention_kernel.h -> build/lib/flashinfer/data/csrc/fmha_v2 2026-04-14T21:02:49,011 copying ./csrc/fmha_v2/fused_multihead_attention_kernel_1xN_noloop.h -> build/lib/flashinfer/data/csrc/fmha_v2 2026-04-14T21:02:49,014 copying ./csrc/fmha_v2/fused_multihead_attention_kernel_1xN.h -> build/lib/flashinfer/data/csrc/fmha_v2 2026-04-14T21:02:49,016 copying ./csrc/fmha_v2/fused_multihead_attention_kernel_4x1_hopper_noloop.h -> build/lib/flashinfer/data/csrc/fmha_v2 2026-04-14T21:02:49,019 creating build/lib/flashinfer/data/csrc/fmha_v2/templates 2026-04-14T21:02:49,020 copying ./csrc/fmha_v2/templates/kernel.jinja -> build/lib/flashinfer/data/csrc/fmha_v2/templates 2026-04-14T21:02:49,022 copying ./csrc/fmha_v2/templates/kernel_hopper.jinja -> build/lib/flashinfer/data/csrc/fmha_v2/templates 2026-04-14T21:02:49,025 copying ./csrc/fmha_v2/templates/fa_kernel.jinja -> build/lib/flashinfer/data/csrc/fmha_v2/templates 2026-04-14T21:02:49,028 copying ./csrc/fmha_v2/templates/kernel_hopper_ws.jinja -> build/lib/flashinfer/data/csrc/fmha_v2/templates 2026-04-14T21:02:49,030 copying ./csrc/fmha_v2/fused_multihead_attention_demo_bert_params.h -> build/lib/flashinfer/data/csrc/fmha_v2 2026-04-14T21:02:49,033 copying ./csrc/fmha_v2/fused_multihead_attention_kernel_2x2.h -> build/lib/flashinfer/data/csrc/fmha_v2 2026-04-14T21:02:49,036 creating build/lib/flashinfer/data/csrc/fmha_v2/fmha 2026-04-14T21:02:49,037 copying ./csrc/fmha_v2/fmha/paged_kv_cache.h -> build/lib/flashinfer/data/csrc/fmha_v2/fmha 2026-04-14T21:02:49,039 copying ./csrc/fmha_v2/fmha/softmax.h -> build/lib/flashinfer/data/csrc/fmha_v2/fmha 2026-04-14T21:02:49,044 copying ./csrc/fmha_v2/fmha/gmem_tile_o_packed.h -> build/lib/flashinfer/data/csrc/fmha_v2/fmha 2026-04-14T21:02:49,047 copying ./csrc/fmha_v2/fmha/gemm.h -> build/lib/flashinfer/data/csrc/fmha_v2/fmha 2026-04-14T21:02:49,049 copying ./csrc/fmha_v2/fmha/gmem_tile_ps.h -> build/lib/flashinfer/data/csrc/fmha_v2/fmha 2026-04-14T21:02:49,052 copying ./csrc/fmha_v2/fmha/smem_tile.h -> build/lib/flashinfer/data/csrc/fmha_v2/fmha 2026-04-14T21:02:49,055 copying ./csrc/fmha_v2/fmha/gmem_tile_o.h -> build/lib/flashinfer/data/csrc/fmha_v2/fmha 2026-04-14T21:02:49,058 copying ./csrc/fmha_v2/fmha/fragment.h -> build/lib/flashinfer/data/csrc/fmha_v2/fmha 2026-04-14T21:02:49,061 copying ./csrc/fmha_v2/fmha/utils.h -> build/lib/flashinfer/data/csrc/fmha_v2/fmha 2026-04-14T21:02:49,065 copying ./csrc/fmha_v2/fmha/kernel_traits.h -> build/lib/flashinfer/data/csrc/fmha_v2/fmha 2026-04-14T21:02:49,068 copying ./csrc/fmha_v2/fmha/mask.h -> build/lib/flashinfer/data/csrc/fmha_v2/fmha 2026-04-14T21:02:49,070 creating build/lib/flashinfer/data/csrc/fmha_v2/fmha/hopper 2026-04-14T21:02:49,071 copying ./csrc/fmha_v2/fmha/hopper/gmem_tile_o_packed.h -> build/lib/flashinfer/data/csrc/fmha_v2/fmha/hopper 2026-04-14T21:02:49,074 copying ./csrc/fmha_v2/fmha/hopper/smem_tile.h -> build/lib/flashinfer/data/csrc/fmha_v2/fmha/hopper 2026-04-14T21:02:49,078 copying ./csrc/fmha_v2/fmha/hopper/utils_hgmma.h -> build/lib/flashinfer/data/csrc/fmha_v2/fmha/hopper 2026-04-14T21:02:49,081 copying ./csrc/fmha_v2/fmha/hopper/fragment.h -> build/lib/flashinfer/data/csrc/fmha_v2/fmha/hopper 2026-04-14T21:02:49,084 copying ./csrc/fmha_v2/fmha/hopper/gmma_descriptor.h -> build/lib/flashinfer/data/csrc/fmha_v2/fmha/hopper 2026-04-14T21:02:49,087 copying ./csrc/fmha_v2/fmha/hopper/compute_tile.h -> build/lib/flashinfer/data/csrc/fmha_v2/fmha/hopper 2026-04-14T21:02:49,090 copying ./csrc/fmha_v2/fmha/hopper/kernel_traits.h -> build/lib/flashinfer/data/csrc/fmha_v2/fmha/hopper 2026-04-14T21:02:49,092 copying ./csrc/fmha_v2/fmha/hopper/utils_gmma.h -> build/lib/flashinfer/data/csrc/fmha_v2/fmha/hopper 2026-04-14T21:02:49,094 copying ./csrc/fmha_v2/fmha/hopper/tma_descriptor.h -> build/lib/flashinfer/data/csrc/fmha_v2/fmha/hopper 2026-04-14T21:02:49,097 copying ./csrc/fmha_v2/fmha/hopper/gmem_tile_qkv_packed.h -> build/lib/flashinfer/data/csrc/fmha_v2/fmha/hopper 2026-04-14T21:02:49,099 copying ./csrc/fmha_v2/fmha/hopper/utils_hgmma_bf16.h -> build/lib/flashinfer/data/csrc/fmha_v2/fmha/hopper 2026-04-14T21:02:49,102 copying ./csrc/fmha_v2/fmha/hopper/utils_tma.h -> build/lib/flashinfer/data/csrc/fmha_v2/fmha/hopper 2026-04-14T21:02:49,104 copying ./csrc/fmha_v2/fmha/hopper/utils_igmma.h -> build/lib/flashinfer/data/csrc/fmha_v2/fmha/hopper 2026-04-14T21:02:49,107 copying ./csrc/fmha_v2/fmha/hopper/arrive_wait.h -> build/lib/flashinfer/data/csrc/fmha_v2/fmha/hopper 2026-04-14T21:02:49,110 copying ./csrc/fmha_v2/fmha/hopper/utils_qgmma.h -> build/lib/flashinfer/data/csrc/fmha_v2/fmha/hopper 2026-04-14T21:02:49,114 copying ./csrc/fmha_v2/fmha/hopper/tma_types.h -> build/lib/flashinfer/data/csrc/fmha_v2/fmha/hopper 2026-04-14T21:02:49,116 copying ./csrc/fmha_v2/fmha/hopper/smem_tile_o.h -> build/lib/flashinfer/data/csrc/fmha_v2/fmha/hopper 2026-04-14T21:02:49,118 copying ./csrc/fmha_v2/fmha/hopper/utils_warpgroup.h -> build/lib/flashinfer/data/csrc/fmha_v2/fmha/hopper 2026-04-14T21:02:49,120 copying ./csrc/fmha_v2/fmha/traits.h -> build/lib/flashinfer/data/csrc/fmha_v2/fmha 2026-04-14T21:02:49,123 copying ./csrc/fmha_v2/fmha/gmem_tile_qkv_packed.h -> build/lib/flashinfer/data/csrc/fmha_v2/fmha 2026-04-14T21:02:49,126 copying ./csrc/fmha_v2/fmha/numeric_types.h -> build/lib/flashinfer/data/csrc/fmha_v2/fmha 2026-04-14T21:02:49,128 copying ./csrc/fmha_v2/fmha/alibi_params.h -> build/lib/flashinfer/data/csrc/fmha_v2/fmha 2026-04-14T21:02:49,130 creating build/lib/flashinfer/data/csrc/fmha_v2/fmha/warpspec 2026-04-14T21:02:49,131 copying ./csrc/fmha_v2/fmha/warpspec/circular_buffer.h -> build/lib/flashinfer/data/csrc/fmha_v2/fmha/warpspec 2026-04-14T21:02:49,133 copying ./csrc/fmha_v2/fmha/warpspec/kernel_traits.h -> build/lib/flashinfer/data/csrc/fmha_v2/fmha/warpspec 2026-04-14T21:02:49,136 copying ./csrc/fmha_v2/fmha/warpspec/compute.h -> build/lib/flashinfer/data/csrc/fmha_v2/fmha/warpspec 2026-04-14T21:02:49,139 copying ./csrc/fmha_v2/fmha/warpspec/epilogue.h -> build/lib/flashinfer/data/csrc/fmha_v2/fmha/warpspec 2026-04-14T21:02:49,142 copying ./csrc/fmha_v2/fmha/warpspec/dma.h -> build/lib/flashinfer/data/csrc/fmha_v2/fmha/warpspec 2026-04-14T21:02:49,145 copying ./csrc/fmha_v2/fmha/gmem_tile_qkv.h -> build/lib/flashinfer/data/csrc/fmha_v2/fmha 2026-04-14T21:02:49,147 copying ./csrc/fmha_v2/fmha/smem_tile_v.h -> build/lib/flashinfer/data/csrc/fmha_v2/fmha 2026-04-14T21:02:49,151 copying ./csrc/fmha_v2/fmha/smem_tile_o.h -> build/lib/flashinfer/data/csrc/fmha_v2/fmha 2026-04-14T21:02:49,154 copying ./csrc/fmha_v2/fmha/smem_tile_qkv.h -> build/lib/flashinfer/data/csrc/fmha_v2/fmha 2026-04-14T21:02:49,157 copying ./csrc/fmha_v2/fused_multihead_attention.h -> build/lib/flashinfer/data/csrc/fmha_v2 2026-04-14T21:02:49,160 copying ./csrc/fmha_v2/fused_multihead_cross_attention_kernel_1xN.h -> build/lib/flashinfer/data/csrc/fmha_v2 2026-04-14T21:02:49,162 copying ./csrc/gemm_groupwise_sm100.cu -> build/lib/flashinfer/data/csrc 2026-04-14T21:02:49,164 copying ./csrc/pod_jit_binding.cu -> build/lib/flashinfer/data/csrc 2026-04-14T21:02:49,167 copying ./csrc/single_prefill_sm90.cu -> build/lib/flashinfer/data/csrc 2026-04-14T21:02:49,169 copying ./csrc/mxfp8_gemm_cutlass_sm120.jinja -> build/lib/flashinfer/data/csrc 2026-04-14T21:02:49,171 copying ./csrc/selective_state_update_customize_config.jinja -> build/lib/flashinfer/data/csrc 2026-04-14T21:02:49,173 copying ./csrc/batch_mla_plan.cu -> build/lib/flashinfer/data/csrc 2026-04-14T21:02:49,175 copying ./csrc/tgv_gemm.jinja -> build/lib/flashinfer/data/csrc 2026-04-14T21:02:49,177 copying ./csrc/batch_prefill_sm90_customize_config.jinja -> build/lib/flashinfer/data/csrc 2026-04-14T21:02:49,179 copying ./csrc/trtllm_fmha_kernel_launcher.cu -> build/lib/flashinfer/data/csrc 2026-04-14T21:02:49,182 copying ./csrc/trtllm_allreduce_fusion.cu -> build/lib/flashinfer/data/csrc 2026-04-14T21:02:49,185 copying ./csrc/batch_prefill_paged_kernel_inst.jinja -> build/lib/flashinfer/data/csrc 2026-04-14T21:02:49,187 copying ./csrc/flashinfer_mamba_binding.cu -> build/lib/flashinfer/data/csrc 2026-04-14T21:02:49,189 copying ./csrc/fmha_cutlass_sm100.cu -> build/lib/flashinfer/data/csrc 2026-04-14T21:02:49,191 copying ./csrc/group_gemm_fp8_groupwise_sm100.cu -> build/lib/flashinfer/data/csrc 2026-04-14T21:02:49,194 copying ./csrc/fmha_cutlass_sm100_binding.cu -> build/lib/flashinfer/data/csrc 2026-04-14T21:02:49,196 copying ./csrc/batch_decode_mla_run.cu -> build/lib/flashinfer/data/csrc 2026-04-14T21:02:49,198 copying ./csrc/single_prefill_kernel_inst.jinja -> build/lib/flashinfer/data/csrc 2026-04-14T21:02:49,200 copying ./csrc/batch_prefill_fp8_sm90.cu -> build/lib/flashinfer/data/csrc 2026-04-14T21:02:49,202 copying ./csrc/fp4_gemm_cutlass.jinja -> build/lib/flashinfer/data/csrc 2026-04-14T21:02:49,204 copying ./csrc/quantization.cu -> build/lib/flashinfer/data/csrc 2026-04-14T21:02:49,206 copying ./csrc/group_gemm_fp8_groupwise_sm120_kernel_inst.jinja -> build/lib/flashinfer/data/csrc 2026-04-14T21:02:49,208 copying ./csrc/fp4_gemm_cutlass.cu -> build/lib/flashinfer/data/csrc 2026-04-14T21:02:49,210 copying ./csrc/batch_decode_mla_binding.cu -> build/lib/flashinfer/data/csrc 2026-04-14T21:02:49,212 copying ./csrc/pod_customize_config.jinja -> build/lib/flashinfer/data/csrc 2026-04-14T21:02:49,214 copying ./csrc/cudnn_sdpa_kernel_launcher.cu -> build/lib/flashinfer/data/csrc 2026-04-14T21:02:49,218 copying ./csrc/trtllm_mnnvl_allreduce.cu -> build/lib/flashinfer/data/csrc 2026-04-14T21:02:49,220 copying ./csrc/cudnn_sdpa_utils.h -> build/lib/flashinfer/data/csrc 2026-04-14T21:02:49,223 copying ./csrc/batch_decode_jit_binding.cu -> build/lib/flashinfer/data/csrc 2026-04-14T21:02:49,225 copying ./csrc/fp4_kv_quantization.cu -> build/lib/flashinfer/data/csrc 2026-04-14T21:02:49,228 creating build/lib/flashinfer/data/include/flashinfer 2026-04-14T21:02:49,229 copying ./include/flashinfer/topk.cuh -> build/lib/flashinfer/data/include/flashinfer 2026-04-14T21:02:49,234 copying ./include/flashinfer/fastdiv.cuh -> build/lib/flashinfer/data/include/flashinfer 2026-04-14T21:02:49,236 creating build/lib/flashinfer/data/include/flashinfer/norm 2026-04-14T21:02:49,237 copying ./include/flashinfer/norm/ln_fwd_silu_kernel.cuh -> build/lib/flashinfer/data/include/flashinfer/norm 2026-04-14T21:02:49,240 copying ./include/flashinfer/norm/ln_silu_headers.cuh -> build/lib/flashinfer/data/include/flashinfer/norm 2026-04-14T21:02:49,244 copying ./include/flashinfer/math.cuh -> build/lib/flashinfer/data/include/flashinfer 2026-04-14T21:02:49,246 copying ./include/flashinfer/cubin_loader.h -> build/lib/flashinfer/data/include/flashinfer 2026-04-14T21:02:49,248 creating build/lib/flashinfer/data/include/flashinfer/flat/ampere/collective 2026-04-14T21:02:49,250 copying ./include/flashinfer/flat/ampere/collective/flat_collective_inverse.hpp -> build/lib/flashinfer/data/include/flashinfer/flat/ampere/collective 2026-04-14T21:02:49,253 copying ./include/flashinfer/flat/ampere/collective/flat_collective_load.hpp -> build/lib/flashinfer/data/include/flashinfer/flat/ampere/collective 2026-04-14T21:02:49,256 copying ./include/flashinfer/flat/unused.hpp -> build/lib/flashinfer/data/include/flashinfer/flat 2026-04-14T21:02:49,258 creating build/lib/flashinfer/data/include/flashinfer/flat/prefill 2026-04-14T21:02:49,259 copying ./include/flashinfer/flat/prefill/prefill_kernel.hpp -> build/lib/flashinfer/data/include/flashinfer/flat/prefill 2026-04-14T21:02:49,261 copying ./include/flashinfer/flat/prefill/prefill_kernel_delta_rule_sm90.cuh -> build/lib/flashinfer/data/include/flashinfer/flat/prefill 2026-04-14T21:02:49,264 creating build/lib/flashinfer/data/include/flashinfer/flat/hopper/device 2026-04-14T21:02:49,266 copying ./include/flashinfer/flat/hopper/device/device_universal.hpp -> build/lib/flashinfer/data/include/flashinfer/flat/hopper/device 2026-04-14T21:02:49,268 creating build/lib/flashinfer/data/include/flashinfer/flat/hopper/collective 2026-04-14T21:02:49,269 copying ./include/flashinfer/flat/hopper/collective/flat_collective_store.hpp -> build/lib/flashinfer/data/include/flashinfer/flat/hopper/collective 2026-04-14T21:02:49,272 copying ./include/flashinfer/flat/hopper/collective/flat_named_barriers.hpp -> build/lib/flashinfer/data/include/flashinfer/flat/hopper/collective 2026-04-14T21:02:49,274 copying ./include/flashinfer/flat/hopper/collective/flat_collective_load.hpp -> build/lib/flashinfer/data/include/flashinfer/flat/hopper/collective 2026-04-14T21:02:49,277 copying ./include/flashinfer/flat/hopper/collective/flat_common.hpp -> build/lib/flashinfer/data/include/flashinfer/flat/hopper/collective 2026-04-14T21:02:49,280 copying ./include/flashinfer/flat/hopper/collective/flat_collective_tma_warpspecialized_delta_rule.hpp -> build/lib/flashinfer/data/include/flashinfer/flat/hopper/collective 2026-04-14T21:02:49,283 creating build/lib/flashinfer/data/include/flashinfer/flat/hopper/kernel 2026-04-14T21:02:49,284 copying ./include/flashinfer/flat/hopper/kernel/flat_kernel_tma_warpspecialized_delta_rule.hpp -> build/lib/flashinfer/data/include/flashinfer/flat/hopper/kernel 2026-04-14T21:02:49,287 copying ./include/flashinfer/flat/hopper/kernel/flat_options.hpp -> build/lib/flashinfer/data/include/flashinfer/flat/hopper/kernel 2026-04-14T21:02:49,289 copying ./include/flashinfer/flat/hopper/kernel/flat_kernel_builder_delta_rule.hpp -> build/lib/flashinfer/data/include/flashinfer/flat/hopper/kernel 2026-04-14T21:02:49,291 copying ./include/flashinfer/flat/hopper/kernel/flat_tile_scheduler.hpp -> build/lib/flashinfer/data/include/flashinfer/flat/hopper/kernel 2026-04-14T21:02:49,294 copying ./include/flashinfer/flat/type_traits.hpp -> build/lib/flashinfer/data/include/flashinfer/flat 2026-04-14T21:02:49,296 copying ./include/flashinfer/flat/cute_ext.hpp -> build/lib/flashinfer/data/include/flashinfer/flat 2026-04-14T21:02:49,298 copying ./include/flashinfer/flat/math.hpp -> build/lib/flashinfer/data/include/flashinfer/flat 2026-04-14T21:02:49,300 copying ./include/flashinfer/flat/math_order_barrier.hpp -> build/lib/flashinfer/data/include/flashinfer/flat 2026-04-14T21:02:49,302 copying ./include/flashinfer/flat/common.hpp -> build/lib/flashinfer/data/include/flashinfer/flat 2026-04-14T21:02:49,305 copying ./include/flashinfer/flat/debug.hpp -> build/lib/flashinfer/data/include/flashinfer/flat 2026-04-14T21:02:49,308 copying ./include/flashinfer/fp16.h -> build/lib/flashinfer/data/include/flashinfer 2026-04-14T21:02:49,310 copying ./include/flashinfer/vec_dtypes.cuh -> build/lib/flashinfer/data/include/flashinfer 2026-04-14T21:02:49,314 copying ./include/flashinfer/cutlass_utils.cuh -> build/lib/flashinfer/data/include/flashinfer 2026-04-14T21:02:49,316 copying ./include/flashinfer/allocator.h -> build/lib/flashinfer/data/include/flashinfer 2026-04-14T21:02:49,318 copying ./include/flashinfer/activation.cuh -> build/lib/flashinfer/data/include/flashinfer 2026-04-14T21:02:49,320 copying ./include/flashinfer/exception.h -> build/lib/flashinfer/data/include/flashinfer 2026-04-14T21:02:49,322 copying ./include/flashinfer/utils.cuh -> build/lib/flashinfer/data/include/flashinfer 2026-04-14T21:02:49,325 copying ./include/flashinfer/quantization.cuh -> build/lib/flashinfer/data/include/flashinfer 2026-04-14T21:02:49,327 copying ./include/flashinfer/norm.cuh -> build/lib/flashinfer/data/include/flashinfer 2026-04-14T21:02:49,330 creating build/lib/flashinfer/data/include/flashinfer/gemm 2026-04-14T21:02:49,332 copying ./include/flashinfer/gemm/mxfp8_gemm_template_sm100.h -> build/lib/flashinfer/data/include/flashinfer/gemm 2026-04-14T21:02:49,335 copying ./include/flashinfer/gemm/tgv_gemm_template.h -> build/lib/flashinfer/data/include/flashinfer/gemm 2026-04-14T21:02:49,337 copying ./include/flashinfer/gemm/tgv_gemm_configs.h -> build/lib/flashinfer/data/include/flashinfer/gemm 2026-04-14T21:02:49,339 copying ./include/flashinfer/gemm/bmm_fp8.cuh -> build/lib/flashinfer/data/include/flashinfer/gemm 2026-04-14T21:02:49,342 copying ./include/flashinfer/gemm/gemm_groupwise_sm100.cuh -> build/lib/flashinfer/data/include/flashinfer/gemm 2026-04-14T21:02:49,345 copying ./include/flashinfer/gemm/group_gemm_fp8_groupwise_sm100.cuh -> build/lib/flashinfer/data/include/flashinfer/gemm 2026-04-14T21:02:49,347 copying ./include/flashinfer/gemm/fp8_gemm_cutlass_template.h -> build/lib/flashinfer/data/include/flashinfer/gemm 2026-04-14T21:02:49,350 copying ./include/flashinfer/gemm/mxfp8_gemm_template_sm120.h -> build/lib/flashinfer/data/include/flashinfer/gemm 2026-04-14T21:02:49,353 copying ./include/flashinfer/gemm/fp4_gemm_template_sm120.h -> build/lib/flashinfer/data/include/flashinfer/gemm 2026-04-14T21:02:49,355 copying ./include/flashinfer/gemm/mxfp8_gemm_cutlass_template_sm120.h -> build/lib/flashinfer/data/include/flashinfer/gemm 2026-04-14T21:02:49,358 copying ./include/flashinfer/gemm/fp8_gemm_cutlass.h -> build/lib/flashinfer/data/include/flashinfer/gemm 2026-04-14T21:02:49,360 copying ./include/flashinfer/gemm/mxfp8_gemm_cutlass.h -> build/lib/flashinfer/data/include/flashinfer/gemm 2026-04-14T21:02:49,362 copying ./include/flashinfer/gemm/fp4_gemm_template_sm100.h -> build/lib/flashinfer/data/include/flashinfer/gemm 2026-04-14T21:02:49,365 copying ./include/flashinfer/gemm/group_gemm_fp8_groupwise_sm120.cuh -> build/lib/flashinfer/data/include/flashinfer/gemm 2026-04-14T21:02:49,368 copying ./include/flashinfer/gemm/fp4_gemm_cutlass.h -> build/lib/flashinfer/data/include/flashinfer/gemm 2026-04-14T21:02:49,370 copying ./include/flashinfer/gemm/group_gemm_mxfp4_groupwise_sm100.cuh -> build/lib/flashinfer/data/include/flashinfer/gemm 2026-04-14T21:02:49,372 copying ./include/flashinfer/gemm/group_gemm_mxfp4_groupwise_sm120.cuh -> build/lib/flashinfer/data/include/flashinfer/gemm 2026-04-14T21:02:49,375 copying ./include/flashinfer/gemm/bf16_gemm_cutlass.h -> build/lib/flashinfer/data/include/flashinfer/gemm 2026-04-14T21:02:49,377 copying ./include/flashinfer/gemm/group_gemv.cuh -> build/lib/flashinfer/data/include/flashinfer/gemm 2026-04-14T21:02:49,379 copying ./include/flashinfer/gemm/fp4_gemm_template_sm103.h -> build/lib/flashinfer/data/include/flashinfer/gemm 2026-04-14T21:02:49,382 copying ./include/flashinfer/gemm/group_gemm.cuh -> build/lib/flashinfer/data/include/flashinfer/gemm 2026-04-14T21:02:49,384 copying ./include/flashinfer/gemm/gemm_groupwise_sm120.cuh -> build/lib/flashinfer/data/include/flashinfer/gemm 2026-04-14T21:02:49,386 copying ./include/flashinfer/gemm/bf16_gemm_template_sm100.h -> build/lib/flashinfer/data/include/flashinfer/gemm 2026-04-14T21:02:49,389 copying ./include/flashinfer/gemm/fp4_gemm_cutlass_template_sm103.h -> build/lib/flashinfer/data/include/flashinfer/gemm 2026-04-14T21:02:49,392 copying ./include/flashinfer/gemm/fp4_gemm_cutlass_template.h -> build/lib/flashinfer/data/include/flashinfer/gemm 2026-04-14T21:02:49,394 copying ./include/flashinfer/gemm/fp8_gemm_template_sm100.h -> build/lib/flashinfer/data/include/flashinfer/gemm 2026-04-14T21:02:49,397 copying ./include/flashinfer/gemm/mxfp8_gemm_cutlass_template.h -> build/lib/flashinfer/data/include/flashinfer/gemm 2026-04-14T21:02:49,400 copying ./include/flashinfer/gemm/group_gemm_nvfp4_groupwise_sm120.cuh -> build/lib/flashinfer/data/include/flashinfer/gemm 2026-04-14T21:02:49,403 copying ./include/flashinfer/gemm/cutlass_gemm_configs.h -> build/lib/flashinfer/data/include/flashinfer/gemm 2026-04-14T21:02:49,406 copying ./include/flashinfer/gemm/fp4_gemm_cutlass_template_sm120.h -> build/lib/flashinfer/data/include/flashinfer/gemm 2026-04-14T21:02:49,408 copying ./include/flashinfer/gemm/group_gemm_lora.cuh -> build/lib/flashinfer/data/include/flashinfer/gemm 2026-04-14T21:02:49,411 copying ./include/flashinfer/gemm/tgv_gemm.cuh -> build/lib/flashinfer/data/include/flashinfer/gemm 2026-04-14T21:02:49,415 copying ./include/flashinfer/gemm/dsv3_router_gemm.cuh -> build/lib/flashinfer/data/include/flashinfer/gemm 2026-04-14T21:02:49,417 copying ./include/flashinfer/gemm/group_gemm_sm90.cuh -> build/lib/flashinfer/data/include/flashinfer/gemm 2026-04-14T21:02:49,420 copying ./include/flashinfer/gemm/bf16_gemm_cutlass_template.h -> build/lib/flashinfer/data/include/flashinfer/gemm 2026-04-14T21:02:49,422 copying ./include/flashinfer/attention_impl.cuh -> build/lib/flashinfer/data/include/flashinfer 2026-04-14T21:02:49,424 copying ./include/flashinfer/arch_condition.h -> build/lib/flashinfer/data/include/flashinfer 2026-04-14T21:02:49,426 copying ./include/flashinfer/sampling.cuh -> build/lib/flashinfer/data/include/flashinfer 2026-04-14T21:02:49,430 copying ./include/flashinfer/fp4_layout.cuh -> build/lib/flashinfer/data/include/flashinfer 2026-04-14T21:02:49,432 creating build/lib/flashinfer/data/include/flashinfer/comm 2026-04-14T21:02:49,433 copying ./include/flashinfer/comm/trtllm_allreduce_fusion.cuh -> build/lib/flashinfer/data/include/flashinfer/comm 2026-04-14T21:02:49,436 copying ./include/flashinfer/comm/trtllm_allreduce.cuh -> build/lib/flashinfer/data/include/flashinfer/comm 2026-04-14T21:02:49,439 copying ./include/flashinfer/comm/trtllm_moe_allreduce_fusion.cuh -> build/lib/flashinfer/data/include/flashinfer/comm 2026-04-14T21:02:49,443 copying ./include/flashinfer/comm/vllm_custom_all_reduce.cuh -> build/lib/flashinfer/data/include/flashinfer/comm 2026-04-14T21:02:49,446 copying ./include/flashinfer/comm/trtllm_mnnvl_allreduce.cuh -> build/lib/flashinfer/data/include/flashinfer/comm 2026-04-14T21:02:49,449 copying ./include/flashinfer/comm/trtllm_alltoall.cuh -> build/lib/flashinfer/data/include/flashinfer/comm 2026-04-14T21:02:49,452 copying ./include/flashinfer/comm/trtllm_alltoall_prepare.cuh -> build/lib/flashinfer/data/include/flashinfer/comm 2026-04-14T21:02:49,454 copying ./include/flashinfer/pos_enc.cuh -> build/lib/flashinfer/data/include/flashinfer 2026-04-14T21:02:49,458 copying ./include/flashinfer/layout.cuh -> build/lib/flashinfer/data/include/flashinfer 2026-04-14T21:02:49,460 copying ./include/flashinfer/air_top_p.cuh -> build/lib/flashinfer/data/include/flashinfer 2026-04-14T21:02:49,463 copying ./include/flashinfer/permuted_smem.cuh -> build/lib/flashinfer/data/include/flashinfer 2026-04-14T21:02:49,465 copying ./include/flashinfer/concat_mla.cuh -> build/lib/flashinfer/data/include/flashinfer 2026-04-14T21:02:49,468 creating build/lib/flashinfer/data/include/flashinfer/mamba 2026-04-14T21:02:49,469 copying ./include/flashinfer/mamba/seq_chunk_cumsum.cuh -> build/lib/flashinfer/data/include/flashinfer/mamba 2026-04-14T21:02:49,472 copying ./include/flashinfer/mamba/create_tensor_map.cuh -> build/lib/flashinfer/data/include/flashinfer/mamba 2026-04-14T21:02:49,475 copying ./include/flashinfer/mamba/invoke_selective_state_update_mtp.cuh -> build/lib/flashinfer/data/include/flashinfer/mamba 2026-04-14T21:02:49,478 copying ./include/flashinfer/mamba/common.cuh -> build/lib/flashinfer/data/include/flashinfer/mamba 2026-04-14T21:02:49,480 copying ./include/flashinfer/mamba/kernel_selective_state_update_mtp_horizontal.cuh -> build/lib/flashinfer/data/include/flashinfer/mamba 2026-04-14T21:02:49,483 copying ./include/flashinfer/mamba/selective_state_update.cuh -> build/lib/flashinfer/data/include/flashinfer/mamba 2026-04-14T21:02:49,485 copying ./include/flashinfer/mamba/kernel_selective_state_update_mtp_vertical.cuh -> build/lib/flashinfer/data/include/flashinfer/mamba 2026-04-14T21:02:49,488 copying ./include/flashinfer/mamba/kernel_selective_state_update_stp.cuh -> build/lib/flashinfer/data/include/flashinfer/mamba 2026-04-14T21:02:49,492 copying ./include/flashinfer/mamba/kernel_selective_state_update_mtp_async_horizontal.cuh -> build/lib/flashinfer/data/include/flashinfer/mamba 2026-04-14T21:02:49,495 copying ./include/flashinfer/mamba/ssu_mtp_common.cuh -> build/lib/flashinfer/data/include/flashinfer/mamba 2026-04-14T21:02:49,497 copying ./include/flashinfer/mamba/kernel_selective_state_update_mtp_simple.cuh -> build/lib/flashinfer/data/include/flashinfer/mamba 2026-04-14T21:02:49,500 copying ./include/flashinfer/mamba/conversion.cuh -> build/lib/flashinfer/data/include/flashinfer/mamba 2026-04-14T21:02:49,502 copying ./include/flashinfer/profiler.cuh -> build/lib/flashinfer/data/include/flashinfer 2026-04-14T21:02:49,505 copying ./include/flashinfer/mma.cuh -> build/lib/flashinfer/data/include/flashinfer 2026-04-14T21:02:49,508 copying ./include/flashinfer/frag_layout_swizzle.cuh -> build/lib/flashinfer/data/include/flashinfer 2026-04-14T21:02:49,510 creating build/lib/flashinfer/data/include/flashinfer/attention 2026-04-14T21:02:49,511 copying ./include/flashinfer/attention/state.cuh -> build/lib/flashinfer/data/include/flashinfer/attention 2026-04-14T21:02:49,513 copying ./include/flashinfer/attention/mla.cuh -> build/lib/flashinfer/data/include/flashinfer/attention 2026-04-14T21:02:49,517 copying ./include/flashinfer/attention/variants.cuh -> build/lib/flashinfer/data/include/flashinfer/attention 2026-04-14T21:02:49,519 copying ./include/flashinfer/attention/persistent.cuh -> build/lib/flashinfer/data/include/flashinfer/attention 2026-04-14T21:02:49,521 copying ./include/flashinfer/attention/decode_mla_cute_sm80.cuh -> build/lib/flashinfer/data/include/flashinfer/attention 2026-04-14T21:02:49,525 copying ./include/flashinfer/attention/heap.h -> build/lib/flashinfer/data/include/flashinfer/attention 2026-04-14T21:02:49,527 creating build/lib/flashinfer/data/include/flashinfer/attention/hopper 2026-04-14T21:02:49,528 copying ./include/flashinfer/attention/hopper/sparse_mainloop.cuh -> build/lib/flashinfer/data/include/flashinfer/attention/hopper 2026-04-14T21:02:49,532 copying ./include/flashinfer/attention/hopper/tile_scheduler.cuh -> build/lib/flashinfer/data/include/flashinfer/attention/hopper 2026-04-14T21:02:49,535 copying ./include/flashinfer/attention/hopper/kernel_traits.cuh -> build/lib/flashinfer/data/include/flashinfer/attention/hopper 2026-04-14T21:02:49,538 copying ./include/flashinfer/attention/hopper/variants.cuh -> build/lib/flashinfer/data/include/flashinfer/attention/hopper 2026-04-14T21:02:49,541 copying ./include/flashinfer/attention/hopper/mainloop.cuh -> build/lib/flashinfer/data/include/flashinfer/attention/hopper 2026-04-14T21:02:49,544 copying ./include/flashinfer/attention/hopper/utils.cuh -> build/lib/flashinfer/data/include/flashinfer/attention/hopper 2026-04-14T21:02:49,547 copying ./include/flashinfer/attention/hopper/attention_updater.cuh -> build/lib/flashinfer/data/include/flashinfer/attention/hopper 2026-04-14T21:02:49,550 copying ./include/flashinfer/attention/hopper/prefill_sm90.cuh -> build/lib/flashinfer/data/include/flashinfer/attention/hopper 2026-04-14T21:02:49,554 copying ./include/flashinfer/attention/hopper/mainloop_mma.cuh -> build/lib/flashinfer/data/include/flashinfer/attention/hopper 2026-04-14T21:02:49,557 copying ./include/flashinfer/attention/hopper/variant_helper.cuh -> build/lib/flashinfer/data/include/flashinfer/attention/hopper 2026-04-14T21:02:49,560 copying ./include/flashinfer/attention/hopper/epilogue.cuh -> build/lib/flashinfer/data/include/flashinfer/attention/hopper 2026-04-14T21:02:49,563 copying ./include/flashinfer/attention/hopper/named_barrier.cuh -> build/lib/flashinfer/data/include/flashinfer/attention/hopper 2026-04-14T21:02:49,566 copying ./include/flashinfer/attention/hopper/default_params.cuh -> build/lib/flashinfer/data/include/flashinfer/attention/hopper 2026-04-14T21:02:49,569 creating build/lib/flashinfer/data/include/flashinfer/attention/hopper/quantization 2026-04-14T21:02:49,570 copying ./include/flashinfer/attention/hopper/quantization/kernel_traits.cuh -> build/lib/flashinfer/data/include/flashinfer/attention/hopper/quantization 2026-04-14T21:02:49,574 copying ./include/flashinfer/attention/hopper/quantization/prefill_sm90.cuh -> build/lib/flashinfer/data/include/flashinfer/attention/hopper/quantization 2026-04-14T21:02:49,578 copying ./include/flashinfer/attention/hopper/quantization/mainloop_mma.cuh -> build/lib/flashinfer/data/include/flashinfer/attention/hopper/quantization 2026-04-14T21:02:49,581 copying ./include/flashinfer/attention/hopper/quantization/mainloop_load.cuh -> build/lib/flashinfer/data/include/flashinfer/attention/hopper/quantization 2026-04-14T21:02:49,585 copying ./include/flashinfer/attention/hopper/quantization/epilogue.cuh -> build/lib/flashinfer/data/include/flashinfer/attention/hopper/quantization 2026-04-14T21:02:49,588 copying ./include/flashinfer/attention/hopper/quantization/mainloop_sparse_load.cuh -> build/lib/flashinfer/data/include/flashinfer/attention/hopper/quantization 2026-04-14T21:02:49,592 copying ./include/flashinfer/attention/mla_hopper.cuh -> build/lib/flashinfer/data/include/flashinfer/attention 2026-04-14T21:02:49,596 copying ./include/flashinfer/attention/default_prefill_params.cuh -> build/lib/flashinfer/data/include/flashinfer/attention 2026-04-14T21:02:49,599 copying ./include/flashinfer/attention/mask.cuh -> build/lib/flashinfer/data/include/flashinfer/attention 2026-04-14T21:02:49,602 copying ./include/flashinfer/attention/default_decode_params.cuh -> build/lib/flashinfer/data/include/flashinfer/attention 2026-04-14T21:02:49,605 copying ./include/flashinfer/attention/hopper.cuh -> build/lib/flashinfer/data/include/flashinfer/attention 2026-04-14T21:02:49,608 copying ./include/flashinfer/attention/cutlass_mla.cuh -> build/lib/flashinfer/data/include/flashinfer/attention 2026-04-14T21:02:49,611 copying ./include/flashinfer/attention/decode.cuh -> build/lib/flashinfer/data/include/flashinfer/attention 2026-04-14T21:02:49,615 copying ./include/flashinfer/attention/cascade.cuh -> build/lib/flashinfer/data/include/flashinfer/attention 2026-04-14T21:02:49,619 copying ./include/flashinfer/attention/pod.cuh -> build/lib/flashinfer/data/include/flashinfer/attention 2026-04-14T21:02:49,622 creating build/lib/flashinfer/data/include/flashinfer/attention/blackwell 2026-04-14T21:02:49,623 copying ./include/flashinfer/attention/blackwell/plan.cuh -> build/lib/flashinfer/data/include/flashinfer/attention/blackwell 2026-04-14T21:02:49,626 copying ./include/flashinfer/attention/blackwell/fmha_cutlass_sm100.cuh -> build/lib/flashinfer/data/include/flashinfer/attention/blackwell 2026-04-14T21:02:49,628 creating build/lib/flashinfer/data/include/flashinfer/attention/blackwell/device 2026-04-14T21:02:49,629 copying ./include/flashinfer/attention/blackwell/device/fmha.hpp -> build/lib/flashinfer/data/include/flashinfer/attention/blackwell/device 2026-04-14T21:02:49,632 copying ./include/flashinfer/attention/blackwell/device/sm100_mla.hpp -> build/lib/flashinfer/data/include/flashinfer/attention/blackwell/device 2026-04-14T21:02:49,635 creating build/lib/flashinfer/data/include/flashinfer/attention/blackwell/common 2026-04-14T21:02:49,636 copying ./include/flashinfer/attention/blackwell/common/pow_2.hpp -> build/lib/flashinfer/data/include/flashinfer/attention/blackwell/common 2026-04-14T21:02:49,638 creating build/lib/flashinfer/data/include/flashinfer/attention/blackwell/collective 2026-04-14T21:02:49,639 copying ./include/flashinfer/attention/blackwell/collective/sm100_fmha_load_cpasync_warpspecialized.hpp -> build/lib/flashinfer/data/include/flashinfer/attention/blackwell/collective 2026-04-14T21:02:49,642 copying ./include/flashinfer/attention/blackwell/collective/sm100_fmha_load_tma_warpspecialized.hpp -> build/lib/flashinfer/data/include/flashinfer/attention/blackwell/collective 2026-04-14T21:02:49,645 copying ./include/flashinfer/attention/blackwell/collective/sm100_fmha_fwd_epilogue_tma_warpspecialized.hpp -> build/lib/flashinfer/data/include/flashinfer/attention/blackwell/collective 2026-04-14T21:02:49,647 copying ./include/flashinfer/attention/blackwell/collective/fmha_common.hpp -> build/lib/flashinfer/data/include/flashinfer/attention/blackwell/collective 2026-04-14T21:02:49,650 copying ./include/flashinfer/attention/blackwell/collective/sm100_fmha_gen_epilogue_warpspecialized.hpp -> build/lib/flashinfer/data/include/flashinfer/attention/blackwell/collective 2026-04-14T21:02:49,652 copying ./include/flashinfer/attention/blackwell/collective/sm100_fmha_fwd_mainloop_tma_warpspecialized.hpp -> build/lib/flashinfer/data/include/flashinfer/attention/blackwell/collective 2026-04-14T21:02:49,655 copying ./include/flashinfer/attention/blackwell/collective/fmha_fusion.hpp -> build/lib/flashinfer/data/include/flashinfer/attention/blackwell/collective 2026-04-14T21:02:49,658 copying ./include/flashinfer/attention/blackwell/collective/sm100_fmha_gen_mainloop_warpspecialized.hpp -> build/lib/flashinfer/data/include/flashinfer/attention/blackwell/collective 2026-04-14T21:02:49,661 creating build/lib/flashinfer/data/include/flashinfer/attention/blackwell/kernel 2026-04-14T21:02:49,662 copying ./include/flashinfer/attention/blackwell/kernel/fmha_options.hpp -> build/lib/flashinfer/data/include/flashinfer/attention/blackwell/kernel 2026-04-14T21:02:49,664 copying ./include/flashinfer/attention/blackwell/kernel/sm100_mla_tile_scheduler.hpp -> build/lib/flashinfer/data/include/flashinfer/attention/blackwell/kernel 2026-04-14T21:02:49,667 copying ./include/flashinfer/attention/blackwell/kernel/sm100_fmha_gen_kernel_warpspecialized.hpp -> build/lib/flashinfer/data/include/flashinfer/attention/blackwell/kernel 2026-04-14T21:02:49,669 copying ./include/flashinfer/attention/blackwell/kernel/fmha_tile_scheduler.hpp -> build/lib/flashinfer/data/include/flashinfer/attention/blackwell/kernel 2026-04-14T21:02:49,672 copying ./include/flashinfer/attention/blackwell/kernel/sm100_fmha_mla_reduction.hpp -> build/lib/flashinfer/data/include/flashinfer/attention/blackwell/kernel 2026-04-14T21:02:49,674 copying ./include/flashinfer/attention/blackwell/kernel/gather_tensor.hpp -> build/lib/flashinfer/data/include/flashinfer/attention/blackwell/kernel 2026-04-14T21:02:49,677 copying ./include/flashinfer/attention/blackwell/kernel/sm100_fmha_mla_tma_warpspecialized.hpp -> build/lib/flashinfer/data/include/flashinfer/attention/blackwell/kernel 2026-04-14T21:02:49,680 copying ./include/flashinfer/attention/blackwell/kernel/sm100_fmha_fwd_kernel_tma_warpspecialized.hpp -> build/lib/flashinfer/data/include/flashinfer/attention/blackwell/kernel 2026-04-14T21:02:49,683 copying ./include/flashinfer/attention/batch_pod.cuh -> build/lib/flashinfer/data/include/flashinfer/attention 2026-04-14T21:02:49,686 copying ./include/flashinfer/attention/variant_helper.cuh -> build/lib/flashinfer/data/include/flashinfer/attention 2026-04-14T21:02:49,689 copying ./include/flashinfer/attention/prefill.cuh -> build/lib/flashinfer/data/include/flashinfer/attention 2026-04-14T21:02:49,694 copying ./include/flashinfer/attention/mla_params.cuh -> build/lib/flashinfer/data/include/flashinfer/attention 2026-04-14T21:02:49,696 copying ./include/flashinfer/attention/scheduler.cuh -> build/lib/flashinfer/data/include/flashinfer/attention 2026-04-14T21:02:49,699 copying ./include/flashinfer/attention/persistent_template.cuh -> build/lib/flashinfer/data/include/flashinfer/attention 2026-04-14T21:02:49,702 copying ./include/flashinfer/cp_async.cuh -> build/lib/flashinfer/data/include/flashinfer 2026-04-14T21:02:49,705 creating build/lib/flashinfer/data/include/flashinfer/trtllm/fused_moe 2026-04-14T21:02:49,707 copying ./include/flashinfer/trtllm/fused_moe/IntFastDiv.h -> build/lib/flashinfer/data/include/flashinfer/trtllm/fused_moe 2026-04-14T21:02:49,710 copying ./include/flashinfer/trtllm/fused_moe/runner.h -> build/lib/flashinfer/data/include/flashinfer/trtllm/fused_moe 2026-04-14T21:02:49,713 copying ./include/flashinfer/trtllm/fused_moe/RoutingCustomPolicy.cuh -> build/lib/flashinfer/data/include/flashinfer/trtllm/fused_moe 2026-04-14T21:02:49,716 copying ./include/flashinfer/trtllm/fused_moe/RoutingKernel.cuh -> build/lib/flashinfer/data/include/flashinfer/trtllm/fused_moe 2026-04-14T21:02:49,719 copying ./include/flashinfer/trtllm/fused_moe/RoutingKernel.h -> build/lib/flashinfer/data/include/flashinfer/trtllm/fused_moe 2026-04-14T21:02:49,722 copying ./include/flashinfer/trtllm/fused_moe/RoutingDevKernel.h -> build/lib/flashinfer/data/include/flashinfer/trtllm/fused_moe 2026-04-14T21:02:49,725 copying ./include/flashinfer/trtllm/fused_moe/noAuxTcKernels.h -> build/lib/flashinfer/data/include/flashinfer/trtllm/fused_moe 2026-04-14T21:02:49,727 copying ./include/flashinfer/trtllm/fused_moe/RoutingKernelTopK.cuh -> build/lib/flashinfer/data/include/flashinfer/trtllm/fused_moe 2026-04-14T21:02:49,730 copying ./include/flashinfer/trtllm/fused_moe/DevKernel.h -> build/lib/flashinfer/data/include/flashinfer/trtllm/fused_moe 2026-04-14T21:02:49,733 creating build/lib/flashinfer/data/include/flashinfer/trtllm/common 2026-04-14T21:02:49,734 copying ./include/flashinfer/trtllm/common/cudaUtils.h -> build/lib/flashinfer/data/include/flashinfer/trtllm/common 2026-04-14T21:02:49,736 copying ./include/flashinfer/trtllm/common/cudaBf16Fallbacks.cuh -> build/lib/flashinfer/data/include/flashinfer/trtllm/common 2026-04-14T21:02:49,739 copying ./include/flashinfer/trtllm/common/reduceKernelUtils.cuh -> build/lib/flashinfer/data/include/flashinfer/trtllm/common 2026-04-14T21:02:49,742 copying ./include/flashinfer/trtllm/common/cudaBf16Wrapper.h -> build/lib/flashinfer/data/include/flashinfer/trtllm/common 2026-04-14T21:02:49,744 copying ./include/flashinfer/trtllm/common/cudaTypeUtils.cuh -> build/lib/flashinfer/data/include/flashinfer/trtllm/common 2026-04-14T21:02:49,746 copying ./include/flashinfer/trtllm/common/cudaFp8Utils.h -> build/lib/flashinfer/data/include/flashinfer/trtllm/common 2026-04-14T21:02:49,749 creating build/lib/flashinfer/data/include/flashinfer/trtllm/batched_gemm 2026-04-14T21:02:49,750 copying ./include/flashinfer/trtllm/batched_gemm/KernelRunner.h -> build/lib/flashinfer/data/include/flashinfer/trtllm/batched_gemm 2026-04-14T21:02:49,752 copying ./include/flashinfer/trtllm/common.h -> build/lib/flashinfer/data/include/flashinfer/trtllm 2026-04-14T21:02:49,755 creating build/lib/flashinfer/data/include/flashinfer/trtllm/fmha 2026-04-14T21:02:49,756 copying ./include/flashinfer/trtllm/fmha/kernelParams.h -> build/lib/flashinfer/data/include/flashinfer/trtllm/fmha 2026-04-14T21:02:49,759 copying ./include/flashinfer/trtllm/fmha/fmhaKernels.cuh -> build/lib/flashinfer/data/include/flashinfer/trtllm/fmha 2026-04-14T21:02:49,763 copying ./include/flashinfer/trtllm/fmha/decoder_params.h -> build/lib/flashinfer/data/include/flashinfer/trtllm/fmha 2026-04-14T21:02:49,765 copying ./include/flashinfer/trtllm/fmha/lse.cuh -> build/lib/flashinfer/data/include/flashinfer/trtllm/fmha 2026-04-14T21:02:49,767 copying ./include/flashinfer/trtllm/fmha/fmhaRunnerParams.h -> build/lib/flashinfer/data/include/flashinfer/trtllm/fmha 2026-04-14T21:02:49,769 copying ./include/flashinfer/trtllm/fmha/kernelUtils.h -> build/lib/flashinfer/data/include/flashinfer/trtllm/fmha 2026-04-14T21:02:49,772 copying ./include/flashinfer/trtllm/fmha/fmhaReduction.h -> build/lib/flashinfer/data/include/flashinfer/trtllm/fmha 2026-04-14T21:02:49,774 copying ./include/flashinfer/trtllm/fmha/decoder_impl_common.h -> build/lib/flashinfer/data/include/flashinfer/trtllm/fmha 2026-04-14T21:02:49,777 copying ./include/flashinfer/trtllm/fmha/fmhaRunner.cuh -> build/lib/flashinfer/data/include/flashinfer/trtllm/fmha 2026-04-14T21:02:49,779 copying ./include/flashinfer/logging.h -> build/lib/flashinfer/data/include/flashinfer 2026-04-14T21:02:49,781 copying ./include/flashinfer/page.cuh -> build/lib/flashinfer/data/include/flashinfer 2026-04-14T21:02:49,784 creating build/lib/flashinfer/data/spdlog/include/spdlog 2026-04-14T21:02:49,786 copying 3rdparty/spdlog/include/spdlog/async_logger-inl.h -> build/lib/flashinfer/data/spdlog/include/spdlog 2026-04-14T21:02:49,788 creating build/lib/flashinfer/data/spdlog/include/spdlog/details 2026-04-14T21:02:49,789 copying 3rdparty/spdlog/include/spdlog/details/mpmc_blocking_q.h -> build/lib/flashinfer/data/spdlog/include/spdlog/details 2026-04-14T21:02:49,792 copying 3rdparty/spdlog/include/spdlog/details/file_helper-inl.h -> build/lib/flashinfer/data/spdlog/include/spdlog/details 2026-04-14T21:02:49,795 copying 3rdparty/spdlog/include/spdlog/details/file_helper.h -> build/lib/flashinfer/data/spdlog/include/spdlog/details 2026-04-14T21:02:49,797 copying 3rdparty/spdlog/include/spdlog/details/backtracer.h -> build/lib/flashinfer/data/spdlog/include/spdlog/details 2026-04-14T21:02:49,799 copying 3rdparty/spdlog/include/spdlog/details/synchronous_factory.h -> build/lib/flashinfer/data/spdlog/include/spdlog/details 2026-04-14T21:02:49,801 copying 3rdparty/spdlog/include/spdlog/details/fmt_helper.h -> build/lib/flashinfer/data/spdlog/include/spdlog/details 2026-04-14T21:02:49,803 copying 3rdparty/spdlog/include/spdlog/details/tcp_client-windows.h -> build/lib/flashinfer/data/spdlog/include/spdlog/details 2026-04-14T21:02:49,806 copying 3rdparty/spdlog/include/spdlog/details/tcp_client.h -> build/lib/flashinfer/data/spdlog/include/spdlog/details 2026-04-14T21:02:49,808 copying 3rdparty/spdlog/include/spdlog/details/periodic_worker-inl.h -> build/lib/flashinfer/data/spdlog/include/spdlog/details 2026-04-14T21:02:49,810 copying 3rdparty/spdlog/include/spdlog/details/circular_q.h -> build/lib/flashinfer/data/spdlog/include/spdlog/details 2026-04-14T21:02:49,812 copying 3rdparty/spdlog/include/spdlog/details/os.h -> build/lib/flashinfer/data/spdlog/include/spdlog/details 2026-04-14T21:02:49,815 copying 3rdparty/spdlog/include/spdlog/details/null_mutex.h -> build/lib/flashinfer/data/spdlog/include/spdlog/details 2026-04-14T21:02:49,817 copying 3rdparty/spdlog/include/spdlog/details/os-inl.h -> build/lib/flashinfer/data/spdlog/include/spdlog/details 2026-04-14T21:02:49,820 copying 3rdparty/spdlog/include/spdlog/details/thread_pool-inl.h -> build/lib/flashinfer/data/spdlog/include/spdlog/details 2026-04-14T21:02:49,823 copying 3rdparty/spdlog/include/spdlog/details/log_msg_buffer-inl.h -> build/lib/flashinfer/data/spdlog/include/spdlog/details 2026-04-14T21:02:49,825 copying 3rdparty/spdlog/include/spdlog/details/periodic_worker.h -> build/lib/flashinfer/data/spdlog/include/spdlog/details 2026-04-14T21:02:49,827 copying 3rdparty/spdlog/include/spdlog/details/console_globals.h -> build/lib/flashinfer/data/spdlog/include/spdlog/details 2026-04-14T21:02:49,829 copying 3rdparty/spdlog/include/spdlog/details/windows_include.h -> build/lib/flashinfer/data/spdlog/include/spdlog/details 2026-04-14T21:02:49,832 copying 3rdparty/spdlog/include/spdlog/details/udp_client-windows.h -> build/lib/flashinfer/data/spdlog/include/spdlog/details 2026-04-14T21:02:49,834 copying 3rdparty/spdlog/include/spdlog/details/log_msg_buffer.h -> build/lib/flashinfer/data/spdlog/include/spdlog/details 2026-04-14T21:02:49,836 copying 3rdparty/spdlog/include/spdlog/details/backtracer-inl.h -> build/lib/flashinfer/data/spdlog/include/spdlog/details 2026-04-14T21:02:49,838 copying 3rdparty/spdlog/include/spdlog/details/thread_pool.h -> build/lib/flashinfer/data/spdlog/include/spdlog/details 2026-04-14T21:02:49,840 copying 3rdparty/spdlog/include/spdlog/details/log_msg-inl.h -> build/lib/flashinfer/data/spdlog/include/spdlog/details 2026-04-14T21:02:49,843 copying 3rdparty/spdlog/include/spdlog/details/log_msg.h -> build/lib/flashinfer/data/spdlog/include/spdlog/details 2026-04-14T21:02:49,845 copying 3rdparty/spdlog/include/spdlog/details/registry-inl.h -> build/lib/flashinfer/data/spdlog/include/spdlog/details 2026-04-14T21:02:49,848 copying 3rdparty/spdlog/include/spdlog/details/udp_client.h -> build/lib/flashinfer/data/spdlog/include/spdlog/details 2026-04-14T21:02:49,850 copying 3rdparty/spdlog/include/spdlog/details/registry.h -> build/lib/flashinfer/data/spdlog/include/spdlog/details 2026-04-14T21:02:49,852 copying 3rdparty/spdlog/include/spdlog/mdc.h -> build/lib/flashinfer/data/spdlog/include/spdlog 2026-04-14T21:02:49,855 copying 3rdparty/spdlog/include/spdlog/pattern_formatter-inl.h -> build/lib/flashinfer/data/spdlog/include/spdlog 2026-04-14T21:02:49,858 copying 3rdparty/spdlog/include/spdlog/common-inl.h -> build/lib/flashinfer/data/spdlog/include/spdlog 2026-04-14T21:02:49,860 copying 3rdparty/spdlog/include/spdlog/tweakme.h -> build/lib/flashinfer/data/spdlog/include/spdlog 2026-04-14T21:02:49,863 copying 3rdparty/spdlog/include/spdlog/async.h -> build/lib/flashinfer/data/spdlog/include/spdlog 2026-04-14T21:02:49,865 copying 3rdparty/spdlog/include/spdlog/formatter.h -> build/lib/flashinfer/data/spdlog/include/spdlog 2026-04-14T21:02:49,867 creating build/lib/flashinfer/data/spdlog/include/spdlog/fmt 2026-04-14T21:02:49,868 copying 3rdparty/spdlog/include/spdlog/fmt/chrono.h -> build/lib/flashinfer/data/spdlog/include/spdlog/fmt 2026-04-14T21:02:49,870 copying 3rdparty/spdlog/include/spdlog/fmt/compile.h -> build/lib/flashinfer/data/spdlog/include/spdlog/fmt 2026-04-14T21:02:49,872 copying 3rdparty/spdlog/include/spdlog/fmt/bin_to_hex.h -> build/lib/flashinfer/data/spdlog/include/spdlog/fmt 2026-04-14T21:02:49,875 creating build/lib/flashinfer/data/spdlog/include/spdlog/fmt/bundled 2026-04-14T21:02:49,876 copying 3rdparty/spdlog/include/spdlog/fmt/bundled/chrono.h -> build/lib/flashinfer/data/spdlog/include/spdlog/fmt/bundled 2026-04-14T21:02:49,879 copying 3rdparty/spdlog/include/spdlog/fmt/bundled/args.h -> build/lib/flashinfer/data/spdlog/include/spdlog/fmt/bundled 2026-04-14T21:02:49,882 copying 3rdparty/spdlog/include/spdlog/fmt/bundled/core.h -> build/lib/flashinfer/data/spdlog/include/spdlog/fmt/bundled 2026-04-14T21:02:49,887 copying 3rdparty/spdlog/include/spdlog/fmt/bundled/compile.h -> build/lib/flashinfer/data/spdlog/include/spdlog/fmt/bundled 2026-04-14T21:02:49,889 copying 3rdparty/spdlog/include/spdlog/fmt/bundled/xchar.h -> build/lib/flashinfer/data/spdlog/include/spdlog/fmt/bundled 2026-04-14T21:02:49,892 copying 3rdparty/spdlog/include/spdlog/fmt/bundled/color.h -> build/lib/flashinfer/data/spdlog/include/spdlog/fmt/bundled 2026-04-14T21:02:49,895 copying 3rdparty/spdlog/include/spdlog/fmt/bundled/fmt.license.rst -> build/lib/flashinfer/data/spdlog/include/spdlog/fmt/bundled 2026-04-14T21:02:49,897 copying 3rdparty/spdlog/include/spdlog/fmt/bundled/os.h -> build/lib/flashinfer/data/spdlog/include/spdlog/fmt/bundled 2026-04-14T21:02:49,900 copying 3rdparty/spdlog/include/spdlog/fmt/bundled/std.h -> build/lib/flashinfer/data/spdlog/include/spdlog/fmt/bundled 2026-04-14T21:02:49,903 copying 3rdparty/spdlog/include/spdlog/fmt/bundled/format-inl.h -> build/lib/flashinfer/data/spdlog/include/spdlog/fmt/bundled 2026-04-14T21:02:49,907 copying 3rdparty/spdlog/include/spdlog/fmt/bundled/ranges.h -> build/lib/flashinfer/data/spdlog/include/spdlog/fmt/bundled 2026-04-14T21:02:49,910 copying 3rdparty/spdlog/include/spdlog/fmt/bundled/format.h -> build/lib/flashinfer/data/spdlog/include/spdlog/fmt/bundled 2026-04-14T21:02:49,916 copying 3rdparty/spdlog/include/spdlog/fmt/bundled/locale.h -> build/lib/flashinfer/data/spdlog/include/spdlog/fmt/bundled 2026-04-14T21:02:49,918 copying 3rdparty/spdlog/include/spdlog/fmt/bundled/ostream.h -> build/lib/flashinfer/data/spdlog/include/spdlog/fmt/bundled 2026-04-14T21:02:49,921 copying 3rdparty/spdlog/include/spdlog/fmt/bundled/printf.h -> build/lib/flashinfer/data/spdlog/include/spdlog/fmt/bundled 2026-04-14T21:02:49,923 copying 3rdparty/spdlog/include/spdlog/fmt/xchar.h -> build/lib/flashinfer/data/spdlog/include/spdlog/fmt 2026-04-14T21:02:49,926 copying 3rdparty/spdlog/include/spdlog/fmt/std.h -> build/lib/flashinfer/data/spdlog/include/spdlog/fmt 2026-04-14T21:02:49,928 copying 3rdparty/spdlog/include/spdlog/fmt/ranges.h -> build/lib/flashinfer/data/spdlog/include/spdlog/fmt 2026-04-14T21:02:49,930 copying 3rdparty/spdlog/include/spdlog/fmt/fmt.h -> build/lib/flashinfer/data/spdlog/include/spdlog/fmt 2026-04-14T21:02:49,932 copying 3rdparty/spdlog/include/spdlog/fmt/ostr.h -> build/lib/flashinfer/data/spdlog/include/spdlog/fmt 2026-04-14T21:02:49,934 creating build/lib/flashinfer/data/spdlog/include/spdlog/sinks 2026-04-14T21:02:49,935 copying 3rdparty/spdlog/include/spdlog/sinks/null_sink.h -> build/lib/flashinfer/data/spdlog/include/spdlog/sinks 2026-04-14T21:02:49,938 copying 3rdparty/spdlog/include/spdlog/sinks/ostream_sink.h -> build/lib/flashinfer/data/spdlog/include/spdlog/sinks 2026-04-14T21:02:49,940 copying 3rdparty/spdlog/include/spdlog/sinks/ansicolor_sink-inl.h -> build/lib/flashinfer/data/spdlog/include/spdlog/sinks 2026-04-14T21:02:49,943 copying 3rdparty/spdlog/include/spdlog/sinks/wincolor_sink-inl.h -> build/lib/flashinfer/data/spdlog/include/spdlog/sinks 2026-04-14T21:02:49,946 copying 3rdparty/spdlog/include/spdlog/sinks/wincolor_sink.h -> build/lib/flashinfer/data/spdlog/include/spdlog/sinks 2026-04-14T21:02:49,948 copying 3rdparty/spdlog/include/spdlog/sinks/base_sink.h -> build/lib/flashinfer/data/spdlog/include/spdlog/sinks 2026-04-14T21:02:49,950 copying 3rdparty/spdlog/include/spdlog/sinks/sink.h -> build/lib/flashinfer/data/spdlog/include/spdlog/sinks 2026-04-14T21:02:49,952 copying 3rdparty/spdlog/include/spdlog/sinks/mongo_sink.h -> build/lib/flashinfer/data/spdlog/include/spdlog/sinks 2026-04-14T21:02:49,954 copying 3rdparty/spdlog/include/spdlog/sinks/android_sink.h -> build/lib/flashinfer/data/spdlog/include/spdlog/sinks 2026-04-14T21:02:49,957 copying 3rdparty/spdlog/include/spdlog/sinks/basic_file_sink.h -> build/lib/flashinfer/data/spdlog/include/spdlog/sinks 2026-04-14T21:02:49,959 copying 3rdparty/spdlog/include/spdlog/sinks/ringbuffer_sink.h -> build/lib/flashinfer/data/spdlog/include/spdlog/sinks 2026-04-14T21:02:49,961 copying 3rdparty/spdlog/include/spdlog/sinks/qt_sinks.h -> build/lib/flashinfer/data/spdlog/include/spdlog/sinks 2026-04-14T21:02:49,964 copying 3rdparty/spdlog/include/spdlog/sinks/stdout_sinks.h -> build/lib/flashinfer/data/spdlog/include/spdlog/sinks 2026-04-14T21:02:49,966 copying 3rdparty/spdlog/include/spdlog/sinks/ansicolor_sink.h -> build/lib/flashinfer/data/spdlog/include/spdlog/sinks 2026-04-14T21:02:49,969 copying 3rdparty/spdlog/include/spdlog/sinks/dup_filter_sink.h -> build/lib/flashinfer/data/spdlog/include/spdlog/sinks 2026-04-14T21:02:49,971 copying 3rdparty/spdlog/include/spdlog/sinks/sink-inl.h -> build/lib/flashinfer/data/spdlog/include/spdlog/sinks 2026-04-14T21:02:49,973 copying 3rdparty/spdlog/include/spdlog/sinks/rotating_file_sink-inl.h -> build/lib/flashinfer/data/spdlog/include/spdlog/sinks 2026-04-14T21:02:49,975 copying 3rdparty/spdlog/include/spdlog/sinks/tcp_sink.h -> build/lib/flashinfer/data/spdlog/include/spdlog/sinks 2026-04-14T21:02:49,977 copying 3rdparty/spdlog/include/spdlog/sinks/systemd_sink.h -> build/lib/flashinfer/data/spdlog/include/spdlog/sinks 2026-04-14T21:02:49,980 copying 3rdparty/spdlog/include/spdlog/sinks/syslog_sink.h -> build/lib/flashinfer/data/spdlog/include/spdlog/sinks 2026-04-14T21:02:49,982 copying 3rdparty/spdlog/include/spdlog/sinks/win_eventlog_sink.h -> build/lib/flashinfer/data/spdlog/include/spdlog/sinks 2026-04-14T21:02:49,985 copying 3rdparty/spdlog/include/spdlog/sinks/dist_sink.h -> build/lib/flashinfer/data/spdlog/include/spdlog/sinks 2026-04-14T21:02:49,987 copying 3rdparty/spdlog/include/spdlog/sinks/rotating_file_sink.h -> build/lib/flashinfer/data/spdlog/include/spdlog/sinks 2026-04-14T21:02:49,989 copying 3rdparty/spdlog/include/spdlog/sinks/udp_sink.h -> build/lib/flashinfer/data/spdlog/include/spdlog/sinks 2026-04-14T21:02:49,991 copying 3rdparty/spdlog/include/spdlog/sinks/stdout_color_sinks.h -> build/lib/flashinfer/data/spdlog/include/spdlog/sinks 2026-04-14T21:02:49,993 copying 3rdparty/spdlog/include/spdlog/sinks/callback_sink.h -> build/lib/flashinfer/data/spdlog/include/spdlog/sinks 2026-04-14T21:02:49,995 copying 3rdparty/spdlog/include/spdlog/sinks/kafka_sink.h -> build/lib/flashinfer/data/spdlog/include/spdlog/sinks 2026-04-14T21:02:49,998 copying 3rdparty/spdlog/include/spdlog/sinks/daily_file_sink.h -> build/lib/flashinfer/data/spdlog/include/spdlog/sinks 2026-04-14T21:02:50,000 copying 3rdparty/spdlog/include/spdlog/sinks/stdout_color_sinks-inl.h -> build/lib/flashinfer/data/spdlog/include/spdlog/sinks 2026-04-14T21:02:50,003 copying 3rdparty/spdlog/include/spdlog/sinks/stdout_sinks-inl.h -> build/lib/flashinfer/data/spdlog/include/spdlog/sinks 2026-04-14T21:02:50,005 copying 3rdparty/spdlog/include/spdlog/sinks/hourly_file_sink.h -> build/lib/flashinfer/data/spdlog/include/spdlog/sinks 2026-04-14T21:02:50,008 copying 3rdparty/spdlog/include/spdlog/sinks/basic_file_sink-inl.h -> build/lib/flashinfer/data/spdlog/include/spdlog/sinks 2026-04-14T21:02:50,010 copying 3rdparty/spdlog/include/spdlog/sinks/msvc_sink.h -> build/lib/flashinfer/data/spdlog/include/spdlog/sinks 2026-04-14T21:02:50,012 copying 3rdparty/spdlog/include/spdlog/sinks/base_sink-inl.h -> build/lib/flashinfer/data/spdlog/include/spdlog/sinks 2026-04-14T21:02:50,014 copying 3rdparty/spdlog/include/spdlog/spdlog-inl.h -> build/lib/flashinfer/data/spdlog/include/spdlog 2026-04-14T21:02:50,016 copying 3rdparty/spdlog/include/spdlog/logger.h -> build/lib/flashinfer/data/spdlog/include/spdlog 2026-04-14T21:02:50,018 copying 3rdparty/spdlog/include/spdlog/version.h -> build/lib/flashinfer/data/spdlog/include/spdlog 2026-04-14T21:02:50,020 copying 3rdparty/spdlog/include/spdlog/common.h -> build/lib/flashinfer/data/spdlog/include/spdlog 2026-04-14T21:02:50,023 copying 3rdparty/spdlog/include/spdlog/pattern_formatter.h -> build/lib/flashinfer/data/spdlog/include/spdlog 2026-04-14T21:02:50,025 copying 3rdparty/spdlog/include/spdlog/fwd.h -> build/lib/flashinfer/data/spdlog/include/spdlog 2026-04-14T21:02:50,027 copying 3rdparty/spdlog/include/spdlog/logger-inl.h -> build/lib/flashinfer/data/spdlog/include/spdlog 2026-04-14T21:02:50,029 creating build/lib/flashinfer/data/spdlog/include/spdlog/cfg 2026-04-14T21:02:50,030 copying 3rdparty/spdlog/include/spdlog/cfg/helpers-inl.h -> build/lib/flashinfer/data/spdlog/include/spdlog/cfg 2026-04-14T21:02:50,032 copying 3rdparty/spdlog/include/spdlog/cfg/helpers.h -> build/lib/flashinfer/data/spdlog/include/spdlog/cfg 2026-04-14T21:02:50,034 copying 3rdparty/spdlog/include/spdlog/cfg/argv.h -> build/lib/flashinfer/data/spdlog/include/spdlog/cfg 2026-04-14T21:02:50,036 copying 3rdparty/spdlog/include/spdlog/cfg/env.h -> build/lib/flashinfer/data/spdlog/include/spdlog/cfg 2026-04-14T21:02:50,038 copying 3rdparty/spdlog/include/spdlog/spdlog.h -> build/lib/flashinfer/data/spdlog/include/spdlog 2026-04-14T21:02:50,040 copying 3rdparty/spdlog/include/spdlog/async_logger.h -> build/lib/flashinfer/data/spdlog/include/spdlog 2026-04-14T21:02:50,042 copying 3rdparty/spdlog/include/spdlog/stopwatch.h -> build/lib/flashinfer/data/spdlog/include/spdlog 2026-04-14T21:02:50,044 creating build/lib/flashinfer/data/cutlass/include/cute 2026-04-14T21:02:50,046 copying 3rdparty/cutlass/include/cute/swizzle.hpp -> build/lib/flashinfer/data/cutlass/include/cute 2026-04-14T21:02:50,048 creating build/lib/flashinfer/data/cutlass/include/cute/arch 2026-04-14T21:02:50,049 copying 3rdparty/cutlass/include/cute/arch/mma_sm90_desc.hpp -> build/lib/flashinfer/data/cutlass/include/cute/arch 2026-04-14T21:02:50,052 copying 3rdparty/cutlass/include/cute/arch/mma_sm70.hpp -> build/lib/flashinfer/data/cutlass/include/cute/arch 2026-04-14T21:02:50,055 copying 3rdparty/cutlass/include/cute/arch/mma_sm90_gmma_ext.hpp -> build/lib/flashinfer/data/cutlass/include/cute/arch 2026-04-14T21:02:50,097 copying 3rdparty/cutlass/include/cute/arch/copy.hpp -> build/lib/flashinfer/data/cutlass/include/cute/arch 2026-04-14T21:02:50,099 copying 3rdparty/cutlass/include/cute/arch/mma_sm120_sparse.hpp -> build/lib/flashinfer/data/cutlass/include/cute/arch 2026-04-14T21:02:50,104 copying 3rdparty/cutlass/include/cute/arch/config.hpp -> build/lib/flashinfer/data/cutlass/include/cute/arch 2026-04-14T21:02:50,107 copying 3rdparty/cutlass/include/cute/arch/copy_sm90_tma.hpp -> build/lib/flashinfer/data/cutlass/include/cute/arch 2026-04-14T21:02:50,111 copying 3rdparty/cutlass/include/cute/arch/cluster_sm90.hpp -> build/lib/flashinfer/data/cutlass/include/cute/arch 2026-04-14T21:02:50,113 copying 3rdparty/cutlass/include/cute/arch/tmem_allocator_sm100.hpp -> build/lib/flashinfer/data/cutlass/include/cute/arch 2026-04-14T21:02:50,116 copying 3rdparty/cutlass/include/cute/arch/util.hpp -> build/lib/flashinfer/data/cutlass/include/cute/arch 2026-04-14T21:02:50,119 copying 3rdparty/cutlass/include/cute/arch/copy_sm100.hpp -> build/lib/flashinfer/data/cutlass/include/cute/arch 2026-04-14T21:02:50,130 copying 3rdparty/cutlass/include/cute/arch/mma_sm80.hpp -> build/lib/flashinfer/data/cutlass/include/cute/arch 2026-04-14T21:02:50,134 copying 3rdparty/cutlass/include/cute/arch/mma.hpp -> build/lib/flashinfer/data/cutlass/include/cute/arch 2026-04-14T21:02:50,136 copying 3rdparty/cutlass/include/cute/arch/mma_sm90_gmma_sparse.hpp -> build/lib/flashinfer/data/cutlass/include/cute/arch 2026-04-14T21:02:50,159 copying 3rdparty/cutlass/include/cute/arch/mma_sm61.hpp -> build/lib/flashinfer/data/cutlass/include/cute/arch 2026-04-14T21:02:50,161 copying 3rdparty/cutlass/include/cute/arch/copy_sm80.hpp -> build/lib/flashinfer/data/cutlass/include/cute/arch 2026-04-14T21:02:50,164 copying 3rdparty/cutlass/include/cute/arch/mma_sm100_umma.hpp -> build/lib/flashinfer/data/cutlass/include/cute/arch 2026-04-14T21:02:50,168 copying 3rdparty/cutlass/include/cute/arch/mma_sm100_desc.hpp -> build/lib/flashinfer/data/cutlass/include/cute/arch 2026-04-14T21:02:50,170 copying 3rdparty/cutlass/include/cute/arch/copy_sm50.hpp -> build/lib/flashinfer/data/cutlass/include/cute/arch 2026-04-14T21:02:50,172 copying 3rdparty/cutlass/include/cute/arch/copy_sm100_tma.hpp -> build/lib/flashinfer/data/cutlass/include/cute/arch 2026-04-14T21:02:50,175 copying 3rdparty/cutlass/include/cute/arch/mma_sm120.hpp -> build/lib/flashinfer/data/cutlass/include/cute/arch 2026-04-14T21:02:50,179 copying 3rdparty/cutlass/include/cute/arch/mma_sm75.hpp -> build/lib/flashinfer/data/cutlass/include/cute/arch 2026-04-14T21:02:50,181 copying 3rdparty/cutlass/include/cute/arch/mma_sm100.hpp -> build/lib/flashinfer/data/cutlass/include/cute/arch 2026-04-14T21:02:50,183 copying 3rdparty/cutlass/include/cute/arch/cluster_sm100.hpp -> build/lib/flashinfer/data/cutlass/include/cute/arch 2026-04-14T21:02:50,186 copying 3rdparty/cutlass/include/cute/arch/copy_sm90_desc.hpp -> build/lib/flashinfer/data/cutlass/include/cute/arch 2026-04-14T21:02:50,188 copying 3rdparty/cutlass/include/cute/arch/mma_sm90.hpp -> build/lib/flashinfer/data/cutlass/include/cute/arch 2026-04-14T21:02:50,199 copying 3rdparty/cutlass/include/cute/arch/mma_sm90_gmma_sparse_ext.hpp -> build/lib/flashinfer/data/cutlass/include/cute/arch 2026-04-14T21:02:50,253 copying 3rdparty/cutlass/include/cute/arch/copy_sm90.hpp -> build/lib/flashinfer/data/cutlass/include/cute/arch 2026-04-14T21:02:50,256 copying 3rdparty/cutlass/include/cute/arch/copy_sm75.hpp -> build/lib/flashinfer/data/cutlass/include/cute/arch 2026-04-14T21:02:50,258 copying 3rdparty/cutlass/include/cute/arch/mma_sm89.hpp -> build/lib/flashinfer/data/cutlass/include/cute/arch 2026-04-14T21:02:50,261 copying 3rdparty/cutlass/include/cute/arch/simd_sm100.hpp -> build/lib/flashinfer/data/cutlass/include/cute/arch 2026-04-14T21:02:50,263 copying 3rdparty/cutlass/include/cute/arch/mma_sm90_gmma.hpp -> build/lib/flashinfer/data/cutlass/include/cute/arch 2026-04-14T21:02:50,280 copying 3rdparty/cutlass/include/cute/pointer_sparse.hpp -> build/lib/flashinfer/data/cutlass/include/cute 2026-04-14T21:02:50,282 copying 3rdparty/cutlass/include/cute/config.hpp -> build/lib/flashinfer/data/cutlass/include/cute 2026-04-14T21:02:50,285 copying 3rdparty/cutlass/include/cute/stride.hpp -> build/lib/flashinfer/data/cutlass/include/cute 2026-04-14T21:02:50,288 copying 3rdparty/cutlass/include/cute/pointer_base.hpp -> build/lib/flashinfer/data/cutlass/include/cute 2026-04-14T21:02:50,290 creating build/lib/flashinfer/data/cutlass/include/cute/numeric 2026-04-14T21:02:50,291 copying 3rdparty/cutlass/include/cute/numeric/integer_sequence.hpp -> build/lib/flashinfer/data/cutlass/include/cute/numeric 2026-04-14T21:02:50,294 copying 3rdparty/cutlass/include/cute/numeric/integral_ratio.hpp -> build/lib/flashinfer/data/cutlass/include/cute/numeric 2026-04-14T21:02:50,296 copying 3rdparty/cutlass/include/cute/numeric/numeric_types.hpp -> build/lib/flashinfer/data/cutlass/include/cute/numeric 2026-04-14T21:02:50,299 copying 3rdparty/cutlass/include/cute/numeric/arithmetic_tuple.hpp -> build/lib/flashinfer/data/cutlass/include/cute/numeric 2026-04-14T21:02:50,301 copying 3rdparty/cutlass/include/cute/numeric/integral_constant.hpp -> build/lib/flashinfer/data/cutlass/include/cute/numeric 2026-04-14T21:02:50,304 copying 3rdparty/cutlass/include/cute/numeric/complex.hpp -> build/lib/flashinfer/data/cutlass/include/cute/numeric 2026-04-14T21:02:50,306 copying 3rdparty/cutlass/include/cute/numeric/math.hpp -> build/lib/flashinfer/data/cutlass/include/cute/numeric 2026-04-14T21:02:50,309 copying 3rdparty/cutlass/include/cute/numeric/real.hpp -> build/lib/flashinfer/data/cutlass/include/cute/numeric 2026-04-14T21:02:50,311 copying 3rdparty/cutlass/include/cute/numeric/int.hpp -> build/lib/flashinfer/data/cutlass/include/cute/numeric 2026-04-14T21:02:50,313 creating build/lib/flashinfer/data/cutlass/include/cute/container 2026-04-14T21:02:50,314 copying 3rdparty/cutlass/include/cute/container/array_aligned.hpp -> build/lib/flashinfer/data/cutlass/include/cute/container 2026-04-14T21:02:50,316 copying 3rdparty/cutlass/include/cute/container/array_subbyte.hpp -> build/lib/flashinfer/data/cutlass/include/cute/container 2026-04-14T21:02:50,319 copying 3rdparty/cutlass/include/cute/container/cuda_types.hpp -> build/lib/flashinfer/data/cutlass/include/cute/container 2026-04-14T21:02:50,321 copying 3rdparty/cutlass/include/cute/container/type_list.hpp -> build/lib/flashinfer/data/cutlass/include/cute/container 2026-04-14T21:02:50,323 copying 3rdparty/cutlass/include/cute/container/bit_field.hpp -> build/lib/flashinfer/data/cutlass/include/cute/container 2026-04-14T21:02:50,325 copying 3rdparty/cutlass/include/cute/container/alignment.hpp -> build/lib/flashinfer/data/cutlass/include/cute/container 2026-04-14T21:02:50,327 copying 3rdparty/cutlass/include/cute/container/tuple.hpp -> build/lib/flashinfer/data/cutlass/include/cute/container 2026-04-14T21:02:50,331 copying 3rdparty/cutlass/include/cute/container/array.hpp -> build/lib/flashinfer/data/cutlass/include/cute/container 2026-04-14T21:02:50,334 copying 3rdparty/cutlass/include/cute/tensor_zip.hpp -> build/lib/flashinfer/data/cutlass/include/cute 2026-04-14T21:02:50,338 copying 3rdparty/cutlass/include/cute/swizzle_layout.hpp -> build/lib/flashinfer/data/cutlass/include/cute 2026-04-14T21:02:50,341 copying 3rdparty/cutlass/include/cute/layout.hpp -> build/lib/flashinfer/data/cutlass/include/cute 2026-04-14T21:02:50,346 creating build/lib/flashinfer/data/cutlass/include/cute/algorithm 2026-04-14T21:02:50,348 copying 3rdparty/cutlass/include/cute/algorithm/copy.hpp -> build/lib/flashinfer/data/cutlass/include/cute/algorithm 2026-04-14T21:02:50,352 copying 3rdparty/cutlass/include/cute/algorithm/prefetch.hpp -> build/lib/flashinfer/data/cutlass/include/cute/algorithm 2026-04-14T21:02:50,356 copying 3rdparty/cutlass/include/cute/algorithm/axpby.hpp -> build/lib/flashinfer/data/cutlass/include/cute/algorithm 2026-04-14T21:02:50,359 copying 3rdparty/cutlass/include/cute/algorithm/cooperative_copy.hpp -> build/lib/flashinfer/data/cutlass/include/cute/algorithm 2026-04-14T21:02:50,363 copying 3rdparty/cutlass/include/cute/algorithm/gemm.hpp -> build/lib/flashinfer/data/cutlass/include/cute/algorithm 2026-04-14T21:02:50,367 copying 3rdparty/cutlass/include/cute/algorithm/tensor_reduce.hpp -> build/lib/flashinfer/data/cutlass/include/cute/algorithm 2026-04-14T21:02:50,371 copying 3rdparty/cutlass/include/cute/algorithm/tuple_algorithms.hpp -> build/lib/flashinfer/data/cutlass/include/cute/algorithm 2026-04-14T21:02:50,375 copying 3rdparty/cutlass/include/cute/algorithm/clear.hpp -> build/lib/flashinfer/data/cutlass/include/cute/algorithm 2026-04-14T21:02:50,379 copying 3rdparty/cutlass/include/cute/algorithm/tensor_algorithms.hpp -> build/lib/flashinfer/data/cutlass/include/cute/algorithm 2026-04-14T21:02:50,382 copying 3rdparty/cutlass/include/cute/algorithm/prefer.hpp -> build/lib/flashinfer/data/cutlass/include/cute/algorithm 2026-04-14T21:02:50,385 copying 3rdparty/cutlass/include/cute/algorithm/fill.hpp -> build/lib/flashinfer/data/cutlass/include/cute/algorithm 2026-04-14T21:02:50,389 copying 3rdparty/cutlass/include/cute/algorithm/cooperative_gemm.hpp -> build/lib/flashinfer/data/cutlass/include/cute/algorithm 2026-04-14T21:02:50,393 copying 3rdparty/cutlass/include/cute/algorithm/functional.hpp -> build/lib/flashinfer/data/cutlass/include/cute/algorithm 2026-04-14T21:02:50,396 copying 3rdparty/cutlass/include/cute/underscore.hpp -> build/lib/flashinfer/data/cutlass/include/cute 2026-04-14T21:02:50,400 copying 3rdparty/cutlass/include/cute/int_tuple.hpp -> build/lib/flashinfer/data/cutlass/include/cute 2026-04-14T21:02:50,404 creating build/lib/flashinfer/data/cutlass/include/cute/atom 2026-04-14T21:02:50,406 copying 3rdparty/cutlass/include/cute/atom/mma_traits_sm90_gmma_ext.hpp -> build/lib/flashinfer/data/cutlass/include/cute/atom 2026-04-14T21:02:50,424 copying 3rdparty/cutlass/include/cute/atom/mma_atom.hpp -> build/lib/flashinfer/data/cutlass/include/cute/atom 2026-04-14T21:02:50,427 copying 3rdparty/cutlass/include/cute/atom/mma_traits_sm90_gmma_sparse.hpp -> build/lib/flashinfer/data/cutlass/include/cute/atom 2026-04-14T21:02:50,435 copying 3rdparty/cutlass/include/cute/atom/mma_traits_sm70.hpp -> build/lib/flashinfer/data/cutlass/include/cute/atom 2026-04-14T21:02:50,437 copying 3rdparty/cutlass/include/cute/atom/mma_traits_sm120_sparse.hpp -> build/lib/flashinfer/data/cutlass/include/cute/atom 2026-04-14T21:02:50,440 copying 3rdparty/cutlass/include/cute/atom/mma_traits_sm61.hpp -> build/lib/flashinfer/data/cutlass/include/cute/atom 2026-04-14T21:02:50,442 copying 3rdparty/cutlass/include/cute/atom/copy_traits_sm75.hpp -> build/lib/flashinfer/data/cutlass/include/cute/atom 2026-04-14T21:02:50,444 copying 3rdparty/cutlass/include/cute/atom/mma_traits_sm90_gmma_sparse_ext.hpp -> build/lib/flashinfer/data/cutlass/include/cute/atom 2026-04-14T21:02:50,455 copying 3rdparty/cutlass/include/cute/atom/mma_traits_sm90.hpp -> build/lib/flashinfer/data/cutlass/include/cute/atom 2026-04-14T21:02:50,458 copying 3rdparty/cutlass/include/cute/atom/mma_traits_sm89.hpp -> build/lib/flashinfer/data/cutlass/include/cute/atom 2026-04-14T21:02:50,460 copying 3rdparty/cutlass/include/cute/atom/copy_traits_sm100.hpp -> build/lib/flashinfer/data/cutlass/include/cute/atom 2026-04-14T21:02:50,465 copying 3rdparty/cutlass/include/cute/atom/copy_traits_sm90.hpp -> build/lib/flashinfer/data/cutlass/include/cute/atom 2026-04-14T21:02:50,467 copying 3rdparty/cutlass/include/cute/atom/copy_traits_sm100_im2col.hpp -> build/lib/flashinfer/data/cutlass/include/cute/atom 2026-04-14T21:02:50,470 copying 3rdparty/cutlass/include/cute/atom/copy_traits_sm80.hpp -> build/lib/flashinfer/data/cutlass/include/cute/atom 2026-04-14T21:02:50,472 copying 3rdparty/cutlass/include/cute/atom/copy_traits_sm50.hpp -> build/lib/flashinfer/data/cutlass/include/cute/atom 2026-04-14T21:02:50,474 copying 3rdparty/cutlass/include/cute/atom/mma_traits_sm80.hpp -> build/lib/flashinfer/data/cutlass/include/cute/atom 2026-04-14T21:02:50,477 copying 3rdparty/cutlass/include/cute/atom/partitioner.hpp -> build/lib/flashinfer/data/cutlass/include/cute/atom 2026-04-14T21:02:50,479 copying 3rdparty/cutlass/include/cute/atom/mma_traits_sm90_gmma.hpp -> build/lib/flashinfer/data/cutlass/include/cute/atom 2026-04-14T21:02:50,486 copying 3rdparty/cutlass/include/cute/atom/copy_traits.hpp -> build/lib/flashinfer/data/cutlass/include/cute/atom 2026-04-14T21:02:50,489 copying 3rdparty/cutlass/include/cute/atom/copy_atom.hpp -> build/lib/flashinfer/data/cutlass/include/cute/atom 2026-04-14T21:02:50,491 copying 3rdparty/cutlass/include/cute/atom/mma_traits_sm120.hpp -> build/lib/flashinfer/data/cutlass/include/cute/atom 2026-04-14T21:02:50,494 copying 3rdparty/cutlass/include/cute/atom/copy_traits_sm90_im2col.hpp -> build/lib/flashinfer/data/cutlass/include/cute/atom 2026-04-14T21:02:50,497 copying 3rdparty/cutlass/include/cute/atom/mma_traits.hpp -> build/lib/flashinfer/data/cutlass/include/cute/atom 2026-04-14T21:02:50,500 copying 3rdparty/cutlass/include/cute/atom/mma_traits_sm75.hpp -> build/lib/flashinfer/data/cutlass/include/cute/atom 2026-04-14T21:02:50,502 copying 3rdparty/cutlass/include/cute/atom/mma_traits_sm100.hpp -> build/lib/flashinfer/data/cutlass/include/cute/atom 2026-04-14T21:02:50,508 copying 3rdparty/cutlass/include/cute/atom/copy_traits_sm100_tma.hpp -> build/lib/flashinfer/data/cutlass/include/cute/atom 2026-04-14T21:02:50,511 copying 3rdparty/cutlass/include/cute/atom/copy_traits_sm90_tma_swizzle.hpp -> build/lib/flashinfer/data/cutlass/include/cute/atom 2026-04-14T21:02:50,513 copying 3rdparty/cutlass/include/cute/atom/copy_traits_sm90_tma.hpp -> build/lib/flashinfer/data/cutlass/include/cute/atom 2026-04-14T21:02:50,517 copying 3rdparty/cutlass/include/cute/layout_composed.hpp -> build/lib/flashinfer/data/cutlass/include/cute 2026-04-14T21:02:50,519 copying 3rdparty/cutlass/include/cute/tensor.hpp -> build/lib/flashinfer/data/cutlass/include/cute 2026-04-14T21:02:50,521 copying 3rdparty/cutlass/include/cute/pointer_flagged.hpp -> build/lib/flashinfer/data/cutlass/include/cute 2026-04-14T21:02:50,524 copying 3rdparty/cutlass/include/cute/pointer_swizzle.hpp -> build/lib/flashinfer/data/cutlass/include/cute 2026-04-14T21:02:50,526 copying 3rdparty/cutlass/include/cute/tensor_impl.hpp -> build/lib/flashinfer/data/cutlass/include/cute 2026-04-14T21:02:50,529 copying 3rdparty/cutlass/include/cute/pointer.hpp -> build/lib/flashinfer/data/cutlass/include/cute 2026-04-14T21:02:50,532 creating build/lib/flashinfer/data/cutlass/include/cute/util 2026-04-14T21:02:50,533 copying 3rdparty/cutlass/include/cute/util/type_traits.hpp -> build/lib/flashinfer/data/cutlass/include/cute/util 2026-04-14T21:02:50,535 copying 3rdparty/cutlass/include/cute/util/print_latex.hpp -> build/lib/flashinfer/data/cutlass/include/cute/util 2026-04-14T21:02:50,538 copying 3rdparty/cutlass/include/cute/util/print.hpp -> build/lib/flashinfer/data/cutlass/include/cute/util 2026-04-14T21:02:50,540 copying 3rdparty/cutlass/include/cute/util/print_tensor.hpp -> build/lib/flashinfer/data/cutlass/include/cute/util 2026-04-14T21:02:50,543 copying 3rdparty/cutlass/include/cute/util/print_svg.hpp -> build/lib/flashinfer/data/cutlass/include/cute/util 2026-04-14T21:02:50,545 copying 3rdparty/cutlass/include/cute/util/debug.hpp -> build/lib/flashinfer/data/cutlass/include/cute/util 2026-04-14T21:02:50,547 creating build/lib/flashinfer/data/cutlass/include/cutlass/platform 2026-04-14T21:02:50,549 copying 3rdparty/cutlass/include/cutlass/platform/platform.h -> build/lib/flashinfer/data/cutlass/include/cutlass/platform 2026-04-14T21:02:50,552 copying 3rdparty/cutlass/include/cutlass/uint128.h -> build/lib/flashinfer/data/cutlass/include/cutlass 2026-04-14T21:02:50,555 copying 3rdparty/cutlass/include/cutlass/tensor_ref_planar_complex.h -> build/lib/flashinfer/data/cutlass/include/cutlass 2026-04-14T21:02:50,558 copying 3rdparty/cutlass/include/cutlass/kernel_hardware_info.h -> build/lib/flashinfer/data/cutlass/include/cutlass 2026-04-14T21:02:50,560 copying 3rdparty/cutlass/include/cutlass/array.h -> build/lib/flashinfer/data/cutlass/include/cutlass 2026-04-14T21:02:50,563 copying 3rdparty/cutlass/include/cutlass/cutlass.h -> build/lib/flashinfer/data/cutlass/include/cutlass 2026-04-14T21:02:50,566 creating build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-14T21:02:50,567 copying 3rdparty/cutlass/include/cutlass/epilogue/threadblock/epilogue_planar_complex.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-14T21:02:50,570 copying 3rdparty/cutlass/include/cutlass/epilogue/threadblock/epilogue_with_visitor_callbacks.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-14T21:02:50,573 copying 3rdparty/cutlass/include/cutlass/epilogue/threadblock/direct_store_epilogue_iterator.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-14T21:02:50,576 copying 3rdparty/cutlass/include/cutlass/epilogue/threadblock/default_epilogue_with_broadcast.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-14T21:02:50,579 copying 3rdparty/cutlass/include/cutlass/epilogue/threadblock/default_epilogue_planar_complex.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-14T21:02:50,581 copying 3rdparty/cutlass/include/cutlass/epilogue/threadblock/predicated_tile_iterator_affine.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-14T21:02:50,584 copying 3rdparty/cutlass/include/cutlass/epilogue/threadblock/default_thread_map_wmma_tensor_op.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-14T21:02:50,586 copying 3rdparty/cutlass/include/cutlass/epilogue/threadblock/epilogue_workspace.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-14T21:02:50,589 copying 3rdparty/cutlass/include/cutlass/epilogue/threadblock/epilogue_base.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-14T21:02:50,591 copying 3rdparty/cutlass/include/cutlass/epilogue/threadblock/default_epilogue_with_absmax.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-14T21:02:50,593 copying 3rdparty/cutlass/include/cutlass/epilogue/threadblock/predicated_tile_iterator_blas3.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-14T21:02:50,596 copying 3rdparty/cutlass/include/cutlass/epilogue/threadblock/default_epilogue_direct_store.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-14T21:02:50,598 copying 3rdparty/cutlass/include/cutlass/epilogue/threadblock/predicated_tile_iterator_affine_layout_params.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-14T21:02:50,601 copying 3rdparty/cutlass/include/cutlass/epilogue/threadblock/epilogue_depthwise.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-14T21:02:50,603 copying 3rdparty/cutlass/include/cutlass/epilogue/threadblock/epilogue_with_scaling_factor.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-14T21:02:50,606 copying 3rdparty/cutlass/include/cutlass/epilogue/threadblock/epilogue_streamk_with_broadcast.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-14T21:02:50,608 copying 3rdparty/cutlass/include/cutlass/epilogue/threadblock/default_thread_map_volta_tensor_op.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-14T21:02:50,611 copying 3rdparty/cutlass/include/cutlass/epilogue/threadblock/output_iterator_parameter.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-14T21:02:50,613 copying 3rdparty/cutlass/include/cutlass/epilogue/threadblock/predicated_tile_iterator_params.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-14T21:02:50,616 copying 3rdparty/cutlass/include/cutlass/epilogue/threadblock/default_epilogue_wmma_tensor_op.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-14T21:02:50,619 copying 3rdparty/cutlass/include/cutlass/epilogue/threadblock/epilogue_base_streamk.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-14T21:02:50,622 copying 3rdparty/cutlass/include/cutlass/epilogue/threadblock/default_epilogue_complex_tensor_op_blas3.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-14T21:02:50,624 copying 3rdparty/cutlass/include/cutlass/epilogue/threadblock/predicated_tile_iterator_direct_conv.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-14T21:02:50,627 copying 3rdparty/cutlass/include/cutlass/epilogue/threadblock/interleaved_epilogue.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-14T21:02:50,630 copying 3rdparty/cutlass/include/cutlass/epilogue/threadblock/shared_load_iterator.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-14T21:02:50,632 copying 3rdparty/cutlass/include/cutlass/epilogue/threadblock/epilogue_with_reduction.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-14T21:02:50,634 copying 3rdparty/cutlass/include/cutlass/epilogue/threadblock/epilogue_direct_store.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-14T21:02:50,637 copying 3rdparty/cutlass/include/cutlass/epilogue/threadblock/epilogue_with_broadcast.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-14T21:02:50,640 copying 3rdparty/cutlass/include/cutlass/epilogue/threadblock/predicated_tile_iterator.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-14T21:02:50,643 copying 3rdparty/cutlass/include/cutlass/epilogue/threadblock/default_epilogue_simt.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-14T21:02:50,646 copying 3rdparty/cutlass/include/cutlass/epilogue/threadblock/default_epilogue_complex_tensor_op.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-14T21:02:50,649 copying 3rdparty/cutlass/include/cutlass/epilogue/threadblock/shared_load_iterator_mixed.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-14T21:02:50,652 copying 3rdparty/cutlass/include/cutlass/epilogue/threadblock/epilogue_smem_accumulator.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-14T21:02:50,654 creating build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/fusion 2026-04-14T21:02:50,655 copying 3rdparty/cutlass/include/cutlass/epilogue/threadblock/fusion/visitor_2x.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/fusion 2026-04-14T21:02:50,658 copying 3rdparty/cutlass/include/cutlass/epilogue/threadblock/fusion/visitor_load.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/fusion 2026-04-14T21:02:50,661 copying 3rdparty/cutlass/include/cutlass/epilogue/threadblock/fusion/visitors.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/fusion 2026-04-14T21:02:50,663 copying 3rdparty/cutlass/include/cutlass/epilogue/threadblock/fusion/visitor_store.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/fusion 2026-04-14T21:02:50,666 copying 3rdparty/cutlass/include/cutlass/epilogue/threadblock/fusion/visitor_compute.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/fusion 2026-04-14T21:02:50,668 copying 3rdparty/cutlass/include/cutlass/epilogue/threadblock/epilogue_with_visitor.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-14T21:02:50,671 copying 3rdparty/cutlass/include/cutlass/epilogue/threadblock/epilogue_with_absmax.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-14T21:02:50,674 copying 3rdparty/cutlass/include/cutlass/epilogue/threadblock/shared_load_iterator_pitch_linear.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-14T21:02:50,676 copying 3rdparty/cutlass/include/cutlass/epilogue/threadblock/output_tile_thread_map.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-14T21:02:50,679 copying 3rdparty/cutlass/include/cutlass/epilogue/threadblock/default_epilogue_tensor_op_blas3.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-14T21:02:50,682 copying 3rdparty/cutlass/include/cutlass/epilogue/threadblock/epilogue_visitor_with_softmax.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-14T21:02:50,685 copying 3rdparty/cutlass/include/cutlass/epilogue/threadblock/predicated_tile_iterator_strided_dgrad.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-14T21:02:50,687 copying 3rdparty/cutlass/include/cutlass/epilogue/threadblock/default_epilogue_tensor_op.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-14T21:02:50,690 copying 3rdparty/cutlass/include/cutlass/epilogue/threadblock/default_epilogue_with_reduction.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-14T21:02:50,692 copying 3rdparty/cutlass/include/cutlass/epilogue/threadblock/epilogue_gemm_k_reduction.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-14T21:02:50,695 copying 3rdparty/cutlass/include/cutlass/epilogue/threadblock/epilogue.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-14T21:02:50,698 copying 3rdparty/cutlass/include/cutlass/epilogue/threadblock/predicated_tile_iterator_conv.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-14T21:02:50,701 copying 3rdparty/cutlass/include/cutlass/epilogue/threadblock/predicated_tile_iterator_predicates.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-14T21:02:50,703 copying 3rdparty/cutlass/include/cutlass/epilogue/threadblock/default_epilogue_volta_tensor_op.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-14T21:02:50,706 copying 3rdparty/cutlass/include/cutlass/epilogue/threadblock/default_thread_map_tensor_op.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-14T21:02:50,708 copying 3rdparty/cutlass/include/cutlass/epilogue/threadblock/default_thread_map_simt.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-14T21:02:50,711 creating build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/warp 2026-04-14T21:02:50,712 copying 3rdparty/cutlass/include/cutlass/epilogue/warp/fragment_iterator_volta_tensor_op.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/warp 2026-04-14T21:02:50,715 copying 3rdparty/cutlass/include/cutlass/epilogue/warp/tile_iterator_tensor_op_mixed.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/warp 2026-04-14T21:02:50,718 copying 3rdparty/cutlass/include/cutlass/epilogue/warp/fragment_iterator_wmma_tensor_op.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/warp 2026-04-14T21:02:50,720 copying 3rdparty/cutlass/include/cutlass/epilogue/warp/fragment_iterator_complex_tensor_op.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/warp 2026-04-14T21:02:50,723 copying 3rdparty/cutlass/include/cutlass/epilogue/warp/wmma_tensor_op_policy.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/warp 2026-04-14T21:02:50,725 copying 3rdparty/cutlass/include/cutlass/epilogue/warp/tile_iterator_tensor_op.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/warp 2026-04-14T21:02:50,728 copying 3rdparty/cutlass/include/cutlass/epilogue/warp/tile_iterator_wmma_tensor_op.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/warp 2026-04-14T21:02:50,731 copying 3rdparty/cutlass/include/cutlass/epilogue/warp/fragment_iterator_gaussian_complex_tensor_op.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/warp 2026-04-14T21:02:50,733 copying 3rdparty/cutlass/include/cutlass/epilogue/warp/tile_iterator_simt.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/warp 2026-04-14T21:02:50,736 copying 3rdparty/cutlass/include/cutlass/epilogue/warp/fragment_iterator_tensor_op.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/warp 2026-04-14T21:02:50,740 copying 3rdparty/cutlass/include/cutlass/epilogue/warp/simt_policy.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/warp 2026-04-14T21:02:50,743 copying 3rdparty/cutlass/include/cutlass/epilogue/warp/fragment_iterator_simt.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/warp 2026-04-14T21:02:50,746 copying 3rdparty/cutlass/include/cutlass/epilogue/warp/tensor_op_policy.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/warp 2026-04-14T21:02:50,749 copying 3rdparty/cutlass/include/cutlass/epilogue/warp/tile_iterator_volta_tensor_op.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/warp 2026-04-14T21:02:50,752 copying 3rdparty/cutlass/include/cutlass/epilogue/warp/volta_tensor_op_policy.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/warp 2026-04-14T21:02:50,755 creating build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/thread 2026-04-14T21:02:50,756 copying 3rdparty/cutlass/include/cutlass/epilogue/thread/linear_combination_drelu.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/thread 2026-04-14T21:02:50,759 copying 3rdparty/cutlass/include/cutlass/epilogue/thread/linear_combination_silu.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/thread 2026-04-14T21:02:50,761 copying 3rdparty/cutlass/include/cutlass/epilogue/thread/activation.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/thread 2026-04-14T21:02:50,764 copying 3rdparty/cutlass/include/cutlass/epilogue/thread/linear_combination_relu.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/thread 2026-04-14T21:02:50,767 copying 3rdparty/cutlass/include/cutlass/epilogue/thread/linear_combination_bias_relu.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/thread 2026-04-14T21:02:50,770 copying 3rdparty/cutlass/include/cutlass/epilogue/thread/detail.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/thread 2026-04-14T21:02:50,772 copying 3rdparty/cutlass/include/cutlass/epilogue/thread/linear_combination_dgelu.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/thread 2026-04-14T21:02:50,774 copying 3rdparty/cutlass/include/cutlass/epilogue/thread/linear_combination_leaky_relu.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/thread 2026-04-14T21:02:50,777 copying 3rdparty/cutlass/include/cutlass/epilogue/thread/linear_combination_relu0.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/thread 2026-04-14T21:02:50,780 copying 3rdparty/cutlass/include/cutlass/epilogue/thread/linear_combination_tensor_broadcast.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/thread 2026-04-14T21:02:50,782 copying 3rdparty/cutlass/include/cutlass/epilogue/thread/linear_combination_params.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/thread 2026-04-14T21:02:50,784 copying 3rdparty/cutlass/include/cutlass/epilogue/thread/linear_combination_sigmoid.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/thread 2026-04-14T21:02:50,786 copying 3rdparty/cutlass/include/cutlass/epilogue/thread/linear_combination_generic_with_scaling.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/thread 2026-04-14T21:02:50,789 copying 3rdparty/cutlass/include/cutlass/epilogue/thread/linear_combination.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/thread 2026-04-14T21:02:50,792 copying 3rdparty/cutlass/include/cutlass/epilogue/thread/scale_type.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/thread 2026-04-14T21:02:50,794 copying 3rdparty/cutlass/include/cutlass/epilogue/thread/linear_combination_with_elementwise.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/thread 2026-04-14T21:02:50,797 copying 3rdparty/cutlass/include/cutlass/epilogue/thread/linear_combination_planar_complex.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/thread 2026-04-14T21:02:50,799 copying 3rdparty/cutlass/include/cutlass/epilogue/thread/linear_combination_generic.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/thread 2026-04-14T21:02:50,802 copying 3rdparty/cutlass/include/cutlass/epilogue/thread/linear_combination_residual_block.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/thread 2026-04-14T21:02:50,805 copying 3rdparty/cutlass/include/cutlass/epilogue/thread/linear_combination_gelu.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/thread 2026-04-14T21:02:50,807 copying 3rdparty/cutlass/include/cutlass/epilogue/thread/reduction_op.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/thread 2026-04-14T21:02:50,809 copying 3rdparty/cutlass/include/cutlass/epilogue/thread/linear_combination_bias_elementwise.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/thread 2026-04-14T21:02:50,812 copying 3rdparty/cutlass/include/cutlass/epilogue/thread/linear_combination_hardswish.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/thread 2026-04-14T21:02:50,814 copying 3rdparty/cutlass/include/cutlass/epilogue/thread/linear_combination_clamp.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/thread 2026-04-14T21:02:50,817 copying 3rdparty/cutlass/include/cutlass/epilogue/thread/conversion_op.h -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/thread 2026-04-14T21:02:50,819 creating build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/collective 2026-04-14T21:02:50,820 copying 3rdparty/cutlass/include/cutlass/epilogue/collective/sm90_epilogue_tma_warpspecialized.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/collective 2026-04-14T21:02:50,824 copying 3rdparty/cutlass/include/cutlass/epilogue/collective/sm70_epilogue_vectorized.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/collective 2026-04-14T21:02:50,827 copying 3rdparty/cutlass/include/cutlass/epilogue/collective/sm90_epilogue_array_tma_warpspecialized.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/collective 2026-04-14T21:02:50,830 copying 3rdparty/cutlass/include/cutlass/epilogue/collective/sm100_epilogue_array_planar_complex_tma_warpspecialized.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/collective 2026-04-14T21:02:50,834 copying 3rdparty/cutlass/include/cutlass/epilogue/collective/detail.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/collective 2026-04-14T21:02:50,837 copying 3rdparty/cutlass/include/cutlass/epilogue/collective/collective_epilogue.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/collective 2026-04-14T21:02:50,839 copying 3rdparty/cutlass/include/cutlass/epilogue/collective/sm100_epilogue_planar_complex_tma_warpspecialized.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/collective 2026-04-14T21:02:50,842 copying 3rdparty/cutlass/include/cutlass/epilogue/collective/sm70_epilogue_vectorized_array.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/collective 2026-04-14T21:02:50,845 copying 3rdparty/cutlass/include/cutlass/epilogue/collective/collective_builder.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/collective 2026-04-14T21:02:50,848 copying 3rdparty/cutlass/include/cutlass/epilogue/collective/sm100_epilogue_array_tma_warpspecialized.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/collective 2026-04-14T21:02:50,853 copying 3rdparty/cutlass/include/cutlass/epilogue/collective/sm90_epilogue_tma_warpspecialized_bias_elementwise.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/collective 2026-04-14T21:02:50,857 creating build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/collective/builders 2026-04-14T21:02:50,858 copying 3rdparty/cutlass/include/cutlass/epilogue/collective/builders/sm120_common.inl -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/collective/builders 2026-04-14T21:02:50,861 copying 3rdparty/cutlass/include/cutlass/epilogue/collective/builders/sm90_common.inl -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/collective/builders 2026-04-14T21:02:50,863 copying 3rdparty/cutlass/include/cutlass/epilogue/collective/builders/sm90_builder.inl -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/collective/builders 2026-04-14T21:02:50,866 copying 3rdparty/cutlass/include/cutlass/epilogue/collective/builders/sm103_builder.inl -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/collective/builders 2026-04-14T21:02:50,869 copying 3rdparty/cutlass/include/cutlass/epilogue/collective/builders/sm100_builder.inl -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/collective/builders 2026-04-14T21:02:50,873 copying 3rdparty/cutlass/include/cutlass/epilogue/collective/builders/sm120_builder.inl -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/collective/builders 2026-04-14T21:02:50,876 copying 3rdparty/cutlass/include/cutlass/epilogue/collective/sm100_epilogue_nosmem.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/collective 2026-04-14T21:02:50,879 copying 3rdparty/cutlass/include/cutlass/epilogue/collective/sm100_epilogue_array_nosmem.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/collective 2026-04-14T21:02:50,882 copying 3rdparty/cutlass/include/cutlass/epilogue/collective/epilogue_tensor_broadcast.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/collective 2026-04-14T21:02:50,885 copying 3rdparty/cutlass/include/cutlass/epilogue/collective/sm100_epilogue_array_planar_complex_nosmem.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/collective 2026-04-14T21:02:50,888 copying 3rdparty/cutlass/include/cutlass/epilogue/collective/default_epilogue_array.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/collective 2026-04-14T21:02:50,891 copying 3rdparty/cutlass/include/cutlass/epilogue/collective/default_epilogue.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/collective 2026-04-14T21:02:50,893 copying 3rdparty/cutlass/include/cutlass/epilogue/collective/sm100_epilogue_tma_warpspecialized.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/collective 2026-04-14T21:02:50,897 creating build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/fusion 2026-04-14T21:02:50,898 copying 3rdparty/cutlass/include/cutlass/epilogue/fusion/sm90_visitor_compute_tma_warpspecialized.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/fusion 2026-04-14T21:02:50,901 copying 3rdparty/cutlass/include/cutlass/epilogue/fusion/sm90_callbacks_tma_warpspecialized.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/fusion 2026-04-14T21:02:50,905 copying 3rdparty/cutlass/include/cutlass/epilogue/fusion/callbacks.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/fusion 2026-04-14T21:02:50,907 copying 3rdparty/cutlass/include/cutlass/epilogue/fusion/sm90_visitor_tma_warpspecialized.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/fusion 2026-04-14T21:02:50,910 copying 3rdparty/cutlass/include/cutlass/epilogue/fusion/sm90_visitor_load_tma_warpspecialized.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/fusion 2026-04-14T21:02:50,913 copying 3rdparty/cutlass/include/cutlass/epilogue/fusion/operations.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/fusion 2026-04-14T21:02:50,916 copying 3rdparty/cutlass/include/cutlass/epilogue/fusion/sm90_visitor_topk_softmax.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/fusion 2026-04-14T21:02:50,919 copying 3rdparty/cutlass/include/cutlass/epilogue/fusion/sm120_callbacks_tma_warpspecialized.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/fusion 2026-04-14T21:02:50,922 copying 3rdparty/cutlass/include/cutlass/epilogue/fusion/sm100_visitor_store_tma_warpspecialized.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/fusion 2026-04-14T21:02:50,926 copying 3rdparty/cutlass/include/cutlass/epilogue/fusion/sm90_visitor_store_tma_warpspecialized.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/fusion 2026-04-14T21:02:50,929 copying 3rdparty/cutlass/include/cutlass/epilogue/fusion/sm100_callbacks_tma_warpspecialized.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/fusion 2026-04-14T21:02:50,933 copying 3rdparty/cutlass/include/cutlass/epilogue/fusion/sm100_visitor_compute_tma_warpspecialized.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/fusion 2026-04-14T21:02:50,936 copying 3rdparty/cutlass/include/cutlass/epilogue/fusion/sm120_visitor_store_tma_warpspecialized.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/fusion 2026-04-14T21:02:50,939 copying 3rdparty/cutlass/include/cutlass/epilogue/dispatch_policy.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/epilogue 2026-04-14T21:02:50,942 creating build/lib/flashinfer/data/cutlass/include/cutlass/experimental/distributed/schedules 2026-04-14T21:02:50,944 copying 3rdparty/cutlass/include/cutlass/experimental/distributed/schedules/dist_gemm_base_schedule.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/experimental/distributed/schedules 2026-04-14T21:02:50,947 copying 3rdparty/cutlass/include/cutlass/experimental/distributed/schedules/dist_gemm_1d_schedules.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/experimental/distributed/schedules 2026-04-14T21:02:50,950 creating build/lib/flashinfer/data/cutlass/include/cutlass/experimental/distributed/device 2026-04-14T21:02:50,951 copying 3rdparty/cutlass/include/cutlass/experimental/distributed/device/detail.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/experimental/distributed/device 2026-04-14T21:02:50,953 copying 3rdparty/cutlass/include/cutlass/experimental/distributed/device/full_barrier.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/experimental/distributed/device 2026-04-14T21:02:50,956 copying 3rdparty/cutlass/include/cutlass/experimental/distributed/device/dist_gemm_universal_wrapper.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/experimental/distributed/device 2026-04-14T21:02:50,958 creating build/lib/flashinfer/data/cutlass/include/cutlass/experimental/distributed/kernel 2026-04-14T21:02:50,959 copying 3rdparty/cutlass/include/cutlass/experimental/distributed/kernel/detail.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/experimental/distributed/kernel 2026-04-14T21:02:50,962 copying 3rdparty/cutlass/include/cutlass/experimental/distributed/kernel/full_barrier.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/experimental/distributed/kernel 2026-04-14T21:02:50,964 copying 3rdparty/cutlass/include/cutlass/experimental/distributed/kernel/dist_gemm_kernel_wrapper.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/experimental/distributed/kernel 2026-04-14T21:02:50,966 copying 3rdparty/cutlass/include/cutlass/uint256.h -> build/lib/flashinfer/data/cutlass/include/cutlass 2026-04-14T21:02:50,968 creating build/lib/flashinfer/data/cutlass/include/cutlass/arch 2026-04-14T21:02:50,969 copying 3rdparty/cutlass/include/cutlass/arch/cache_operation.h -> build/lib/flashinfer/data/cutlass/include/cutlass/arch 2026-04-14T21:02:50,971 copying 3rdparty/cutlass/include/cutlass/arch/mma_sm70.h -> build/lib/flashinfer/data/cutlass/include/cutlass/arch 2026-04-14T21:02:50,974 copying 3rdparty/cutlass/include/cutlass/arch/mma_sm50.h -> build/lib/flashinfer/data/cutlass/include/cutlass/arch 2026-04-14T21:02:50,977 copying 3rdparty/cutlass/include/cutlass/arch/mma_sparse_sm89.h -> build/lib/flashinfer/data/cutlass/include/cutlass/arch 2026-04-14T21:02:50,980 copying 3rdparty/cutlass/include/cutlass/arch/mma.h -> build/lib/flashinfer/data/cutlass/include/cutlass/arch 2026-04-14T21:02:50,982 copying 3rdparty/cutlass/include/cutlass/arch/wmma_sm75.h -> build/lib/flashinfer/data/cutlass/include/cutlass/arch 2026-04-14T21:02:50,985 copying 3rdparty/cutlass/include/cutlass/arch/wmma.h -> build/lib/flashinfer/data/cutlass/include/cutlass/arch 2026-04-14T21:02:50,987 copying 3rdparty/cutlass/include/cutlass/arch/config.h -> build/lib/flashinfer/data/cutlass/include/cutlass/arch 2026-04-14T21:02:50,990 copying 3rdparty/cutlass/include/cutlass/arch/arch.h -> build/lib/flashinfer/data/cutlass/include/cutlass/arch 2026-04-14T21:02:50,992 copying 3rdparty/cutlass/include/cutlass/arch/wmma_sm70.h -> build/lib/flashinfer/data/cutlass/include/cutlass/arch 2026-04-14T21:02:50,994 copying 3rdparty/cutlass/include/cutlass/arch/mma_sm75.h -> build/lib/flashinfer/data/cutlass/include/cutlass/arch 2026-04-14T21:02:50,997 copying 3rdparty/cutlass/include/cutlass/arch/memory.h -> build/lib/flashinfer/data/cutlass/include/cutlass/arch 2026-04-14T21:02:51,000 copying 3rdparty/cutlass/include/cutlass/arch/memory_sm80.h -> build/lib/flashinfer/data/cutlass/include/cutlass/arch 2026-04-14T21:02:51,003 copying 3rdparty/cutlass/include/cutlass/arch/grid_dependency_control.h -> build/lib/flashinfer/data/cutlass/include/cutlass/arch 2026-04-14T21:02:51,005 copying 3rdparty/cutlass/include/cutlass/arch/simd_sm61.h -> build/lib/flashinfer/data/cutlass/include/cutlass/arch 2026-04-14T21:02:51,008 copying 3rdparty/cutlass/include/cutlass/arch/wmma_sm72.h -> build/lib/flashinfer/data/cutlass/include/cutlass/arch 2026-04-14T21:02:51,010 copying 3rdparty/cutlass/include/cutlass/arch/memory_sm75.h -> build/lib/flashinfer/data/cutlass/include/cutlass/arch 2026-04-14T21:02:51,013 copying 3rdparty/cutlass/include/cutlass/arch/synclog.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/arch 2026-04-14T21:02:51,015 copying 3rdparty/cutlass/include/cutlass/arch/mma_sm80.h -> build/lib/flashinfer/data/cutlass/include/cutlass/arch 2026-04-14T21:02:51,018 copying 3rdparty/cutlass/include/cutlass/arch/mma_sm60.h -> build/lib/flashinfer/data/cutlass/include/cutlass/arch 2026-04-14T21:02:51,021 copying 3rdparty/cutlass/include/cutlass/arch/reg_reconfig.h -> build/lib/flashinfer/data/cutlass/include/cutlass/arch 2026-04-14T21:02:51,023 copying 3rdparty/cutlass/include/cutlass/arch/mma_sm89.h -> build/lib/flashinfer/data/cutlass/include/cutlass/arch 2026-04-14T21:02:51,026 copying 3rdparty/cutlass/include/cutlass/arch/mma_sm61.h -> build/lib/flashinfer/data/cutlass/include/cutlass/arch 2026-04-14T21:02:51,028 copying 3rdparty/cutlass/include/cutlass/arch/barrier.h -> build/lib/flashinfer/data/cutlass/include/cutlass/arch 2026-04-14T21:02:51,031 copying 3rdparty/cutlass/include/cutlass/arch/mma_sm90.h -> build/lib/flashinfer/data/cutlass/include/cutlass/arch 2026-04-14T21:02:51,033 copying 3rdparty/cutlass/include/cutlass/arch/simd_sm60.h -> build/lib/flashinfer/data/cutlass/include/cutlass/arch 2026-04-14T21:02:51,035 copying 3rdparty/cutlass/include/cutlass/arch/mma_sparse_sm80.h -> build/lib/flashinfer/data/cutlass/include/cutlass/arch 2026-04-14T21:02:51,038 copying 3rdparty/cutlass/include/cutlass/arch/mma_sm100.h -> build/lib/flashinfer/data/cutlass/include/cutlass/arch 2026-04-14T21:02:51,040 copying 3rdparty/cutlass/include/cutlass/arch/simd.h -> build/lib/flashinfer/data/cutlass/include/cutlass/arch 2026-04-14T21:02:51,042 copying 3rdparty/cutlass/include/cutlass/tensor_view_planar_complex.h -> build/lib/flashinfer/data/cutlass/include/cutlass 2026-04-14T21:02:51,045 copying 3rdparty/cutlass/include/cutlass/numeric_size.h -> build/lib/flashinfer/data/cutlass/include/cutlass 2026-04-14T21:02:51,047 copying 3rdparty/cutlass/include/cutlass/blas3_types.h -> build/lib/flashinfer/data/cutlass/include/cutlass 2026-04-14T21:02:51,049 copying 3rdparty/cutlass/include/cutlass/kernel_hardware_info.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass 2026-04-14T21:02:51,051 copying 3rdparty/cutlass/include/cutlass/subbyte_reference.h -> build/lib/flashinfer/data/cutlass/include/cutlass 2026-04-14T21:02:51,054 copying 3rdparty/cutlass/include/cutlass/tensor_ref.h -> build/lib/flashinfer/data/cutlass/include/cutlass 2026-04-14T21:02:51,057 copying 3rdparty/cutlass/include/cutlass/floating_point_nvrtc.h -> build/lib/flashinfer/data/cutlass/include/cutlass 2026-04-14T21:02:51,059 copying 3rdparty/cutlass/include/cutlass/real.h -> build/lib/flashinfer/data/cutlass/include/cutlass 2026-04-14T21:02:51,061 copying 3rdparty/cutlass/include/cutlass/numeric_conversion.h -> build/lib/flashinfer/data/cutlass/include/cutlass 2026-04-14T21:02:51,070 copying 3rdparty/cutlass/include/cutlass/semaphore.h -> build/lib/flashinfer/data/cutlass/include/cutlass 2026-04-14T21:02:51,072 copying 3rdparty/cutlass/include/cutlass/tensor_coord.h -> build/lib/flashinfer/data/cutlass/include/cutlass 2026-04-14T21:02:51,074 copying 3rdparty/cutlass/include/cutlass/float_subbyte.h -> build/lib/flashinfer/data/cutlass/include/cutlass 2026-04-14T21:02:51,078 copying 3rdparty/cutlass/include/cutlass/aligned_buffer.h -> build/lib/flashinfer/data/cutlass/include/cutlass 2026-04-14T21:02:51,080 copying 3rdparty/cutlass/include/cutlass/block_striped.h -> build/lib/flashinfer/data/cutlass/include/cutlass 2026-04-14T21:02:51,082 copying 3rdparty/cutlass/include/cutlass/constants.h -> build/lib/flashinfer/data/cutlass/include/cutlass 2026-04-14T21:02:51,085 copying 3rdparty/cutlass/include/cutlass/functional.h -> build/lib/flashinfer/data/cutlass/include/cutlass 2026-04-14T21:02:51,088 copying 3rdparty/cutlass/include/cutlass/matrix.h -> build/lib/flashinfer/data/cutlass/include/cutlass 2026-04-14T21:02:51,167 copying 3rdparty/cutlass/include/cutlass/coord.h -> build/lib/flashinfer/data/cutlass/include/cutlass 2026-04-14T21:02:51,170 copying 3rdparty/cutlass/include/cutlass/exmy_base.h -> build/lib/flashinfer/data/cutlass/include/cutlass 2026-04-14T21:02:51,173 copying 3rdparty/cutlass/include/cutlass/cluster_launch.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass 2026-04-14T21:02:51,176 copying 3rdparty/cutlass/include/cutlass/integer_subbyte.h -> build/lib/flashinfer/data/cutlass/include/cutlass 2026-04-14T21:02:51,178 copying 3rdparty/cutlass/include/cutlass/version.h -> build/lib/flashinfer/data/cutlass/include/cutlass 2026-04-14T21:02:51,180 creating build/lib/flashinfer/data/cutlass/include/cutlass/layout 2026-04-14T21:02:51,181 copying 3rdparty/cutlass/include/cutlass/layout/vector.h -> build/lib/flashinfer/data/cutlass/include/cutlass/layout 2026-04-14T21:02:51,184 copying 3rdparty/cutlass/include/cutlass/layout/matrix.h -> build/lib/flashinfer/data/cutlass/include/cutlass/layout 2026-04-14T21:02:51,187 copying 3rdparty/cutlass/include/cutlass/layout/tensor_op_multiplicand_sm70.h -> build/lib/flashinfer/data/cutlass/include/cutlass/layout 2026-04-14T21:02:51,190 copying 3rdparty/cutlass/include/cutlass/layout/tensor_op_multiplicand_sm75.h -> build/lib/flashinfer/data/cutlass/include/cutlass/layout 2026-04-14T21:02:51,193 copying 3rdparty/cutlass/include/cutlass/layout/tensor_op_multiplicand_sm80.h -> build/lib/flashinfer/data/cutlass/include/cutlass/layout 2026-04-14T21:02:51,196 copying 3rdparty/cutlass/include/cutlass/layout/tensor.h -> build/lib/flashinfer/data/cutlass/include/cutlass/layout 2026-04-14T21:02:51,198 copying 3rdparty/cutlass/include/cutlass/layout/pitch_linear.h -> build/lib/flashinfer/data/cutlass/include/cutlass/layout 2026-04-14T21:02:51,201 copying 3rdparty/cutlass/include/cutlass/layout/permute.h -> build/lib/flashinfer/data/cutlass/include/cutlass/layout 2026-04-14T21:02:51,204 copying 3rdparty/cutlass/include/cutlass/layout/layout.h -> build/lib/flashinfer/data/cutlass/include/cutlass/layout 2026-04-14T21:02:51,206 copying 3rdparty/cutlass/include/cutlass/device_kernel.h -> build/lib/flashinfer/data/cutlass/include/cutlass 2026-04-14T21:02:51,208 creating build/lib/flashinfer/data/cutlass/include/cutlass/gemm 2026-04-14T21:02:51,209 copying 3rdparty/cutlass/include/cutlass/gemm/gemm.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm 2026-04-14T21:02:51,212 creating build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-14T21:02:51,213 copying 3rdparty/cutlass/include/cutlass/gemm/threadblock/mma_with_reduction_multistage.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-14T21:02:51,216 copying 3rdparty/cutlass/include/cutlass/gemm/threadblock/default_mma_core_wmma.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-14T21:02:51,219 copying 3rdparty/cutlass/include/cutlass/gemm/threadblock/ell_mma_pipelined.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-14T21:02:51,222 copying 3rdparty/cutlass/include/cutlass/gemm/threadblock/default_multistage_mma_complex_core_sm80.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-14T21:02:51,225 copying 3rdparty/cutlass/include/cutlass/gemm/threadblock/mma_planar_complex_pipelined.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-14T21:02:51,228 copying 3rdparty/cutlass/include/cutlass/gemm/threadblock/default_gemv_core.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-14T21:02:51,230 copying 3rdparty/cutlass/include/cutlass/gemm/threadblock/default_mma_core_sparse_sm80.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-14T21:02:51,233 copying 3rdparty/cutlass/include/cutlass/gemm/threadblock/threadblock_swizzle.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-14T21:02:51,236 copying 3rdparty/cutlass/include/cutlass/gemm/threadblock/default_mma_core_with_access_size.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-14T21:02:51,238 copying 3rdparty/cutlass/include/cutlass/gemm/threadblock/mma_layernorm_mainloop_fusion_multistage.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-14T21:02:51,241 copying 3rdparty/cutlass/include/cutlass/gemm/threadblock/default_mma_core_sm70.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-14T21:02:51,244 copying 3rdparty/cutlass/include/cutlass/gemm/threadblock/gemv.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-14T21:02:51,246 copying 3rdparty/cutlass/include/cutlass/gemm/threadblock/default_multistage_trmm_complex.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-14T21:02:51,249 copying 3rdparty/cutlass/include/cutlass/gemm/threadblock/default_mma_core_simt.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-14T21:02:51,252 copying 3rdparty/cutlass/include/cutlass/gemm/threadblock/default_mma_with_reduction.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-14T21:02:51,255 copying 3rdparty/cutlass/include/cutlass/gemm/threadblock/mma_singlestage.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-14T21:02:51,257 copying 3rdparty/cutlass/include/cutlass/gemm/threadblock/mma_softmax_mainloop_fusion_multistage.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-14T21:02:51,260 copying 3rdparty/cutlass/include/cutlass/gemm/threadblock/ell_mma_multistage.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-14T21:02:51,263 copying 3rdparty/cutlass/include/cutlass/gemm/threadblock/default_trmm.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-14T21:02:51,265 copying 3rdparty/cutlass/include/cutlass/gemm/threadblock/default_mma_multistage_blockwise.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-14T21:02:51,268 copying 3rdparty/cutlass/include/cutlass/gemm/threadblock/mma_pipelined.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-14T21:02:51,271 copying 3rdparty/cutlass/include/cutlass/gemm/threadblock/mma_planar_complex_multistage.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-14T21:02:51,273 copying 3rdparty/cutlass/include/cutlass/gemm/threadblock/mma_multistage.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-14T21:02:51,276 copying 3rdparty/cutlass/include/cutlass/gemm/threadblock/default_mma_core_sm80.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-14T21:02:51,281 copying 3rdparty/cutlass/include/cutlass/gemm/threadblock/default_mma_core_sm75.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-14T21:02:51,284 copying 3rdparty/cutlass/include/cutlass/gemm/threadblock/threadblock_swizzle_streamk.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-14T21:02:51,287 copying 3rdparty/cutlass/include/cutlass/gemm/threadblock/default_ell_mma.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-14T21:02:51,290 copying 3rdparty/cutlass/include/cutlass/gemm/threadblock/default_multistage_mma_complex.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-14T21:02:51,293 copying 3rdparty/cutlass/include/cutlass/gemm/threadblock/mma_planar_complex_base.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-14T21:02:51,295 copying 3rdparty/cutlass/include/cutlass/gemm/threadblock/mma_multistage_blockwise.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-14T21:02:51,298 copying 3rdparty/cutlass/include/cutlass/gemm/threadblock/index_remat.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-14T21:02:51,300 copying 3rdparty/cutlass/include/cutlass/gemm/threadblock/default_multistage_mma_complex_core.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-14T21:02:51,302 copying 3rdparty/cutlass/include/cutlass/gemm/threadblock/default_sparse_mma.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-14T21:02:51,305 copying 3rdparty/cutlass/include/cutlass/gemm/threadblock/default_mma_core_with_reduction.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-14T21:02:51,307 copying 3rdparty/cutlass/include/cutlass/gemm/threadblock/default_mma_layernorm_mainloop_fusion.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-14T21:02:51,310 copying 3rdparty/cutlass/include/cutlass/gemm/threadblock/default_mma.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-14T21:02:51,313 copying 3rdparty/cutlass/include/cutlass/gemm/threadblock/default_mma_core.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-14T21:02:51,315 copying 3rdparty/cutlass/include/cutlass/gemm/threadblock/mma_sparse_multistage.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-14T21:02:51,318 copying 3rdparty/cutlass/include/cutlass/gemm/threadblock/mma_sparse_base.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-14T21:02:51,321 copying 3rdparty/cutlass/include/cutlass/gemm/threadblock/mma_blas3_multistage.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-14T21:02:51,324 copying 3rdparty/cutlass/include/cutlass/gemm/threadblock/mma_base.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-14T21:02:51,326 copying 3rdparty/cutlass/include/cutlass/gemm/threadblock/default_mma_softmax_mainloop_fusion.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-14T21:02:51,329 copying 3rdparty/cutlass/include/cutlass/gemm/threadblock/default_mma_planar_complex_multistage.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-14T21:02:51,331 copying 3rdparty/cutlass/include/cutlass/gemm/threadblock/default_mma_planar_complex_pipelined.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-14T21:02:51,334 creating build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp 2026-04-14T21:02:51,335 copying 3rdparty/cutlass/include/cutlass/gemm/warp/mma_complex_tensor_op.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp 2026-04-14T21:02:51,338 copying 3rdparty/cutlass/include/cutlass/gemm/warp/scale_bias_tile_iterator.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp 2026-04-14T21:02:51,341 copying 3rdparty/cutlass/include/cutlass/gemm/warp/mma_simt.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp 2026-04-14T21:02:51,343 copying 3rdparty/cutlass/include/cutlass/gemm/warp/default_mma_complex_tensor_op.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp 2026-04-14T21:02:51,346 copying 3rdparty/cutlass/include/cutlass/gemm/warp/mma_tensor_op_fragment_iterator.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp 2026-04-14T21:02:51,349 copying 3rdparty/cutlass/include/cutlass/gemm/warp/mma_tensor_op_tile_iterator_wmma.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp 2026-04-14T21:02:51,352 copying 3rdparty/cutlass/include/cutlass/gemm/warp/mma_tensor_op_policy.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp 2026-04-14T21:02:51,354 copying 3rdparty/cutlass/include/cutlass/gemm/warp/mma.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp 2026-04-14T21:02:51,356 copying 3rdparty/cutlass/include/cutlass/gemm/warp/mma_tensor_op_tile_iterator.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp 2026-04-14T21:02:51,361 copying 3rdparty/cutlass/include/cutlass/gemm/warp/mma_gaussian_complex_tensor_op.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp 2026-04-14T21:02:51,363 copying 3rdparty/cutlass/include/cutlass/gemm/warp/mma_with_reduction_tensor_op.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp 2026-04-14T21:02:51,366 copying 3rdparty/cutlass/include/cutlass/gemm/warp/default_mma_tensor_op.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp 2026-04-14T21:02:51,369 copying 3rdparty/cutlass/include/cutlass/gemm/warp/default_mma_tensor_op_sm80.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp 2026-04-14T21:02:51,371 copying 3rdparty/cutlass/include/cutlass/gemm/warp/mma_tensor_op.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp 2026-04-14T21:02:51,374 copying 3rdparty/cutlass/include/cutlass/gemm/warp/mma_tensor_op_wmma.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp 2026-04-14T21:02:51,376 copying 3rdparty/cutlass/include/cutlass/gemm/warp/mma_tensor_op_sm70.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp 2026-04-14T21:02:51,379 copying 3rdparty/cutlass/include/cutlass/gemm/warp/default_mma_wmma_tensor_op.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp 2026-04-14T21:02:51,381 copying 3rdparty/cutlass/include/cutlass/gemm/warp/mma_gaussian_complex_tensor_op_tile_iterator_sm80.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp 2026-04-14T21:02:51,384 copying 3rdparty/cutlass/include/cutlass/gemm/warp/mma_tensor_op_tile_iterator_sparse.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp 2026-04-14T21:02:51,387 copying 3rdparty/cutlass/include/cutlass/gemm/warp/mma_tensor_op_tile_iterator_sm80.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp 2026-04-14T21:02:51,391 copying 3rdparty/cutlass/include/cutlass/gemm/warp/default_mma_with_reduction_tensor_op.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp 2026-04-14T21:02:51,393 copying 3rdparty/cutlass/include/cutlass/gemm/warp/layernorm_scale_bias_transform.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp 2026-04-14T21:02:51,395 copying 3rdparty/cutlass/include/cutlass/gemm/warp/mma_mixed_input_tensor_op.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp 2026-04-14T21:02:51,398 copying 3rdparty/cutlass/include/cutlass/gemm/warp/mma_simt_policy.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp 2026-04-14T21:02:51,400 copying 3rdparty/cutlass/include/cutlass/gemm/warp/mma_sparse_tensor_op.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp 2026-04-14T21:02:51,403 copying 3rdparty/cutlass/include/cutlass/gemm/warp/mma_tensor_op_fast_f32.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp 2026-04-14T21:02:51,405 copying 3rdparty/cutlass/include/cutlass/gemm/warp/mma_planar_complex.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp 2026-04-14T21:02:51,407 copying 3rdparty/cutlass/include/cutlass/gemm/warp/mma_tensor_op_tile_access_iterator.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp 2026-04-14T21:02:51,410 copying 3rdparty/cutlass/include/cutlass/gemm/warp/tile_iterator_planar_complex.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp 2026-04-14T21:02:51,412 copying 3rdparty/cutlass/include/cutlass/gemm/warp/mma_simt_tile_iterator.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp 2026-04-14T21:02:51,416 copying 3rdparty/cutlass/include/cutlass/gemm/warp/default_mma_sparse_tensor_op.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp 2026-04-14T21:02:51,418 copying 3rdparty/cutlass/include/cutlass/gemm/warp/mma_tensor_op_tile_iterator_sm70.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp 2026-04-14T21:02:51,422 copying 3rdparty/cutlass/include/cutlass/gemm/warp/mma_complex_tensor_op_fast_f32.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp 2026-04-14T21:02:51,425 copying 3rdparty/cutlass/include/cutlass/gemm/warp/softmax_scale_bias_transform.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp 2026-04-14T21:02:51,427 copying 3rdparty/cutlass/include/cutlass/gemm/warp/mma_complex_tensor_op_tile_iterator_sm80.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp 2026-04-14T21:02:51,431 creating build/lib/flashinfer/data/cutlass/include/cutlass/gemm/device 2026-04-14T21:02:51,432 copying 3rdparty/cutlass/include/cutlass/gemm/device/gemm_with_k_reduction.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/device 2026-04-14T21:02:51,435 copying 3rdparty/cutlass/include/cutlass/gemm/device/gemm.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/device 2026-04-14T21:02:51,438 copying 3rdparty/cutlass/include/cutlass/gemm/device/gemm_universal_base.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/device 2026-04-14T21:02:51,440 copying 3rdparty/cutlass/include/cutlass/gemm/device/gemm_universal_with_absmax.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/device 2026-04-14T21:02:51,443 copying 3rdparty/cutlass/include/cutlass/gemm/device/rank_k.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/device 2026-04-14T21:02:51,446 copying 3rdparty/cutlass/include/cutlass/gemm/device/gemv_blockscaled.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/device 2026-04-14T21:02:51,448 copying 3rdparty/cutlass/include/cutlass/gemm/device/symm.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/device 2026-04-14T21:02:51,451 copying 3rdparty/cutlass/include/cutlass/gemm/device/trmm.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/device 2026-04-14T21:02:51,453 copying 3rdparty/cutlass/include/cutlass/gemm/device/base_grouped.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/device 2026-04-14T21:02:51,456 copying 3rdparty/cutlass/include/cutlass/gemm/device/gemm_layernorm_mainloop_fusion.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/device 2026-04-14T21:02:51,459 copying 3rdparty/cutlass/include/cutlass/gemm/device/gemv.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/device 2026-04-14T21:02:51,462 copying 3rdparty/cutlass/include/cutlass/gemm/device/gemm_sparse_universal_with_absmax.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/device 2026-04-14T21:02:51,464 copying 3rdparty/cutlass/include/cutlass/gemm/device/rank_2k.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/device 2026-04-14T21:02:51,467 copying 3rdparty/cutlass/include/cutlass/gemm/device/gemm_batched.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/device 2026-04-14T21:02:51,470 copying 3rdparty/cutlass/include/cutlass/gemm/device/gemm_blockwise.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/device 2026-04-14T21:02:51,474 copying 3rdparty/cutlass/include/cutlass/gemm/device/gemm_splitk_parallel.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/device 2026-04-14T21:02:51,477 copying 3rdparty/cutlass/include/cutlass/gemm/device/gemm_universal_adapter.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/device 2026-04-14T21:02:51,480 copying 3rdparty/cutlass/include/cutlass/gemm/device/gemm_sparse_universal.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/device 2026-04-14T21:02:51,483 copying 3rdparty/cutlass/include/cutlass/gemm/device/ell_gemm.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/device 2026-04-14T21:02:51,486 copying 3rdparty/cutlass/include/cutlass/gemm/device/gemm_grouped.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/device 2026-04-14T21:02:51,488 copying 3rdparty/cutlass/include/cutlass/gemm/device/rank_2k_grouped.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/device 2026-04-14T21:02:51,491 copying 3rdparty/cutlass/include/cutlass/gemm/device/gemm_complex.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/device 2026-04-14T21:02:51,494 copying 3rdparty/cutlass/include/cutlass/gemm/device/gemm_array.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/device 2026-04-14T21:02:51,497 copying 3rdparty/cutlass/include/cutlass/gemm/device/gemm_sparse_with_absmax.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/device 2026-04-14T21:02:51,500 copying 3rdparty/cutlass/include/cutlass/gemm/device/gemm_universal.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/device 2026-04-14T21:02:51,502 copying 3rdparty/cutlass/include/cutlass/gemm/device/gemm_sparse_with_visitor.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/device 2026-04-14T21:02:51,505 copying 3rdparty/cutlass/include/cutlass/gemm/device/gemm_universal_with_broadcast.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/device 2026-04-14T21:02:51,508 copying 3rdparty/cutlass/include/cutlass/gemm/device/default_gemm_configuration.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/device 2026-04-14T21:02:51,511 copying 3rdparty/cutlass/include/cutlass/gemm/device/gemm_universal_streamk_with_broadcast.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/device 2026-04-14T21:02:51,514 copying 3rdparty/cutlass/include/cutlass/gemm/device/gemm_sparse.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/device 2026-04-14T21:02:51,517 copying 3rdparty/cutlass/include/cutlass/gemm/group_array_problem_shape.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm 2026-04-14T21:02:51,520 creating build/lib/flashinfer/data/cutlass/include/cutlass/gemm/thread 2026-04-14T21:02:51,521 copying 3rdparty/cutlass/include/cutlass/gemm/thread/mma_sm50.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/thread 2026-04-14T21:02:51,524 copying 3rdparty/cutlass/include/cutlass/gemm/thread/mma.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/thread 2026-04-14T21:02:51,527 copying 3rdparty/cutlass/include/cutlass/gemm/thread/mma_sm60.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/thread 2026-04-14T21:02:51,530 copying 3rdparty/cutlass/include/cutlass/gemm/thread/mma_sm61.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/thread 2026-04-14T21:02:51,533 creating build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-14T21:02:51,534 copying 3rdparty/cutlass/include/cutlass/gemm/collective/sm120_blockscaled_mma_array_tma.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-14T21:02:51,538 copying 3rdparty/cutlass/include/cutlass/gemm/collective/sm100_blockscaled_mma_warpspecialized.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-14T21:02:51,542 copying 3rdparty/cutlass/include/cutlass/gemm/collective/sm120_mma_tma_blockwise_scaling.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-14T21:02:51,545 copying 3rdparty/cutlass/include/cutlass/gemm/collective/sm90_mma_multistage_gmma_rs_warpspecialized.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-14T21:02:51,549 copying 3rdparty/cutlass/include/cutlass/gemm/collective/sm120_sparse_mma_tma.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-14T21:02:51,553 copying 3rdparty/cutlass/include/cutlass/gemm/collective/sm90_mma_array_tma_gmma_ss_warpspecialized_fp8.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-14T21:02:51,556 copying 3rdparty/cutlass/include/cutlass/gemm/collective/sm100_blockscaled_mma_mixed_tma_cpasync_warpspecialized.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-14T21:02:51,560 copying 3rdparty/cutlass/include/cutlass/gemm/collective/sm103_blockscaled_mma_warpspecialized.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-14T21:02:51,564 copying 3rdparty/cutlass/include/cutlass/gemm/collective/collective_mma_decl.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-14T21:02:51,567 copying 3rdparty/cutlass/include/cutlass/gemm/collective/sm90_mma_tma_gmma_ss_warpspecialized.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-14T21:02:51,570 copying 3rdparty/cutlass/include/cutlass/gemm/collective/sm90_mma_tma_gmma_rs_warpspecialized_mixed_input.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-14T21:02:51,574 copying 3rdparty/cutlass/include/cutlass/gemm/collective/sm120_mma_tma.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-14T21:02:51,578 copying 3rdparty/cutlass/include/cutlass/gemm/collective/sm100_mma_array_warpspecialized.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-14T21:02:51,581 copying 3rdparty/cutlass/include/cutlass/gemm/collective/sm100_mma_array_warpspecialized_interleaved_complex_emulated.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-14T21:02:51,586 copying 3rdparty/cutlass/include/cutlass/gemm/collective/sm90_mma_tma_gmma_rs_warpspecialized.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-14T21:02:51,589 copying 3rdparty/cutlass/include/cutlass/gemm/collective/sm90_mma_array_tma_gmma_ss_warpspecialized.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-14T21:02:51,593 copying 3rdparty/cutlass/include/cutlass/gemm/collective/sm100_mma_array_warpspecialized_planar_complex.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-14T21:02:51,597 copying 3rdparty/cutlass/include/cutlass/gemm/collective/sm100_mma_warpspecialized_emulated.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-14T21:02:51,600 copying 3rdparty/cutlass/include/cutlass/gemm/collective/collective_builder.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-14T21:02:51,603 copying 3rdparty/cutlass/include/cutlass/gemm/collective/sm70_mma_twostage.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-14T21:02:51,606 copying 3rdparty/cutlass/include/cutlass/gemm/collective/fp8_accumulation.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-14T21:02:51,609 copying 3rdparty/cutlass/include/cutlass/gemm/collective/sm90_mma_multistage_gmma_ss_warpspecialized.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-14T21:02:51,613 copying 3rdparty/cutlass/include/cutlass/gemm/collective/sm100_mma_array_warpspecialized_rcggemm.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-14T21:02:51,617 copying 3rdparty/cutlass/include/cutlass/gemm/collective/sm120_blockscaled_sparse_mma_tma.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-14T21:02:51,621 copying 3rdparty/cutlass/include/cutlass/gemm/collective/sm120_blockscaled_mma_tma.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-14T21:02:51,625 copying 3rdparty/cutlass/include/cutlass/gemm/collective/sm100_mma_warpspecialized_blockwise_scaling.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-14T21:02:51,629 copying 3rdparty/cutlass/include/cutlass/gemm/collective/sm120_mma_array_tma_blockwise_scaling.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-14T21:02:51,632 copying 3rdparty/cutlass/include/cutlass/gemm/collective/sm100_blockscaled_mma_array_warpspecialized.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-14T21:02:51,636 copying 3rdparty/cutlass/include/cutlass/gemm/collective/sm80_mma_array_multistage.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-14T21:02:51,640 copying 3rdparty/cutlass/include/cutlass/gemm/collective/sm100_mma_cpasync_warpspecialized.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-14T21:02:51,643 copying 3rdparty/cutlass/include/cutlass/gemm/collective/sm100_mma_array_warpspecialized_interleaved_complex_tf32.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-14T21:02:51,646 copying 3rdparty/cutlass/include/cutlass/gemm/collective/sm90_mma_array_tma_gmma_rs_warpspecialized_mixed_input.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-14T21:02:51,651 copying 3rdparty/cutlass/include/cutlass/gemm/collective/sm90_mma_tma_gmma_ss.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-14T21:02:51,654 copying 3rdparty/cutlass/include/cutlass/gemm/collective/sm90_mma_tma_gmma_ss_warpspecialized_fp8_blockwise_scaling.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-14T21:02:51,658 creating build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/builders 2026-04-14T21:02:51,659 copying 3rdparty/cutlass/include/cutlass/gemm/collective/builders/sm100_interleaved_complex_umma_builder.inl -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/builders 2026-04-14T21:02:51,662 copying 3rdparty/cutlass/include/cutlass/gemm/collective/builders/sm100_blockwise_umma_builder.inl -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/builders 2026-04-14T21:02:51,665 copying 3rdparty/cutlass/include/cutlass/gemm/collective/builders/sm90_sparse_config.inl -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/builders 2026-04-14T21:02:51,669 copying 3rdparty/cutlass/include/cutlass/gemm/collective/builders/sm100_9xBF16_umma_builder.inl -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/builders 2026-04-14T21:02:51,671 copying 3rdparty/cutlass/include/cutlass/gemm/collective/builders/sm100_blockscaled_sparse_umma_builder.inl -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/builders 2026-04-14T21:02:51,675 copying 3rdparty/cutlass/include/cutlass/gemm/collective/builders/sm120_blockscaled_sparse_mma_builder.inl -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/builders 2026-04-14T21:02:51,678 copying 3rdparty/cutlass/include/cutlass/gemm/collective/builders/sm120_common.inl -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/builders 2026-04-14T21:02:51,680 copying 3rdparty/cutlass/include/cutlass/gemm/collective/builders/sm120_blockscaled_mma_builder.inl -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/builders 2026-04-14T21:02:51,683 copying 3rdparty/cutlass/include/cutlass/gemm/collective/builders/sm100_9xBF16_interleaved_complex_umma_builder.inl -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/builders 2026-04-14T21:02:51,687 copying 3rdparty/cutlass/include/cutlass/gemm/collective/builders/sm120_blockwise_mma_builder.inl -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/builders 2026-04-14T21:02:51,690 copying 3rdparty/cutlass/include/cutlass/gemm/collective/builders/sm100_mixed_input_umma_builder.inl -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/builders 2026-04-14T21:02:51,693 copying 3rdparty/cutlass/include/cutlass/gemm/collective/builders/sm90_sparse_gmma_builder.inl -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/builders 2026-04-14T21:02:51,696 copying 3rdparty/cutlass/include/cutlass/gemm/collective/builders/sm100_umma_builder.inl -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/builders 2026-04-14T21:02:51,699 copying 3rdparty/cutlass/include/cutlass/gemm/collective/builders/sm1xx_common.inl -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/builders 2026-04-14T21:02:51,702 copying 3rdparty/cutlass/include/cutlass/gemm/collective/builders/sm90_common.inl -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/builders 2026-04-14T21:02:51,705 copying 3rdparty/cutlass/include/cutlass/gemm/collective/builders/sm120_mma_builder.inl -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/builders 2026-04-14T21:02:51,708 copying 3rdparty/cutlass/include/cutlass/gemm/collective/builders/sm90_gmma_builder.inl -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/builders 2026-04-14T21:02:51,711 copying 3rdparty/cutlass/include/cutlass/gemm/collective/builders/sm100_blockscaled_umma_builder.inl -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/builders 2026-04-14T21:02:51,713 copying 3rdparty/cutlass/include/cutlass/gemm/collective/builders/sm120_sparse_mma_builder.inl -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/builders 2026-04-14T21:02:51,716 copying 3rdparty/cutlass/include/cutlass/gemm/collective/builders/sm100_simt_builder.inl -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/builders 2026-04-14T21:02:51,718 copying 3rdparty/cutlass/include/cutlass/gemm/collective/builders/sm100_mixed_tma_cpasync_umma_builder.inl -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/builders 2026-04-14T21:02:51,721 copying 3rdparty/cutlass/include/cutlass/gemm/collective/builders/sm100_sparse_umma_builder.inl -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/builders 2026-04-14T21:02:51,724 copying 3rdparty/cutlass/include/cutlass/gemm/collective/builders/sm1xx_sparse_config.inl -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/builders 2026-04-14T21:02:51,726 copying 3rdparty/cutlass/include/cutlass/gemm/collective/builders/sm100_cpasync_umma_builder.inl -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/builders 2026-04-14T21:02:51,729 copying 3rdparty/cutlass/include/cutlass/gemm/collective/builders/sm100_planar_complex_umma_builder.inl -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/builders 2026-04-14T21:02:51,731 copying 3rdparty/cutlass/include/cutlass/gemm/collective/builders/sm100_blockscaled_mixed_tma_cpasync_umma_builder.inl -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/builders 2026-04-14T21:02:51,734 copying 3rdparty/cutlass/include/cutlass/gemm/collective/builders/sm103_blockscaled_umma_builder.inl -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/builders 2026-04-14T21:02:51,737 copying 3rdparty/cutlass/include/cutlass/gemm/collective/builders/sm100_common.inl -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/builders 2026-04-14T21:02:51,740 copying 3rdparty/cutlass/include/cutlass/gemm/collective/builders/sm100_pipeline_carveout.inl -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/builders 2026-04-14T21:02:51,743 copying 3rdparty/cutlass/include/cutlass/gemm/collective/sm90_mma_array_tma_gmma_ss_warpspecialized_fp8_blockwise_scaling.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-14T21:02:51,747 copying 3rdparty/cutlass/include/cutlass/gemm/collective/sm90_sparse_mma_tma_gmma_ss_warpspecialized.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-14T21:02:51,750 copying 3rdparty/cutlass/include/cutlass/gemm/collective/sm80_mma_multistage.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-14T21:02:51,753 copying 3rdparty/cutlass/include/cutlass/gemm/collective/sm103_blockscaled_mma_array_warpspecialized.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-14T21:02:51,757 copying 3rdparty/cutlass/include/cutlass/gemm/collective/sm100_mma_mixed_tma_cpasync_warpspecialized.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-14T21:02:51,760 copying 3rdparty/cutlass/include/cutlass/gemm/collective/sm100_sparse_mma_warpspecialized.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-14T21:02:51,763 copying 3rdparty/cutlass/include/cutlass/gemm/collective/sm100_mma_warpspecialized_interleaved_complex_emulated.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-14T21:02:51,767 copying 3rdparty/cutlass/include/cutlass/gemm/collective/sm100_mma_warpspecialized_mixed_input.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-14T21:02:51,771 copying 3rdparty/cutlass/include/cutlass/gemm/collective/sm100_mma_warpspecialized_planar_complex.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-14T21:02:51,774 copying 3rdparty/cutlass/include/cutlass/gemm/collective/sm100_mma_array_warpspecialized_blockwise_scaling.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-14T21:02:51,778 copying 3rdparty/cutlass/include/cutlass/gemm/collective/sm100_mma_warpspecialized.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-14T21:02:51,781 copying 3rdparty/cutlass/include/cutlass/gemm/collective/sm100_mma_warpspecialized_interleaved_complex_tf32.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-14T21:02:51,784 copying 3rdparty/cutlass/include/cutlass/gemm/collective/collective_mma.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-14T21:02:51,786 copying 3rdparty/cutlass/include/cutlass/gemm/collective/sm100_mma_array_warpspecialized_emulated.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-14T21:02:51,789 copying 3rdparty/cutlass/include/cutlass/gemm/collective/sm90_sparse_mma_tma_gmma_ss_warpspecialized_fp8.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-14T21:02:51,792 copying 3rdparty/cutlass/include/cutlass/gemm/collective/sm90_mma_tma_gmma_ss_warpspecialized_fp8.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-14T21:02:51,796 copying 3rdparty/cutlass/include/cutlass/gemm/collective/sm100_blockscaled_sparse_mma_warpspecialized.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-14T21:02:51,799 copying 3rdparty/cutlass/include/cutlass/gemm/collective/collective_builder_decl.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-14T21:02:51,802 copying 3rdparty/cutlass/include/cutlass/gemm/collective/sm100_blockscaled_mma_array_warpspecialized_rcggemm.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-14T21:02:51,806 copying 3rdparty/cutlass/include/cutlass/gemm/dispatch_policy.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm 2026-04-14T21:02:51,809 copying 3rdparty/cutlass/include/cutlass/gemm/gemm_enumerated_types.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm 2026-04-14T21:02:51,811 creating build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-14T21:02:51,812 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/gemm_with_k_reduction.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-14T21:02:51,815 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/sm100_gemm_array_tma_warpspecialized_input_transform.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-14T21:02:51,819 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/sm90_gemm_tma_warpspecialized_pingpong.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-14T21:02:51,822 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/default_gemm_grouped_softmax_mainloop_fusion.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-14T21:02:51,825 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/gemm.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-14T21:02:51,827 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/static_tile_scheduler.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-14T21:02:51,830 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/sm100_tile_scheduler.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-14T21:02:51,834 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/sm90_gemm_tma_warpspecialized.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-14T21:02:51,837 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/sm90_tile_scheduler_group.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-14T21:02:51,840 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/default_rank_k_universal.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-14T21:02:51,845 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/default_gemm_universal.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-14T21:02:51,849 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/gemm_universal.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-14T21:02:51,852 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/default_gemm_sparse_universal_with_absmax.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-14T21:02:51,856 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/default_gemm_universal_with_visitor.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-14T21:02:51,859 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/default_symm.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-14T21:02:51,862 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/gemm_planar_complex_array.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-14T21:02:51,866 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/gemm_grouped_per_group_scale.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-14T21:02:51,870 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/default_gemm_grouped.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-14T21:02:51,873 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/gemm_streamk_with_fused_epilogue.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-14T21:02:51,878 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/sm90_gemm_warpspecialized_cooperative.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-14T21:02:51,882 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/default_gemm_with_broadcast.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-14T21:02:51,886 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/default_symm_universal.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-14T21:02:51,890 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/gemv_blockscaled.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-14T21:02:51,895 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/default_gemm_with_reduction.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-14T21:02:51,898 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/sm70_gemm.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-14T21:02:51,901 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/default_gemm.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-14T21:02:51,905 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/default_gemm_splitk_parallel.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-14T21:02:51,908 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/sm103_blockscaled_gemm_array_tma_warpspecialized.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-14T21:02:51,913 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/gemm_layernorm_mainloop_fusion.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-14T21:02:51,917 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/sparse_gemm_with_absmax.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-14T21:02:51,921 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/default_gemm_streamk_with_broadcast.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-14T21:02:51,924 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/default_gemm_sparse_with_visitor.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-14T21:02:51,928 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/gemv.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-14T21:02:51,931 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/gemm_universal_streamk.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-14T21:02:51,935 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/sm90_gemm_array_tma_warpspecialized_cooperative.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-14T21:02:51,940 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/rank_2k_grouped_problem_visitor.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-14T21:02:51,943 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/default_rank_2k_grouped.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-14T21:02:51,946 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/default_trmm_universal.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-14T21:02:51,948 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/sm100_gemm_array_tma_warpspecialized_mma_transform.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-14T21:02:51,951 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/default_trmm_complex.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-14T21:02:51,954 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/gemm_transpose_operands.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-14T21:02:51,956 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/default_gemm_sparse_with_absmax.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-14T21:02:51,959 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/gemm_sparse_universal_with_absmax.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-14T21:02:51,962 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/default_gemm_grouped_per_group_scale.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-14T21:02:51,965 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/rank_2k_transpose_operands.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-14T21:02:51,968 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/sm100_gemm_cpasync_warpspecialized.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-14T21:02:51,971 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/gemm_params.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-14T21:02:51,974 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/default_gemm_with_absmax.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-14T21:02:51,977 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/gemm_universal_decl.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-14T21:02:51,980 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/sm90_gemm_tma_warpspecialized_cooperative.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-14T21:02:51,983 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/sm103_blockscaled_gemm_tma_warpspecialized.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-14T21:02:51,987 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/gemm_pipelined.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-14T21:02:51,990 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/default_trmm.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-14T21:02:51,993 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/tile_scheduler.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-14T21:02:51,996 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/sm100_gemm_tma_warpspecialized.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-14T21:02:51,999 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/sm90_gemm_warpspecialized.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-14T21:02:52,002 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/sm100_static_tile_scheduler.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-14T21:02:52,004 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/gemm_planar_complex.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-14T21:02:52,007 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/params_universal_base.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-14T21:02:52,009 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/tile_scheduler_detail.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-14T21:02:52,011 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/sm100_tile_scheduler_group.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-14T21:02:52,014 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/gemm_batched.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-14T21:02:52,016 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/sm120_gemm_tma_warpspecialized_cooperative_asymmetric_dma.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-14T21:02:52,019 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/gemv_batched_strided.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-14T21:02:52,022 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/default_gemm_planar_complex_universal.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-14T21:02:52,025 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/gemm_blockwise.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-14T21:02:52,027 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/default_gemm_complex.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-14T21:02:52,030 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/gemm_splitk_parallel.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-14T21:02:52,032 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/gemm_grouped_problem_visitor.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-14T21:02:52,035 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/default_gemm_layernorm_mainloop_fusion.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-14T21:02:52,037 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/gemm_with_absmax.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-14T21:02:52,040 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/sm90_tile_scheduler_stream_k.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-14T21:02:52,044 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/sm100_sparse_gemm_tma_warpspecialized.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-14T21:02:52,048 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/sm100_gemm_tma_warpspecialized_mixed_input_transform.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-14T21:02:52,053 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/default_symm_complex.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-14T21:02:52,056 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/sm100_gemm_tma_warpspecialized_input_transform.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-14T21:02:52,059 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/default_rank_k_complex.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-14T21:02:52,062 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/gemm_grouped_softmax_mainloop_fusion.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-14T21:02:52,064 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/rank_2k_universal.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-14T21:02:52,067 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/gemm_universal_blockwise.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-14T21:02:52,069 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/default_ell_gemm.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-14T21:02:52,072 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/default_rank_k.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-14T21:02:52,075 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/gemm_sparse_universal.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-14T21:02:52,077 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/gemm_universal_with_visitor.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-14T21:02:52,080 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/symm_universal.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-14T21:02:52,082 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/default_rank_2k.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-14T21:02:52,085 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/ell_gemm.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-14T21:02:52,088 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/sm90_gemm_tma.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-14T21:02:52,090 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/gemm_grouped.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-14T21:02:52,092 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/gemm_with_fused_epilogue.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-14T21:02:52,095 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/default_rank_2k_universal.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-14T21:02:52,098 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/rank_2k_grouped.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-14T21:02:52,101 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/sm70_gemm_array.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-14T21:02:52,103 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/default_gemm_with_k_reduction.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-14T21:02:52,106 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/sm100_gemm_array_tma_warpspecialized.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-14T21:02:52,109 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/sm100_tile_scheduler_stream_k.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-14T21:02:52,112 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/tile_scheduler_params.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-14T21:02:52,116 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/sm100_gemm_mixed_tma_cpasync_warpspecialized.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-14T21:02:52,119 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/sm90_gemm_array_tma_warpspecialized_pingpong.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-14T21:02:52,122 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/gemm_array.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-14T21:02:52,125 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/sm90_gemm_warpspecialized_pingpong.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-14T21:02:52,127 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/sparse_gemm.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-14T21:02:52,130 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/gemm_universal_with_visitor_streamk.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-14T21:02:52,133 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/gemm_universal.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-14T21:02:52,136 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/default_gemm_sparse.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-14T21:02:52,138 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/sm90_tile_scheduler.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-14T21:02:52,140 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/default_rank_2k_complex.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-14T21:02:52,143 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/sm100_gemm_tma_warpspecialized_mma_transform.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-14T21:02:52,146 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/default_gemm_sparse_universal.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-14T21:02:52,149 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/grouped_problem_visitor.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-14T21:02:52,151 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/trmm_universal.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-14T21:02:52,154 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/sparse_gemm_with_visitor.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-14T21:02:52,157 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/rank_k_universal.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-14T21:02:52,159 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/default_gemv.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-14T21:02:52,162 copying 3rdparty/cutlass/include/cutlass/gemm/kernel/params_sparse_base.h -> build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-14T21:02:52,164 copying 3rdparty/cutlass/include/cutlass/workspace.h -> build/lib/flashinfer/data/cutlass/include/cutlass 2026-04-14T21:02:52,166 creating build/lib/flashinfer/data/cutlass/include/cutlass/transform/threadblock 2026-04-14T21:02:52,167 copying 3rdparty/cutlass/include/cutlass/transform/threadblock/regular_tile_iterator_tensor_op_sm70.h -> build/lib/flashinfer/data/cutlass/include/cutlass/transform/threadblock 2026-04-14T21:02:52,170 copying 3rdparty/cutlass/include/cutlass/transform/threadblock/regular_tile_iterator_pitch_linear_2dthreadtile.h -> build/lib/flashinfer/data/cutlass/include/cutlass/transform/threadblock 2026-04-14T21:02:52,173 copying 3rdparty/cutlass/include/cutlass/transform/threadblock/predicated_tile_iterator_2dthreadtile.h -> build/lib/flashinfer/data/cutlass/include/cutlass/transform/threadblock 2026-04-14T21:02:52,176 copying 3rdparty/cutlass/include/cutlass/transform/threadblock/ell_iterator.h -> build/lib/flashinfer/data/cutlass/include/cutlass/transform/threadblock 2026-04-14T21:02:52,179 copying 3rdparty/cutlass/include/cutlass/transform/threadblock/predicated_scale_bias_vector_iterator.h -> build/lib/flashinfer/data/cutlass/include/cutlass/transform/threadblock 2026-04-14T21:02:52,181 copying 3rdparty/cutlass/include/cutlass/transform/threadblock/predicated_tile_access_iterator_2dthreadtile.h -> build/lib/flashinfer/data/cutlass/include/cutlass/transform/threadblock 2026-04-14T21:02:52,184 copying 3rdparty/cutlass/include/cutlass/transform/threadblock/regular_scale_bias_vector_access_iterator.h -> build/lib/flashinfer/data/cutlass/include/cutlass/transform/threadblock 2026-04-14T21:02:52,187 copying 3rdparty/cutlass/include/cutlass/transform/threadblock/regular_tile_iterator_tensor_op.h -> build/lib/flashinfer/data/cutlass/include/cutlass/transform/threadblock 2026-04-14T21:02:52,190 copying 3rdparty/cutlass/include/cutlass/transform/threadblock/predicated_tile_access_iterator_triangular_matrix.h -> build/lib/flashinfer/data/cutlass/include/cutlass/transform/threadblock 2026-04-14T21:02:52,193 copying 3rdparty/cutlass/include/cutlass/transform/threadblock/regular_tile_access_iterator.h -> build/lib/flashinfer/data/cutlass/include/cutlass/transform/threadblock 2026-04-14T21:02:52,195 copying 3rdparty/cutlass/include/cutlass/transform/threadblock/predicated_tile_access_iterator_params.h -> build/lib/flashinfer/data/cutlass/include/cutlass/transform/threadblock 2026-04-14T21:02:52,197 copying 3rdparty/cutlass/include/cutlass/transform/threadblock/regular_tile_access_iterator_pitch_linear_direct_conv.h -> build/lib/flashinfer/data/cutlass/include/cutlass/transform/threadblock 2026-04-14T21:02:52,200 copying 3rdparty/cutlass/include/cutlass/transform/threadblock/regular_tile_iterator_pitch_linear.h -> build/lib/flashinfer/data/cutlass/include/cutlass/transform/threadblock 2026-04-14T21:02:52,203 copying 3rdparty/cutlass/include/cutlass/transform/threadblock/predicated_tile_iterator.h -> build/lib/flashinfer/data/cutlass/include/cutlass/transform/threadblock 2026-04-14T21:02:52,207 copying 3rdparty/cutlass/include/cutlass/transform/threadblock/vector_iterator.h -> build/lib/flashinfer/data/cutlass/include/cutlass/transform/threadblock 2026-04-14T21:02:52,210 copying 3rdparty/cutlass/include/cutlass/transform/threadblock/ell_predicated_tile_access_iterator.h -> build/lib/flashinfer/data/cutlass/include/cutlass/transform/threadblock 2026-04-14T21:02:52,213 copying 3rdparty/cutlass/include/cutlass/transform/threadblock/predicated_vector_access_iterator.h -> build/lib/flashinfer/data/cutlass/include/cutlass/transform/threadblock 2026-04-14T21:02:52,216 copying 3rdparty/cutlass/include/cutlass/transform/threadblock/regular_tile_access_iterator_tensor_op_sm80.h -> build/lib/flashinfer/data/cutlass/include/cutlass/transform/threadblock 2026-04-14T21:02:52,220 copying 3rdparty/cutlass/include/cutlass/transform/threadblock/regular_tile_access_iterator_tensor_op.h -> build/lib/flashinfer/data/cutlass/include/cutlass/transform/threadblock 2026-04-14T21:02:52,223 copying 3rdparty/cutlass/include/cutlass/transform/threadblock/ell_predicated_tile_iterator.h -> build/lib/flashinfer/data/cutlass/include/cutlass/transform/threadblock 2026-04-14T21:02:52,228 copying 3rdparty/cutlass/include/cutlass/transform/threadblock/predicated_scale_bias_vector_access_iterator.h -> build/lib/flashinfer/data/cutlass/include/cutlass/transform/threadblock 2026-04-14T21:02:52,230 copying 3rdparty/cutlass/include/cutlass/transform/threadblock/predicated_tile_access_iterator.h -> build/lib/flashinfer/data/cutlass/include/cutlass/transform/threadblock 2026-04-14T21:02:52,234 copying 3rdparty/cutlass/include/cutlass/transform/threadblock/regular_tile_iterator.h -> build/lib/flashinfer/data/cutlass/include/cutlass/transform/threadblock 2026-04-14T21:02:52,236 copying 3rdparty/cutlass/include/cutlass/transform/threadblock/regular_tile_access_iterator_pitch_linear.h -> build/lib/flashinfer/data/cutlass/include/cutlass/transform/threadblock 2026-04-14T21:02:52,239 copying 3rdparty/cutlass/include/cutlass/transform/threadblock/predicated_tile_iterator_triangular_matrix.h -> build/lib/flashinfer/data/cutlass/include/cutlass/transform/threadblock 2026-04-14T21:02:52,241 creating build/lib/flashinfer/data/cutlass/include/cutlass/transform/warp 2026-04-14T21:02:52,242 copying 3rdparty/cutlass/include/cutlass/transform/warp/vector_fragment_iterator.h -> build/lib/flashinfer/data/cutlass/include/cutlass/transform/warp 2026-04-14T21:02:52,245 creating build/lib/flashinfer/data/cutlass/include/cutlass/transform/device 2026-04-14T21:02:52,246 copying 3rdparty/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/transform/device 2026-04-14T21:02:52,248 creating build/lib/flashinfer/data/cutlass/include/cutlass/transform/thread 2026-04-14T21:02:52,249 copying 3rdparty/cutlass/include/cutlass/transform/thread/transpose.h -> build/lib/flashinfer/data/cutlass/include/cutlass/transform/thread 2026-04-14T21:02:52,251 copying 3rdparty/cutlass/include/cutlass/transform/thread/unary_op.h -> build/lib/flashinfer/data/cutlass/include/cutlass/transform/thread 2026-04-14T21:02:52,254 creating build/lib/flashinfer/data/cutlass/include/cutlass/transform/collective 2026-04-14T21:02:52,254 copying 3rdparty/cutlass/include/cutlass/transform/collective/sm90_wgmma_transpose.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/transform/collective 2026-04-14T21:02:52,258 copying 3rdparty/cutlass/include/cutlass/transform/pitch_linear_thread_map.h -> build/lib/flashinfer/data/cutlass/include/cutlass/transform 2026-04-14T21:02:52,260 creating build/lib/flashinfer/data/cutlass/include/cutlass/transform/kernel 2026-04-14T21:02:52,261 copying 3rdparty/cutlass/include/cutlass/transform/kernel/filter_format_transformer.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/transform/kernel 2026-04-14T21:02:52,264 copying 3rdparty/cutlass/include/cutlass/transform/kernel/sparse_gemm_compressor.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/transform/kernel 2026-04-14T21:02:52,266 copying 3rdparty/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/transform/kernel 2026-04-14T21:02:52,269 copying 3rdparty/cutlass/include/cutlass/wmma_array.h -> build/lib/flashinfer/data/cutlass/include/cutlass 2026-04-14T21:02:52,271 creating build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-14T21:02:52,272 copying 3rdparty/cutlass/include/cutlass/conv/threadblock/conv2d_dgrad_filter_tile_access_iterator_optimized.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-14T21:02:52,275 copying 3rdparty/cutlass/include/cutlass/conv/threadblock/conv2d_dgrad_output_gradient_tile_access_iterator_analytic.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-14T21:02:52,278 copying 3rdparty/cutlass/include/cutlass/conv/threadblock/conv2d_tile_iterator.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-14T21:02:52,280 copying 3rdparty/cutlass/include/cutlass/conv/threadblock/conv3d_dgrad_filter_tile_access_iterator_analytic.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-14T21:02:52,282 copying 3rdparty/cutlass/include/cutlass/conv/threadblock/implicit_gemm_fprop_fusion_multistage.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-14T21:02:52,285 copying 3rdparty/cutlass/include/cutlass/conv/threadblock/conv3d_fprop_filter_tile_access_iterator_optimized.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-14T21:02:52,288 copying 3rdparty/cutlass/include/cutlass/conv/threadblock/conv3d_fprop_filter_tile_access_iterator_analytic.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-14T21:02:52,290 copying 3rdparty/cutlass/include/cutlass/conv/threadblock/predicated_scale_bias_vector_iterator.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-14T21:02:52,292 copying 3rdparty/cutlass/include/cutlass/conv/threadblock/threadblock_swizzle.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-14T21:02:52,295 copying 3rdparty/cutlass/include/cutlass/conv/threadblock/conv2d_dgrad_output_gradient_tile_access_iterator_optimized.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-14T21:02:52,297 copying 3rdparty/cutlass/include/cutlass/conv/threadblock/conv3d_dgrad_output_gradient_tile_access_iterator_optimized.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-14T21:02:52,300 copying 3rdparty/cutlass/include/cutlass/conv/threadblock/conv3d_wgrad_activation_tile_access_iterator_optimized.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-14T21:02:52,303 copying 3rdparty/cutlass/include/cutlass/conv/threadblock/conv2d_fprop_activation_tile_access_iterator_fixed_channels.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-14T21:02:52,305 copying 3rdparty/cutlass/include/cutlass/conv/threadblock/depthwise_fprop_pipelined.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-14T21:02:52,308 copying 3rdparty/cutlass/include/cutlass/conv/threadblock/conv2d_fprop_activation_tile_access_iterator_few_channels.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-14T21:02:52,310 copying 3rdparty/cutlass/include/cutlass/conv/threadblock/conv3d_wgrad_activation_tile_access_iterator_analytic.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-14T21:02:52,313 copying 3rdparty/cutlass/include/cutlass/conv/threadblock/depthwise_fprop_direct_conv_multistage.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-14T21:02:52,316 copying 3rdparty/cutlass/include/cutlass/conv/threadblock/conv3d_fprop_activation_tile_access_iterator_optimized.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-14T21:02:52,318 copying 3rdparty/cutlass/include/cutlass/conv/threadblock/conv2d_fprop_activation_tile_access_iterator_optimized.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-14T21:02:52,321 copying 3rdparty/cutlass/include/cutlass/conv/threadblock/conv2d_fprop_filter_tile_access_iterator_few_channels.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-14T21:02:52,324 copying 3rdparty/cutlass/include/cutlass/conv/threadblock/depthwise_mma_core_with_lane_access_size.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-14T21:02:52,327 copying 3rdparty/cutlass/include/cutlass/conv/threadblock/depthwise_mma_base.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-14T21:02:52,329 copying 3rdparty/cutlass/include/cutlass/conv/threadblock/conv3d_params.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-14T21:02:52,332 copying 3rdparty/cutlass/include/cutlass/conv/threadblock/implicit_gemm_pipelined.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-14T21:02:52,334 copying 3rdparty/cutlass/include/cutlass/conv/threadblock/conv2d_fprop_filter_tile_access_iterator_analytic.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-14T21:02:52,337 copying 3rdparty/cutlass/include/cutlass/conv/threadblock/conv2d_fprop_filter_tile_access_iterator_fixed_channels.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-14T21:02:52,339 copying 3rdparty/cutlass/include/cutlass/conv/threadblock/depthwise_fprop_filter_tile_access_iterator_direct_conv_optimized.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-14T21:02:52,342 copying 3rdparty/cutlass/include/cutlass/conv/threadblock/conv2d_wgrad_output_gradient_tile_access_iterator_analytic.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-14T21:02:52,345 copying 3rdparty/cutlass/include/cutlass/conv/threadblock/depthwise_fprop_activation_tile_access_iterator_direct_conv_fixed_stride_dilation.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-14T21:02:52,348 copying 3rdparty/cutlass/include/cutlass/conv/threadblock/conv3d_wgrad_output_gradient_tile_access_iterator_analytic.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-14T21:02:52,350 copying 3rdparty/cutlass/include/cutlass/conv/threadblock/depthwise_direct_conv_params.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-14T21:02:52,353 copying 3rdparty/cutlass/include/cutlass/conv/threadblock/conv2d_fprop_activation_tile_access_iterator_analytic.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-14T21:02:52,355 copying 3rdparty/cutlass/include/cutlass/conv/threadblock/conv3d_fprop_activation_tile_access_iterator_analytic.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-14T21:02:52,358 copying 3rdparty/cutlass/include/cutlass/conv/threadblock/conv2d_wgrad_activation_tile_access_iterator_analytic.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-14T21:02:52,360 copying 3rdparty/cutlass/include/cutlass/conv/threadblock/conv2d_params.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-14T21:02:52,363 copying 3rdparty/cutlass/include/cutlass/conv/threadblock/predicated_scale_bias_vector_access_iterator.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-14T21:02:52,366 copying 3rdparty/cutlass/include/cutlass/conv/threadblock/conv2d_wgrad_activation_tile_access_iterator_optimized.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-14T21:02:52,368 copying 3rdparty/cutlass/include/cutlass/conv/threadblock/implicit_gemm_multistage.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-14T21:02:52,371 copying 3rdparty/cutlass/include/cutlass/conv/threadblock/conv2d_wgrad_output_gradient_tile_access_iterator_optimized.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-14T21:02:52,373 copying 3rdparty/cutlass/include/cutlass/conv/threadblock/conv3d_wgrad_output_gradient_tile_access_iterator_optimized.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-14T21:02:52,376 copying 3rdparty/cutlass/include/cutlass/conv/threadblock/conv3d_dgrad_filter_tile_access_iterator_optimized.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-14T21:02:52,378 copying 3rdparty/cutlass/include/cutlass/conv/threadblock/depthwise_fprop_activation_tile_access_iterator_direct_conv_optimized.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-14T21:02:52,381 copying 3rdparty/cutlass/include/cutlass/conv/threadblock/conv2d_fprop_filter_tile_access_iterator_optimized.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-14T21:02:52,384 copying 3rdparty/cutlass/include/cutlass/conv/threadblock/conv3d_dgrad_output_gradient_tile_access_iterator_analytic.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-14T21:02:52,386 copying 3rdparty/cutlass/include/cutlass/conv/threadblock/implicit_gemm_wgrad_fusion_multistage.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-14T21:02:52,389 copying 3rdparty/cutlass/include/cutlass/conv/threadblock/conv2d_dgrad_filter_tile_access_iterator_analytic.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-14T21:02:52,392 creating build/lib/flashinfer/data/cutlass/include/cutlass/conv/warp 2026-04-14T21:02:52,393 copying 3rdparty/cutlass/include/cutlass/conv/warp/mma_depthwise_simt_tile_iterator.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/warp 2026-04-14T21:02:52,396 copying 3rdparty/cutlass/include/cutlass/conv/warp/scale_bias_relu_transform.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/warp 2026-04-14T21:02:52,399 copying 3rdparty/cutlass/include/cutlass/conv/warp/mma_depthwise_simt.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/warp 2026-04-14T21:02:52,402 copying 3rdparty/cutlass/include/cutlass/conv/detail.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/conv 2026-04-14T21:02:52,404 creating build/lib/flashinfer/data/cutlass/include/cutlass/conv/device 2026-04-14T21:02:52,405 copying 3rdparty/cutlass/include/cutlass/conv/device/direct_convolution.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/device 2026-04-14T21:02:52,408 copying 3rdparty/cutlass/include/cutlass/conv/device/implicit_gemm_convolution_fusion.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/device 2026-04-14T21:02:52,410 copying 3rdparty/cutlass/include/cutlass/conv/device/implicit_gemm_convolution.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/device 2026-04-14T21:02:52,413 copying 3rdparty/cutlass/include/cutlass/conv/device/conv_universal_adapter.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/device 2026-04-14T21:02:52,416 copying 3rdparty/cutlass/include/cutlass/conv/conv2d_problem_size.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv 2026-04-14T21:02:52,419 copying 3rdparty/cutlass/include/cutlass/conv/convolution.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv 2026-04-14T21:02:52,421 creating build/lib/flashinfer/data/cutlass/include/cutlass/conv/thread 2026-04-14T21:02:52,422 copying 3rdparty/cutlass/include/cutlass/conv/thread/depthwise_mma.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/thread 2026-04-14T21:02:52,424 creating build/lib/flashinfer/data/cutlass/include/cutlass/conv/collective 2026-04-14T21:02:52,425 copying 3rdparty/cutlass/include/cutlass/conv/collective/detail.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/collective 2026-04-14T21:02:52,428 copying 3rdparty/cutlass/include/cutlass/conv/collective/sm100_implicit_gemm_umma_warpspecialized.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/collective 2026-04-14T21:02:52,431 copying 3rdparty/cutlass/include/cutlass/conv/collective/sm90_implicit_gemm_gmma_ss_warpspecialized.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/collective 2026-04-14T21:02:52,434 copying 3rdparty/cutlass/include/cutlass/conv/collective/collective_builder.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/collective 2026-04-14T21:02:52,436 copying 3rdparty/cutlass/include/cutlass/conv/collective/collective_conv.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/collective 2026-04-14T21:02:52,438 creating build/lib/flashinfer/data/cutlass/include/cutlass/conv/collective/builders 2026-04-14T21:02:52,439 copying 3rdparty/cutlass/include/cutlass/conv/collective/builders/sm100_umma_builder.inl -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/collective/builders 2026-04-14T21:02:52,442 copying 3rdparty/cutlass/include/cutlass/conv/collective/builders/sm90_common.inl -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/collective/builders 2026-04-14T21:02:52,444 copying 3rdparty/cutlass/include/cutlass/conv/collective/builders/sm90_gmma_builder.inl -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/collective/builders 2026-04-14T21:02:52,447 copying 3rdparty/cutlass/include/cutlass/conv/collective/builders/sm100_common.inl -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/collective/builders 2026-04-14T21:02:52,449 copying 3rdparty/cutlass/include/cutlass/conv/dispatch_policy.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/conv 2026-04-14T21:02:52,452 copying 3rdparty/cutlass/include/cutlass/conv/conv3d_problem_size.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv 2026-04-14T21:02:52,455 creating build/lib/flashinfer/data/cutlass/include/cutlass/conv/kernel 2026-04-14T21:02:52,456 copying 3rdparty/cutlass/include/cutlass/conv/kernel/default_deconv3d_with_broadcast.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/kernel 2026-04-14T21:02:52,458 copying 3rdparty/cutlass/include/cutlass/conv/kernel/default_deconv2d.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/kernel 2026-04-14T21:02:52,461 copying 3rdparty/cutlass/include/cutlass/conv/kernel/direct_convolution.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/kernel 2026-04-14T21:02:52,464 copying 3rdparty/cutlass/include/cutlass/conv/kernel/implicit_gemm_convolution_fusion.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/kernel 2026-04-14T21:02:52,467 copying 3rdparty/cutlass/include/cutlass/conv/kernel/default_conv2d_fprop_with_reduction.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/kernel 2026-04-14T21:02:52,469 copying 3rdparty/cutlass/include/cutlass/conv/kernel/implicit_gemm_convolution.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/kernel 2026-04-14T21:02:52,471 copying 3rdparty/cutlass/include/cutlass/conv/kernel/sm100_implicit_gemm_tma_warpspecialized.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/kernel 2026-04-14T21:02:52,474 copying 3rdparty/cutlass/include/cutlass/conv/kernel/default_conv2d_fprop_with_broadcast.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/kernel 2026-04-14T21:02:52,477 copying 3rdparty/cutlass/include/cutlass/conv/kernel/default_conv2d_wgrad.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/kernel 2026-04-14T21:02:52,479 copying 3rdparty/cutlass/include/cutlass/conv/kernel/implicit_gemm_convolution_with_absmax.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/kernel 2026-04-14T21:02:52,482 copying 3rdparty/cutlass/include/cutlass/conv/kernel/conv_universal.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/kernel 2026-04-14T21:02:52,484 copying 3rdparty/cutlass/include/cutlass/conv/kernel/default_conv3d_dgrad.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/kernel 2026-04-14T21:02:52,486 copying 3rdparty/cutlass/include/cutlass/conv/kernel/default_conv2d.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/kernel 2026-04-14T21:02:52,489 copying 3rdparty/cutlass/include/cutlass/conv/kernel/default_conv3d_fprop.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/kernel 2026-04-14T21:02:52,491 copying 3rdparty/cutlass/include/cutlass/conv/kernel/default_deconv3d.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/kernel 2026-04-14T21:02:52,494 copying 3rdparty/cutlass/include/cutlass/conv/kernel/default_conv3d_fprop_fusion.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/kernel 2026-04-14T21:02:52,496 copying 3rdparty/cutlass/include/cutlass/conv/kernel/implicit_gemm_convolution_with_fused_epilogue.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/kernel 2026-04-14T21:02:52,499 copying 3rdparty/cutlass/include/cutlass/conv/kernel/implicit_gemm_convolution_strided_dgrad.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/kernel 2026-04-14T21:02:52,502 copying 3rdparty/cutlass/include/cutlass/conv/kernel/default_conv2d_wgrad_fusion.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/kernel 2026-04-14T21:02:52,504 copying 3rdparty/cutlass/include/cutlass/conv/kernel/sm90_implicit_gemm_tma_warpspecialized.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/kernel 2026-04-14T21:02:52,506 copying 3rdparty/cutlass/include/cutlass/conv/kernel/default_conv2d_fprop_with_absmax.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/kernel 2026-04-14T21:02:52,509 copying 3rdparty/cutlass/include/cutlass/conv/kernel/default_deconv2d_with_broadcast.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/kernel 2026-04-14T21:02:52,511 copying 3rdparty/cutlass/include/cutlass/conv/kernel/default_conv2d_fprop.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/kernel 2026-04-14T21:02:52,516 copying 3rdparty/cutlass/include/cutlass/conv/kernel/default_conv2d_group_fprop.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/kernel 2026-04-14T21:02:52,519 copying 3rdparty/cutlass/include/cutlass/conv/kernel/default_conv3d_wgrad.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/kernel 2026-04-14T21:02:52,522 copying 3rdparty/cutlass/include/cutlass/conv/kernel/default_conv3d_fprop_with_broadcast.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/kernel 2026-04-14T21:02:52,524 copying 3rdparty/cutlass/include/cutlass/conv/kernel/default_depthwise_fprop.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/kernel 2026-04-14T21:02:52,527 copying 3rdparty/cutlass/include/cutlass/conv/kernel/default_conv2d_dgrad.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/kernel 2026-04-14T21:02:52,532 copying 3rdparty/cutlass/include/cutlass/conv/kernel/default_conv2d_fprop_fusion.h -> build/lib/flashinfer/data/cutlass/include/cutlass/conv/kernel 2026-04-14T21:02:52,534 copying 3rdparty/cutlass/include/cutlass/conv/convnd_problem_shape.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/conv 2026-04-14T21:02:52,538 copying 3rdparty/cutlass/include/cutlass/complex.h -> build/lib/flashinfer/data/cutlass/include/cutlass 2026-04-14T21:02:52,541 creating build/lib/flashinfer/data/cutlass/include/cutlass/thread 2026-04-14T21:02:52,542 copying 3rdparty/cutlass/include/cutlass/thread/matrix.h -> build/lib/flashinfer/data/cutlass/include/cutlass/thread 2026-04-14T21:02:52,545 copying 3rdparty/cutlass/include/cutlass/half.h -> build/lib/flashinfer/data/cutlass/include/cutlass 2026-04-14T21:02:52,547 copying 3rdparty/cutlass/include/cutlass/array_planar_complex.h -> build/lib/flashinfer/data/cutlass/include/cutlass 2026-04-14T21:02:52,549 copying 3rdparty/cutlass/include/cutlass/tensor_view.h -> build/lib/flashinfer/data/cutlass/include/cutlass 2026-04-14T21:02:52,552 copying 3rdparty/cutlass/include/cutlass/numeric_types.h -> build/lib/flashinfer/data/cutlass/include/cutlass 2026-04-14T21:02:52,554 copying 3rdparty/cutlass/include/cutlass/matrix_shape.h -> build/lib/flashinfer/data/cutlass/include/cutlass 2026-04-14T21:02:52,556 copying 3rdparty/cutlass/include/cutlass/quaternion.h -> build/lib/flashinfer/data/cutlass/include/cutlass 2026-04-14T21:02:52,559 copying 3rdparty/cutlass/include/cutlass/float8.h -> build/lib/flashinfer/data/cutlass/include/cutlass 2026-04-14T21:02:52,562 creating build/lib/flashinfer/data/cutlass/include/cutlass/detail 2026-04-14T21:02:52,563 copying 3rdparty/cutlass/include/cutlass/detail/mainloop_fusion_helper_scale_factor.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/detail 2026-04-14T21:02:52,565 copying 3rdparty/cutlass/include/cutlass/detail/collective.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/detail 2026-04-14T21:02:52,568 copying 3rdparty/cutlass/include/cutlass/detail/sm100_mixed_dtype_blockwise_layout.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/detail 2026-04-14T21:02:52,570 copying 3rdparty/cutlass/include/cutlass/detail/dependent_false.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/detail 2026-04-14T21:02:52,572 copying 3rdparty/cutlass/include/cutlass/detail/blockwise_scale_layout.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/detail 2026-04-14T21:02:52,575 copying 3rdparty/cutlass/include/cutlass/detail/mma.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/detail 2026-04-14T21:02:52,577 copying 3rdparty/cutlass/include/cutlass/detail/layout.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/detail 2026-04-14T21:02:52,579 copying 3rdparty/cutlass/include/cutlass/detail/sm103_blockscaled_layout.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/detail 2026-04-14T21:02:52,581 creating build/lib/flashinfer/data/cutlass/include/cutlass/detail/collective 2026-04-14T21:02:52,582 copying 3rdparty/cutlass/include/cutlass/detail/collective/moe_stride_utils.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/detail/collective 2026-04-14T21:02:52,585 copying 3rdparty/cutlass/include/cutlass/detail/collective/sm103_kernel_type.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/detail/collective 2026-04-14T21:02:52,587 copying 3rdparty/cutlass/include/cutlass/detail/collective/mixed_input_utils.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/detail/collective 2026-04-14T21:02:52,590 copying 3rdparty/cutlass/include/cutlass/detail/sm100_tmem_helper.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/detail 2026-04-14T21:02:52,592 copying 3rdparty/cutlass/include/cutlass/detail/cluster.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/detail 2026-04-14T21:02:52,595 copying 3rdparty/cutlass/include/cutlass/detail/sm100_blockscaled_layout.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/detail 2026-04-14T21:02:52,598 copying 3rdparty/cutlass/include/cutlass/detail/helper_macros.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/detail 2026-04-14T21:02:52,600 copying 3rdparty/cutlass/include/cutlass/blas3.h -> build/lib/flashinfer/data/cutlass/include/cutlass 2026-04-14T21:02:52,602 copying 3rdparty/cutlass/include/cutlass/fast_math.h -> build/lib/flashinfer/data/cutlass/include/cutlass 2026-04-14T21:02:52,605 copying 3rdparty/cutlass/include/cutlass/tfloat32.h -> build/lib/flashinfer/data/cutlass/include/cutlass 2026-04-14T21:02:52,607 copying 3rdparty/cutlass/include/cutlass/array_subbyte.h -> build/lib/flashinfer/data/cutlass/include/cutlass 2026-04-14T21:02:52,610 copying 3rdparty/cutlass/include/cutlass/bfloat16.h -> build/lib/flashinfer/data/cutlass/include/cutlass 2026-04-14T21:02:52,612 creating build/lib/flashinfer/data/cutlass/include/cutlass/reduction 2026-04-14T21:02:52,613 copying 3rdparty/cutlass/include/cutlass/reduction/threadblock_swizzle.h -> build/lib/flashinfer/data/cutlass/include/cutlass/reduction 2026-04-14T21:02:52,615 creating build/lib/flashinfer/data/cutlass/include/cutlass/reduction/device 2026-04-14T21:02:52,616 copying 3rdparty/cutlass/include/cutlass/reduction/device/tensor_reduce_affine_contiguous.h -> build/lib/flashinfer/data/cutlass/include/cutlass/reduction/device 2026-04-14T21:02:52,618 copying 3rdparty/cutlass/include/cutlass/reduction/device/tensor_reduce.h -> build/lib/flashinfer/data/cutlass/include/cutlass/reduction/device 2026-04-14T21:02:52,621 copying 3rdparty/cutlass/include/cutlass/reduction/device/tensor_reduce_affine_strided.h -> build/lib/flashinfer/data/cutlass/include/cutlass/reduction/device 2026-04-14T21:02:52,623 copying 3rdparty/cutlass/include/cutlass/reduction/device/reduce_split_k.h -> build/lib/flashinfer/data/cutlass/include/cutlass/reduction/device 2026-04-14T21:02:52,625 creating build/lib/flashinfer/data/cutlass/include/cutlass/reduction/thread 2026-04-14T21:02:52,626 copying 3rdparty/cutlass/include/cutlass/reduction/thread/reduction_operators.h -> build/lib/flashinfer/data/cutlass/include/cutlass/reduction/thread 2026-04-14T21:02:52,629 copying 3rdparty/cutlass/include/cutlass/reduction/thread/reduce.h -> build/lib/flashinfer/data/cutlass/include/cutlass/reduction/thread 2026-04-14T21:02:52,631 creating build/lib/flashinfer/data/cutlass/include/cutlass/reduction/kernel 2026-04-14T21:02:52,632 copying 3rdparty/cutlass/include/cutlass/reduction/kernel/tensor_reduce_affine_contiguous.h -> build/lib/flashinfer/data/cutlass/include/cutlass/reduction/kernel 2026-04-14T21:02:52,635 copying 3rdparty/cutlass/include/cutlass/reduction/kernel/reduce_softmax_final.h -> build/lib/flashinfer/data/cutlass/include/cutlass/reduction/kernel 2026-04-14T21:02:52,637 copying 3rdparty/cutlass/include/cutlass/reduction/kernel/tensor_reduce_affine_strided.h -> build/lib/flashinfer/data/cutlass/include/cutlass/reduction/kernel 2026-04-14T21:02:52,640 copying 3rdparty/cutlass/include/cutlass/reduction/kernel/reduce_split_k.h -> build/lib/flashinfer/data/cutlass/include/cutlass/reduction/kernel 2026-04-14T21:02:52,642 copying 3rdparty/cutlass/include/cutlass/relatively_equal.h -> build/lib/flashinfer/data/cutlass/include/cutlass 2026-04-14T21:02:52,645 copying 3rdparty/cutlass/include/cutlass/gemm_coord.h -> build/lib/flashinfer/data/cutlass/include/cutlass 2026-04-14T21:02:52,647 copying 3rdparty/cutlass/include/cutlass/barrier.h -> build/lib/flashinfer/data/cutlass/include/cutlass 2026-04-14T21:02:52,650 copying 3rdparty/cutlass/include/cutlass/cuda_host_adapter.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass 2026-04-14T21:02:52,652 copying 3rdparty/cutlass/include/cutlass/predicate_vector.h -> build/lib/flashinfer/data/cutlass/include/cutlass 2026-04-14T21:02:52,655 copying 3rdparty/cutlass/include/cutlass/core_io.h -> build/lib/flashinfer/data/cutlass/include/cutlass 2026-04-14T21:02:52,657 copying 3rdparty/cutlass/include/cutlass/pitch_linear_coord.h -> build/lib/flashinfer/data/cutlass/include/cutlass 2026-04-14T21:02:52,659 copying 3rdparty/cutlass/include/cutlass/matrix_coord.h -> build/lib/flashinfer/data/cutlass/include/cutlass 2026-04-14T21:02:52,662 copying 3rdparty/cutlass/include/cutlass/kernel_launch.h -> build/lib/flashinfer/data/cutlass/include/cutlass 2026-04-14T21:02:52,664 creating build/lib/flashinfer/data/cutlass/include/cutlass/pipeline 2026-04-14T21:02:52,665 copying 3rdparty/cutlass/include/cutlass/pipeline/pipeline.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/pipeline 2026-04-14T21:02:52,667 copying 3rdparty/cutlass/include/cutlass/pipeline/sm90_pipeline.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/pipeline 2026-04-14T21:02:52,670 copying 3rdparty/cutlass/include/cutlass/pipeline/sm100_pipeline.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass/pipeline 2026-04-14T21:02:52,673 copying 3rdparty/cutlass/include/cutlass/trace.h -> build/lib/flashinfer/data/cutlass/include/cutlass 2026-04-14T21:02:52,675 copying 3rdparty/cutlass/include/cutlass/gemm_coord.hpp -> build/lib/flashinfer/data/cutlass/include/cutlass 2026-04-14T21:02:52,677 creating build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util 2026-04-14T21:02:52,679 copying 3rdparty/cutlass/tools/util/include/cutlass/util/host_uncompress.h -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util 2026-04-14T21:02:52,682 copying 3rdparty/cutlass/tools/util/include/cutlass/util/exceptions.h -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util 2026-04-14T21:02:52,684 copying 3rdparty/cutlass/tools/util/include/cutlass/util/host_tensor_planar_complex.h -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util 2026-04-14T21:02:52,687 copying 3rdparty/cutlass/tools/util/include/cutlass/util/device_nhwc_padding.h -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util 2026-04-14T21:02:52,689 creating build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/device 2026-04-14T21:02:52,691 copying 3rdparty/cutlass/tools/util/include/cutlass/util/reference/device/gemm.h -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/device 2026-04-14T21:02:52,694 copying 3rdparty/cutlass/tools/util/include/cutlass/util/reference/device/tensor_fill.h -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/device 2026-04-14T21:02:52,697 copying 3rdparty/cutlass/tools/util/include/cutlass/util/reference/device/tensor_relu.h -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/device 2026-04-14T21:02:52,700 copying 3rdparty/cutlass/tools/util/include/cutlass/util/reference/device/tensor_foreach.h -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/device 2026-04-14T21:02:52,702 copying 3rdparty/cutlass/tools/util/include/cutlass/util/reference/device/tensor_reduce.h -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/device 2026-04-14T21:02:52,705 copying 3rdparty/cutlass/tools/util/include/cutlass/util/reference/device/gemm_planar_complex.h -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/device 2026-04-14T21:02:52,708 copying 3rdparty/cutlass/tools/util/include/cutlass/util/reference/device/rank_2k_complex.h -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/device 2026-04-14T21:02:52,710 copying 3rdparty/cutlass/tools/util/include/cutlass/util/reference/device/convolution.h -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/device 2026-04-14T21:02:52,714 copying 3rdparty/cutlass/tools/util/include/cutlass/util/reference/device/tensor_compare.h -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/device 2026-04-14T21:02:52,716 creating build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/device/thread 2026-04-14T21:02:52,717 copying 3rdparty/cutlass/tools/util/include/cutlass/util/reference/device/thread/gemm.h -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/device/thread 2026-04-14T21:02:52,720 creating build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/device/kernel 2026-04-14T21:02:52,721 copying 3rdparty/cutlass/tools/util/include/cutlass/util/reference/device/kernel/gemm.h -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/device/kernel 2026-04-14T21:02:52,723 copying 3rdparty/cutlass/tools/util/include/cutlass/util/reference/device/kernel/tensor_elementwise.h -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/device/kernel 2026-04-14T21:02:52,726 copying 3rdparty/cutlass/tools/util/include/cutlass/util/reference/device/kernel/tensor_foreach.h -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/device/kernel 2026-04-14T21:02:52,728 copying 3rdparty/cutlass/tools/util/include/cutlass/util/reference/device/gemm_complex.h -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/device 2026-04-14T21:02:52,731 copying 3rdparty/cutlass/tools/util/include/cutlass/util/reference/device/gett.hpp -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/device 2026-04-14T21:02:52,733 creating build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host 2026-04-14T21:02:52,734 copying 3rdparty/cutlass/tools/util/include/cutlass/util/reference/host/gemm.h -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host 2026-04-14T21:02:52,737 copying 3rdparty/cutlass/tools/util/include/cutlass/util/reference/host/tensor_elementwise.h -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host 2026-04-14T21:02:52,739 copying 3rdparty/cutlass/tools/util/include/cutlass/util/reference/host/tensor_fill.h -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host 2026-04-14T21:02:52,742 copying 3rdparty/cutlass/tools/util/include/cutlass/util/reference/host/rank_k_complex.h -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host 2026-04-14T21:02:52,745 copying 3rdparty/cutlass/tools/util/include/cutlass/util/reference/host/symm.h -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host 2026-04-14T21:02:52,747 copying 3rdparty/cutlass/tools/util/include/cutlass/util/reference/host/trmm.h -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host 2026-04-14T21:02:52,749 copying 3rdparty/cutlass/tools/util/include/cutlass/util/reference/host/tensor_foreach.h -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host 2026-04-14T21:02:52,752 copying 3rdparty/cutlass/tools/util/include/cutlass/util/reference/host/symm_complex.h -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host 2026-04-14T21:02:52,754 copying 3rdparty/cutlass/tools/util/include/cutlass/util/reference/host/tensor_reduce.h -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host 2026-04-14T21:02:52,756 copying 3rdparty/cutlass/tools/util/include/cutlass/util/reference/host/rank_2k.h -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host 2026-04-14T21:02:52,759 copying 3rdparty/cutlass/tools/util/include/cutlass/util/reference/host/tensor_reduce.hpp -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host 2026-04-14T21:02:52,761 copying 3rdparty/cutlass/tools/util/include/cutlass/util/reference/host/tensor_copy.h -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host 2026-04-14T21:02:52,764 copying 3rdparty/cutlass/tools/util/include/cutlass/util/reference/host/gemm_planar_complex.h -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host 2026-04-14T21:02:52,766 copying 3rdparty/cutlass/tools/util/include/cutlass/util/reference/host/trmm_complex.h -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host 2026-04-14T21:02:52,769 copying 3rdparty/cutlass/tools/util/include/cutlass/util/reference/host/rank_2k_complex.h -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host 2026-04-14T21:02:52,772 copying 3rdparty/cutlass/tools/util/include/cutlass/util/reference/host/convolution.h -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host 2026-04-14T21:02:52,775 copying 3rdparty/cutlass/tools/util/include/cutlass/util/reference/host/tensor_compare.h -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host 2026-04-14T21:02:52,777 copying 3rdparty/cutlass/tools/util/include/cutlass/util/reference/host/tensor_compare.hpp -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host 2026-04-14T21:02:52,779 copying 3rdparty/cutlass/tools/util/include/cutlass/util/reference/host/conv.hpp -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host 2026-04-14T21:02:52,782 copying 3rdparty/cutlass/tools/util/include/cutlass/util/reference/host/error_metrics.h -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host 2026-04-14T21:02:52,784 copying 3rdparty/cutlass/tools/util/include/cutlass/util/reference/host/tensor_norm.h -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host 2026-04-14T21:02:52,786 copying 3rdparty/cutlass/tools/util/include/cutlass/util/reference/host/gemm_complex.h -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host 2026-04-14T21:02:52,789 copying 3rdparty/cutlass/tools/util/include/cutlass/util/reference/host/gett.hpp -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host 2026-04-14T21:02:52,792 copying 3rdparty/cutlass/tools/util/include/cutlass/util/reference/host/tensor_fill.hpp -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host 2026-04-14T21:02:52,794 creating build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/detail 2026-04-14T21:02:52,795 copying 3rdparty/cutlass/tools/util/include/cutlass/util/reference/detail/linear_to_coordinate.h -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/detail 2026-04-14T21:02:52,797 copying 3rdparty/cutlass/tools/util/include/cutlass/util/reference/detail/inner_product.h -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/detail 2026-04-14T21:02:52,800 copying 3rdparty/cutlass/tools/util/include/cutlass/util/print_error.hpp -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util 2026-04-14T21:02:52,802 copying 3rdparty/cutlass/tools/util/include/cutlass/util/packed_stride.hpp -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util 2026-04-14T21:02:52,805 copying 3rdparty/cutlass/tools/util/include/cutlass/util/tensor_view_io.h -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util 2026-04-14T21:02:52,807 copying 3rdparty/cutlass/tools/util/include/cutlass/util/device_memory.h -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util 2026-04-14T21:02:52,810 copying 3rdparty/cutlass/tools/util/include/cutlass/util/cublas_wrappers.hpp -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util 2026-04-14T21:02:52,813 copying 3rdparty/cutlass/tools/util/include/cutlass/util/command_line.h -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util 2026-04-14T21:02:52,815 copying 3rdparty/cutlass/tools/util/include/cutlass/util/device_dump.h -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util 2026-04-14T21:02:52,818 copying 3rdparty/cutlass/tools/util/include/cutlass/util/gett_commandline.hpp -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util 2026-04-14T21:02:52,820 copying 3rdparty/cutlass/tools/util/include/cutlass/util/host_tensor.h -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util 2026-04-14T21:02:52,823 copying 3rdparty/cutlass/tools/util/include/cutlass/util/distribution.h -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util 2026-04-14T21:02:52,826 copying 3rdparty/cutlass/tools/util/include/cutlass/util/debug.h -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util 2026-04-14T21:02:52,828 copying 3rdparty/cutlass/tools/util/include/cutlass/util/device_nhwc_to_nchw.h -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util 2026-04-14T21:02:52,830 copying 3rdparty/cutlass/tools/util/include/cutlass/util/device_nhwc_pooling.h -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util 2026-04-14T21:02:52,833 copying 3rdparty/cutlass/tools/util/include/cutlass/util/helper_cuda.hpp -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util 2026-04-14T21:02:52,835 copying 3rdparty/cutlass/tools/util/include/cutlass/util/index_sequence.h -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util 2026-04-14T21:02:52,837 copying 3rdparty/cutlass/tools/util/include/cutlass/util/device_utils.h -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util 2026-04-14T21:02:52,839 copying 3rdparty/cutlass/tools/util/include/cutlass/util/type_traits.h -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util 2026-04-14T21:02:52,841 copying 3rdparty/cutlass/tools/util/include/cutlass/util/device_layernorm.h -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util 2026-04-14T21:02:52,844 copying 3rdparty/cutlass/tools/util/include/cutlass/util/host_reorder.h -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util 2026-04-14T21:02:52,847 copying 3rdparty/cutlass/tools/util/include/cutlass/util/device_rmsnorm.h -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util 2026-04-14T21:02:52,849 copying 3rdparty/cutlass/tools/util/include/cutlass/util/device_nchw_to_nhwc.h -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util 2026-04-14T21:02:52,851 copying 3rdparty/cutlass/tools/util/include/cutlass/util/GPU_Clock.hpp -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util 2026-04-14T21:02:52,853 copying 3rdparty/cutlass/tools/util/include/cutlass/util/mixed_dtype_utils.hpp -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util 2026-04-14T21:02:52,856 copying 3rdparty/cutlass/tools/util/include/cutlass/util/device_groupnorm.h -> build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util 2026-04-14T21:02:52,983 installing to build/bdist.linux-armv7l/wheel 2026-04-14T21:02:52,983 running install 2026-04-14T21:02:53,006 running install_lib 2026-04-14T21:02:53,012 creating build/bdist.linux-armv7l/wheel 2026-04-14T21:02:53,014 copying build/lib/build_backend.py -> build/bdist.linux-armv7l/wheel/. 2026-04-14T21:02:53,017 creating build/bdist.linux-armv7l/wheel/flashinfer 2026-04-14T21:02:53,018 copying build/lib/flashinfer/sampling.py -> build/bdist.linux-armv7l/wheel/./flashinfer 2026-04-14T21:02:53,023 creating build/bdist.linux-armv7l/wheel/flashinfer/cudnn 2026-04-14T21:02:53,024 copying build/lib/flashinfer/cudnn/__init__.py -> build/bdist.linux-armv7l/wheel/./flashinfer/cudnn 2026-04-14T21:02:53,026 copying build/lib/flashinfer/cudnn/decode.py -> build/bdist.linux-armv7l/wheel/./flashinfer/cudnn 2026-04-14T21:02:53,028 copying build/lib/flashinfer/cudnn/prefill.py -> build/bdist.linux-armv7l/wheel/./flashinfer/cudnn 2026-04-14T21:02:53,031 copying build/lib/flashinfer/cudnn/utils.py -> build/bdist.linux-armv7l/wheel/./flashinfer/cudnn 2026-04-14T21:02:53,033 copying build/lib/flashinfer/compilation_context.py -> build/bdist.linux-armv7l/wheel/./flashinfer 2026-04-14T21:02:53,035 copying build/lib/flashinfer/pod.py -> build/bdist.linux-armv7l/wheel/./flashinfer 2026-04-14T21:02:53,038 creating build/bdist.linux-armv7l/wheel/flashinfer/norm 2026-04-14T21:02:53,039 copying build/lib/flashinfer/norm/__init__.py -> build/bdist.linux-armv7l/wheel/./flashinfer/norm 2026-04-14T21:02:53,043 creating build/bdist.linux-armv7l/wheel/flashinfer/norm/kernels 2026-04-14T21:02:53,044 copying build/lib/flashinfer/norm/kernels/__init__.py -> build/bdist.linux-armv7l/wheel/./flashinfer/norm/kernels 2026-04-14T21:02:53,046 copying build/lib/flashinfer/norm/kernels/layernorm.py -> build/bdist.linux-armv7l/wheel/./flashinfer/norm/kernels 2026-04-14T21:02:53,048 copying build/lib/flashinfer/norm/kernels/fused_add_rmsnorm.py -> build/bdist.linux-armv7l/wheel/./flashinfer/norm/kernels 2026-04-14T21:02:53,050 copying build/lib/flashinfer/norm/kernels/rmsnorm.py -> build/bdist.linux-armv7l/wheel/./flashinfer/norm/kernels 2026-04-14T21:02:53,053 copying build/lib/flashinfer/norm/utils.py -> build/bdist.linux-armv7l/wheel/./flashinfer/norm 2026-04-14T21:02:53,056 creating build/bdist.linux-armv7l/wheel/flashinfer/logits_processor 2026-04-14T21:02:53,057 copying build/lib/flashinfer/logits_processor/pipeline.py -> build/bdist.linux-armv7l/wheel/./flashinfer/logits_processor 2026-04-14T21:02:53,059 copying build/lib/flashinfer/logits_processor/legalization.py -> build/bdist.linux-armv7l/wheel/./flashinfer/logits_processor 2026-04-14T21:02:53,061 copying build/lib/flashinfer/logits_processor/fusion_rules.py -> build/bdist.linux-armv7l/wheel/./flashinfer/logits_processor 2026-04-14T21:02:53,063 copying build/lib/flashinfer/logits_processor/__init__.py -> build/bdist.linux-armv7l/wheel/./flashinfer/logits_processor 2026-04-14T21:02:53,065 copying build/lib/flashinfer/logits_processor/operators.py -> build/bdist.linux-armv7l/wheel/./flashinfer/logits_processor 2026-04-14T21:02:53,067 copying build/lib/flashinfer/logits_processor/validators.py -> build/bdist.linux-armv7l/wheel/./flashinfer/logits_processor 2026-04-14T21:02:53,069 copying build/lib/flashinfer/logits_processor/op.py -> build/bdist.linux-armv7l/wheel/./flashinfer/logits_processor 2026-04-14T21:02:53,071 copying build/lib/flashinfer/logits_processor/compiler.py -> build/bdist.linux-armv7l/wheel/./flashinfer/logits_processor 2026-04-14T21:02:53,073 copying build/lib/flashinfer/logits_processor/processors.py -> build/bdist.linux-armv7l/wheel/./flashinfer/logits_processor 2026-04-14T21:02:53,075 copying build/lib/flashinfer/logits_processor/types.py -> build/bdist.linux-armv7l/wheel/./flashinfer/logits_processor 2026-04-14T21:02:53,077 copying build/lib/flashinfer/gdn_prefill.py -> build/bdist.linux-armv7l/wheel/./flashinfer 2026-04-14T21:02:53,079 copying build/lib/flashinfer/__init__.py -> build/bdist.linux-armv7l/wheel/./flashinfer 2026-04-14T21:02:53,082 copying build/lib/flashinfer/gdn_decode.py -> build/bdist.linux-armv7l/wheel/./flashinfer 2026-04-14T21:02:53,085 copying build/lib/flashinfer/decode.py -> build/bdist.linux-armv7l/wheel/./flashinfer 2026-04-14T21:02:53,089 creating build/bdist.linux-armv7l/wheel/flashinfer/fused_moe 2026-04-14T21:02:53,090 copying build/lib/flashinfer/fused_moe/__init__.py -> build/bdist.linux-armv7l/wheel/./flashinfer/fused_moe 2026-04-14T21:02:53,092 copying build/lib/flashinfer/fused_moe/fused_routing_dsv3.py -> build/bdist.linux-armv7l/wheel/./flashinfer/fused_moe 2026-04-14T21:02:53,095 creating build/bdist.linux-armv7l/wheel/flashinfer/fused_moe/cute_dsl 2026-04-14T21:02:53,096 copying build/lib/flashinfer/fused_moe/cute_dsl/blockscaled_contiguous_grouped_gemm_finalize_fusion.py -> build/bdist.linux-armv7l/wheel/./flashinfer/fused_moe/cute_dsl 2026-04-14T21:02:53,099 copying build/lib/flashinfer/fused_moe/cute_dsl/__init__.py -> build/bdist.linux-armv7l/wheel/./flashinfer/fused_moe/cute_dsl 2026-04-14T21:02:53,100 copying build/lib/flashinfer/fused_moe/cute_dsl/fused_moe.py -> build/bdist.linux-armv7l/wheel/./flashinfer/fused_moe/cute_dsl 2026-04-14T21:02:53,103 copying build/lib/flashinfer/fused_moe/cute_dsl/blockscaled_contiguous_gather_grouped_gemm_swiglu_fusion.py -> build/bdist.linux-armv7l/wheel/./flashinfer/fused_moe/cute_dsl 2026-04-14T21:02:53,106 copying build/lib/flashinfer/fused_moe/cute_dsl/tuner.py -> build/bdist.linux-armv7l/wheel/./flashinfer/fused_moe/cute_dsl 2026-04-14T21:02:53,108 copying build/lib/flashinfer/fused_moe/cute_dsl/moe_utils.py -> build/bdist.linux-armv7l/wheel/./flashinfer/fused_moe/cute_dsl 2026-04-14T21:02:53,112 creating build/bdist.linux-armv7l/wheel/flashinfer/fused_moe/cute_dsl/blackwell 2026-04-14T21:02:53,113 copying build/lib/flashinfer/fused_moe/cute_dsl/blackwell/blockscaled_contiguous_grouped_gemm_finalize_fusion.py -> build/bdist.linux-armv7l/wheel/./flashinfer/fused_moe/cute_dsl/blackwell 2026-04-14T21:02:53,117 copying build/lib/flashinfer/fused_moe/cute_dsl/blackwell/__init__.py -> build/bdist.linux-armv7l/wheel/./flashinfer/fused_moe/cute_dsl/blackwell 2026-04-14T21:02:53,118 copying build/lib/flashinfer/fused_moe/cute_dsl/blackwell/blockscaled_contiguous_gather_grouped_gemm_swiglu_fusion.py -> build/bdist.linux-armv7l/wheel/./flashinfer/fused_moe/cute_dsl/blackwell 2026-04-14T21:02:53,123 copying build/lib/flashinfer/fused_moe/cute_dsl/blackwell/custom_pipeline.py -> build/bdist.linux-armv7l/wheel/./flashinfer/fused_moe/cute_dsl/blackwell 2026-04-14T21:02:53,126 copying build/lib/flashinfer/fused_moe/cute_dsl/blackwell/utils.py -> build/bdist.linux-armv7l/wheel/./flashinfer/fused_moe/cute_dsl/blackwell 2026-04-14T21:02:53,128 copying build/lib/flashinfer/fused_moe/core.py -> build/bdist.linux-armv7l/wheel/./flashinfer/fused_moe 2026-04-14T21:02:53,132 copying build/lib/flashinfer/fused_moe/utils.py -> build/bdist.linux-armv7l/wheel/./flashinfer/fused_moe 2026-04-14T21:02:53,134 copying build/lib/flashinfer/fp8_quantization.py -> build/bdist.linux-armv7l/wheel/./flashinfer 2026-04-14T21:02:53,136 copying build/lib/flashinfer/version.py -> build/bdist.linux-armv7l/wheel/./flashinfer 2026-04-14T21:02:53,139 creating build/bdist.linux-armv7l/wheel/flashinfer/cute_dsl 2026-04-14T21:02:53,140 copying build/lib/flashinfer/cute_dsl/__init__.py -> build/bdist.linux-armv7l/wheel/./flashinfer/cute_dsl 2026-04-14T21:02:53,142 copying build/lib/flashinfer/cute_dsl/rmsnorm_fp4quant.py -> build/bdist.linux-armv7l/wheel/./flashinfer/cute_dsl 2026-04-14T21:02:53,145 copying build/lib/flashinfer/cute_dsl/gemm_allreduce_two_shot.py -> build/bdist.linux-armv7l/wheel/./flashinfer/cute_dsl 2026-04-14T21:02:53,148 copying build/lib/flashinfer/cute_dsl/blockscaled_gemm.py -> build/bdist.linux-armv7l/wheel/./flashinfer/cute_dsl 2026-04-14T21:02:53,150 copying build/lib/flashinfer/cute_dsl/fp4_common.py -> build/bdist.linux-armv7l/wheel/./flashinfer/cute_dsl 2026-04-14T21:02:53,154 creating build/bdist.linux-armv7l/wheel/flashinfer/cute_dsl/attention 2026-04-14T21:02:53,155 copying build/lib/flashinfer/cute_dsl/attention/pipeline_topology.py -> build/bdist.linux-armv7l/wheel/./flashinfer/cute_dsl/attention 2026-04-14T21:02:53,157 copying build/lib/flashinfer/cute_dsl/attention/__init__.py -> build/bdist.linux-armv7l/wheel/./flashinfer/cute_dsl/attention 2026-04-14T21:02:53,159 copying build/lib/flashinfer/cute_dsl/attention/tmem_layout.py -> build/bdist.linux-armv7l/wheel/./flashinfer/cute_dsl/attention 2026-04-14T21:02:53,161 copying build/lib/flashinfer/cute_dsl/attention/mla_decode_fp8.py -> build/bdist.linux-armv7l/wheel/./flashinfer/cute_dsl/attention 2026-04-14T21:02:53,164 copying build/lib/flashinfer/cute_dsl/attention/config.py -> build/bdist.linux-armv7l/wheel/./flashinfer/cute_dsl/attention 2026-04-14T21:02:53,166 copying build/lib/flashinfer/cute_dsl/attention/mla_warp_schedule.py -> build/bdist.linux-armv7l/wheel/./flashinfer/cute_dsl/attention 2026-04-14T21:02:53,169 creating build/bdist.linux-armv7l/wheel/flashinfer/cute_dsl/attention/roles 2026-04-14T21:02:53,170 copying build/lib/flashinfer/cute_dsl/attention/roles/__init__.py -> build/bdist.linux-armv7l/wheel/./flashinfer/cute_dsl/attention/roles 2026-04-14T21:02:53,172 copying build/lib/flashinfer/cute_dsl/attention/roles/softmax.py -> build/bdist.linux-armv7l/wheel/./flashinfer/cute_dsl/attention/roles 2026-04-14T21:02:53,175 copying build/lib/flashinfer/cute_dsl/attention/roles/correction.py -> build/bdist.linux-armv7l/wheel/./flashinfer/cute_dsl/attention/roles 2026-04-14T21:02:53,178 copying build/lib/flashinfer/cute_dsl/attention/roles/mla_mma.py -> build/bdist.linux-armv7l/wheel/./flashinfer/cute_dsl/attention/roles 2026-04-14T21:02:53,180 copying build/lib/flashinfer/cute_dsl/attention/roles/softmax_math.py -> build/bdist.linux-armv7l/wheel/./flashinfer/cute_dsl/attention/roles 2026-04-14T21:02:53,182 copying build/lib/flashinfer/cute_dsl/attention/roles/loader_tma.py -> build/bdist.linux-armv7l/wheel/./flashinfer/cute_dsl/attention/roles 2026-04-14T21:02:53,184 copying build/lib/flashinfer/cute_dsl/attention/roles/epilogue.py -> build/bdist.linux-armv7l/wheel/./flashinfer/cute_dsl/attention/roles 2026-04-14T21:02:53,187 copying build/lib/flashinfer/cute_dsl/attention/roles/mla_loader.py -> build/bdist.linux-armv7l/wheel/./flashinfer/cute_dsl/attention/roles 2026-04-14T21:02:53,189 copying build/lib/flashinfer/cute_dsl/attention/roles/mla_pt_loader.py -> build/bdist.linux-armv7l/wheel/./flashinfer/cute_dsl/attention/roles 2026-04-14T21:02:53,191 copying build/lib/flashinfer/cute_dsl/attention/roles/mla_correction.py -> build/bdist.linux-armv7l/wheel/./flashinfer/cute_dsl/attention/roles 2026-04-14T21:02:53,194 copying build/lib/flashinfer/cute_dsl/attention/roles/mla_mma_fp8.py -> build/bdist.linux-armv7l/wheel/./flashinfer/cute_dsl/attention/roles 2026-04-14T21:02:53,196 copying build/lib/flashinfer/cute_dsl/attention/roles/mla_compute.py -> build/bdist.linux-armv7l/wheel/./flashinfer/cute_dsl/attention/roles 2026-04-14T21:02:53,199 copying build/lib/flashinfer/cute_dsl/attention/roles/mma.py -> build/bdist.linux-armv7l/wheel/./flashinfer/cute_dsl/attention/roles 2026-04-14T21:02:53,201 copying build/lib/flashinfer/cute_dsl/attention/roles/mla_loader_fp8.py -> build/bdist.linux-armv7l/wheel/./flashinfer/cute_dsl/attention/roles 2026-04-14T21:02:53,203 copying build/lib/flashinfer/cute_dsl/attention/warp_schedule.py -> build/bdist.linux-armv7l/wheel/./flashinfer/cute_dsl/attention 2026-04-14T21:02:53,205 copying build/lib/flashinfer/cute_dsl/attention/collective_builder.py -> build/bdist.linux-armv7l/wheel/./flashinfer/cute_dsl/attention 2026-04-14T21:02:53,208 copying build/lib/flashinfer/cute_dsl/attention/mla_config.py -> build/bdist.linux-armv7l/wheel/./flashinfer/cute_dsl/attention 2026-04-14T21:02:53,210 copying build/lib/flashinfer/cute_dsl/attention/prefill.py -> build/bdist.linux-armv7l/wheel/./flashinfer/cute_dsl/attention 2026-04-14T21:02:53,213 copying build/lib/flashinfer/cute_dsl/attention/compat.py -> build/bdist.linux-armv7l/wheel/./flashinfer/cute_dsl/attention 2026-04-14T21:02:53,215 creating build/bdist.linux-armv7l/wheel/flashinfer/cute_dsl/attention/fusion 2026-04-14T21:02:53,216 copying build/lib/flashinfer/cute_dsl/attention/fusion/__init__.py -> build/bdist.linux-armv7l/wheel/./flashinfer/cute_dsl/attention/fusion 2026-04-14T21:02:53,218 copying build/lib/flashinfer/cute_dsl/attention/fusion/mask.py -> build/bdist.linux-armv7l/wheel/./flashinfer/cute_dsl/attention/fusion 2026-04-14T21:02:53,220 copying build/lib/flashinfer/cute_dsl/attention/fusion/variant.py -> build/bdist.linux-armv7l/wheel/./flashinfer/cute_dsl/attention/fusion 2026-04-14T21:02:53,223 creating build/bdist.linux-armv7l/wheel/flashinfer/cute_dsl/attention/wrappers 2026-04-14T21:02:53,224 copying build/lib/flashinfer/cute_dsl/attention/wrappers/batch_prefill.py -> build/bdist.linux-armv7l/wheel/./flashinfer/cute_dsl/attention/wrappers 2026-04-14T21:02:53,227 copying build/lib/flashinfer/cute_dsl/attention/wrappers/__init__.py -> build/bdist.linux-armv7l/wheel/./flashinfer/cute_dsl/attention/wrappers 2026-04-14T21:02:53,228 copying build/lib/flashinfer/cute_dsl/attention/wrappers/batch_mla.py -> build/bdist.linux-armv7l/wheel/./flashinfer/cute_dsl/attention/wrappers 2026-04-14T21:02:53,232 creating build/bdist.linux-armv7l/wheel/flashinfer/cute_dsl/attention/scheduler 2026-04-14T21:02:53,233 copying build/lib/flashinfer/cute_dsl/attention/scheduler/mla_persistent.py -> build/bdist.linux-armv7l/wheel/./flashinfer/cute_dsl/attention/scheduler 2026-04-14T21:02:53,235 copying build/lib/flashinfer/cute_dsl/attention/scheduler/__init__.py -> build/bdist.linux-armv7l/wheel/./flashinfer/cute_dsl/attention/scheduler 2026-04-14T21:02:53,238 copying build/lib/flashinfer/cute_dsl/attention/scheduler/persistent.py -> build/bdist.linux-armv7l/wheel/./flashinfer/cute_dsl/attention/scheduler 2026-04-14T21:02:53,240 copying build/lib/flashinfer/cute_dsl/attention/mla_decode.py -> build/bdist.linux-armv7l/wheel/./flashinfer/cute_dsl/attention 2026-04-14T21:02:53,242 copying build/lib/flashinfer/cute_dsl/attention/mainloop_spec.py -> build/bdist.linux-armv7l/wheel/./flashinfer/cute_dsl/attention 2026-04-14T21:02:53,244 copying build/lib/flashinfer/cute_dsl/add_rmsnorm_fp4quant.py -> build/bdist.linux-armv7l/wheel/./flashinfer/cute_dsl 2026-04-14T21:02:53,247 copying build/lib/flashinfer/cute_dsl/utils.py -> build/bdist.linux-armv7l/wheel/./flashinfer/cute_dsl 2026-04-14T21:02:53,249 copying build/lib/flashinfer/_build_meta.py -> build/bdist.linux-armv7l/wheel/./flashinfer 2026-04-14T21:02:53,251 copying build/lib/flashinfer/deep_gemm.py -> build/bdist.linux-armv7l/wheel/./flashinfer 2026-04-14T21:02:53,255 creating build/bdist.linux-armv7l/wheel/flashinfer/gdn_kernels 2026-04-14T21:02:53,256 copying build/lib/flashinfer/gdn_kernels/__init__.py -> build/bdist.linux-armv7l/wheel/./flashinfer/gdn_kernels 2026-04-14T21:02:53,258 copying build/lib/flashinfer/gdn_kernels/gdn_decode_nontranspose.py -> build/bdist.linux-armv7l/wheel/./flashinfer/gdn_kernels 2026-04-14T21:02:53,260 copying build/lib/flashinfer/gdn_kernels/gdn_decode_pretranspose.py -> build/bdist.linux-armv7l/wheel/./flashinfer/gdn_kernels 2026-04-14T21:02:53,264 creating build/bdist.linux-armv7l/wheel/flashinfer/gdn_kernels/blackwell 2026-04-14T21:02:53,265 copying build/lib/flashinfer/gdn_kernels/blackwell/gated_delta_net_chunked.py -> build/bdist.linux-armv7l/wheel/./flashinfer/gdn_kernels/blackwell 2026-04-14T21:02:53,271 copying build/lib/flashinfer/gdn_kernels/blackwell/gdn_prefill.py -> build/bdist.linux-armv7l/wheel/./flashinfer/gdn_kernels/blackwell 2026-04-14T21:02:53,273 copying build/lib/flashinfer/gdn_kernels/blackwell/__init__.py -> build/bdist.linux-armv7l/wheel/./flashinfer/gdn_kernels/blackwell 2026-04-14T21:02:53,275 copying build/lib/flashinfer/gdn_kernels/blackwell/gated_delta_net_tile_scheduler.py -> build/bdist.linux-armv7l/wheel/./flashinfer/gdn_kernels/blackwell 2026-04-14T21:02:53,277 copying build/lib/flashinfer/gdn_kernels/gdn_decode_bf16_state.py -> build/bdist.linux-armv7l/wheel/./flashinfer/gdn_kernels 2026-04-14T21:02:53,282 copying build/lib/flashinfer/gdn_kernels/gdn_decode_mtp.py -> build/bdist.linux-armv7l/wheel/./flashinfer/gdn_kernels 2026-04-14T21:02:53,287 creating build/bdist.linux-armv7l/wheel/flashinfer/testing 2026-04-14T21:02:53,288 copying build/lib/flashinfer/testing/__init__.py -> build/bdist.linux-armv7l/wheel/./flashinfer/testing 2026-04-14T21:02:53,290 copying build/lib/flashinfer/testing/utils.py -> build/bdist.linux-armv7l/wheel/./flashinfer/testing 2026-04-14T21:02:53,293 copying build/lib/flashinfer/sparse.py -> build/bdist.linux-armv7l/wheel/./flashinfer 2026-04-14T21:02:53,296 copying build/lib/flashinfer/xqa.py -> build/bdist.linux-armv7l/wheel/./flashinfer 2026-04-14T21:02:53,299 creating build/bdist.linux-armv7l/wheel/flashinfer/tuning_configs 2026-04-14T21:02:53,300 copying build/lib/flashinfer/tuning_configs/v0_1_trtllm_fused_moe_NVIDIA_B200.py -> build/bdist.linux-armv7l/wheel/./flashinfer/tuning_configs 2026-04-14T21:02:53,302 copying build/lib/flashinfer/tuning_configs/v0_1_trtllm_fused_moe_NVIDIA_GB200.py -> build/bdist.linux-armv7l/wheel/./flashinfer/tuning_configs 2026-04-14T21:02:53,305 copying build/lib/flashinfer/tllm_enums.py -> build/bdist.linux-armv7l/wheel/./flashinfer 2026-04-14T21:02:53,307 creating build/bdist.linux-armv7l/wheel/flashinfer/data 2026-04-14T21:02:53,309 creating build/bdist.linux-armv7l/wheel/flashinfer/data/spdlog 2026-04-14T21:02:53,311 creating build/bdist.linux-armv7l/wheel/flashinfer/data/spdlog/include 2026-04-14T21:02:53,312 creating build/bdist.linux-armv7l/wheel/flashinfer/data/spdlog/include/spdlog 2026-04-14T21:02:53,313 copying build/lib/flashinfer/data/spdlog/include/spdlog/async_logger-inl.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog 2026-04-14T21:02:53,316 creating build/bdist.linux-armv7l/wheel/flashinfer/data/spdlog/include/spdlog/details 2026-04-14T21:02:53,317 copying build/lib/flashinfer/data/spdlog/include/spdlog/details/mpmc_blocking_q.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/details 2026-04-14T21:02:53,319 copying build/lib/flashinfer/data/spdlog/include/spdlog/details/file_helper-inl.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/details 2026-04-14T21:02:53,321 copying build/lib/flashinfer/data/spdlog/include/spdlog/details/file_helper.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/details 2026-04-14T21:02:53,323 copying build/lib/flashinfer/data/spdlog/include/spdlog/details/backtracer.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/details 2026-04-14T21:02:53,325 copying build/lib/flashinfer/data/spdlog/include/spdlog/details/synchronous_factory.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/details 2026-04-14T21:02:53,327 copying build/lib/flashinfer/data/spdlog/include/spdlog/details/fmt_helper.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/details 2026-04-14T21:02:53,329 copying build/lib/flashinfer/data/spdlog/include/spdlog/details/tcp_client-windows.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/details 2026-04-14T21:02:53,331 copying build/lib/flashinfer/data/spdlog/include/spdlog/details/tcp_client.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/details 2026-04-14T21:02:53,332 copying build/lib/flashinfer/data/spdlog/include/spdlog/details/periodic_worker-inl.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/details 2026-04-14T21:02:53,334 copying build/lib/flashinfer/data/spdlog/include/spdlog/details/circular_q.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/details 2026-04-14T21:02:53,336 copying build/lib/flashinfer/data/spdlog/include/spdlog/details/os.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/details 2026-04-14T21:02:53,338 copying build/lib/flashinfer/data/spdlog/include/spdlog/details/null_mutex.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/details 2026-04-14T21:02:53,340 copying build/lib/flashinfer/data/spdlog/include/spdlog/details/os-inl.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/details 2026-04-14T21:02:53,342 copying build/lib/flashinfer/data/spdlog/include/spdlog/details/thread_pool-inl.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/details 2026-04-14T21:02:53,345 copying build/lib/flashinfer/data/spdlog/include/spdlog/details/log_msg_buffer-inl.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/details 2026-04-14T21:02:53,347 copying build/lib/flashinfer/data/spdlog/include/spdlog/details/periodic_worker.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/details 2026-04-14T21:02:53,348 copying build/lib/flashinfer/data/spdlog/include/spdlog/details/console_globals.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/details 2026-04-14T21:02:53,350 copying build/lib/flashinfer/data/spdlog/include/spdlog/details/windows_include.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/details 2026-04-14T21:02:53,352 copying build/lib/flashinfer/data/spdlog/include/spdlog/details/udp_client-windows.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/details 2026-04-14T21:02:53,354 copying build/lib/flashinfer/data/spdlog/include/spdlog/details/log_msg_buffer.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/details 2026-04-14T21:02:53,356 copying build/lib/flashinfer/data/spdlog/include/spdlog/details/backtracer-inl.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/details 2026-04-14T21:02:53,358 copying build/lib/flashinfer/data/spdlog/include/spdlog/details/thread_pool.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/details 2026-04-14T21:02:53,360 copying build/lib/flashinfer/data/spdlog/include/spdlog/details/log_msg-inl.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/details 2026-04-14T21:02:53,361 copying build/lib/flashinfer/data/spdlog/include/spdlog/details/log_msg.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/details 2026-04-14T21:02:53,363 copying build/lib/flashinfer/data/spdlog/include/spdlog/details/registry-inl.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/details 2026-04-14T21:02:53,365 copying build/lib/flashinfer/data/spdlog/include/spdlog/details/udp_client.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/details 2026-04-14T21:02:53,367 copying build/lib/flashinfer/data/spdlog/include/spdlog/details/registry.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/details 2026-04-14T21:02:53,369 copying build/lib/flashinfer/data/spdlog/include/spdlog/mdc.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog 2026-04-14T21:02:53,371 copying build/lib/flashinfer/data/spdlog/include/spdlog/pattern_formatter-inl.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog 2026-04-14T21:02:53,374 copying build/lib/flashinfer/data/spdlog/include/spdlog/common-inl.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog 2026-04-14T21:02:53,376 copying build/lib/flashinfer/data/spdlog/include/spdlog/tweakme.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog 2026-04-14T21:02:53,378 copying build/lib/flashinfer/data/spdlog/include/spdlog/async.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog 2026-04-14T21:02:53,380 copying build/lib/flashinfer/data/spdlog/include/spdlog/formatter.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog 2026-04-14T21:02:53,382 creating build/bdist.linux-armv7l/wheel/flashinfer/data/spdlog/include/spdlog/fmt 2026-04-14T21:02:53,383 copying build/lib/flashinfer/data/spdlog/include/spdlog/fmt/chrono.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/fmt 2026-04-14T21:02:53,385 copying build/lib/flashinfer/data/spdlog/include/spdlog/fmt/compile.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/fmt 2026-04-14T21:02:53,387 copying build/lib/flashinfer/data/spdlog/include/spdlog/fmt/bin_to_hex.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/fmt 2026-04-14T21:02:53,390 creating build/bdist.linux-armv7l/wheel/flashinfer/data/spdlog/include/spdlog/fmt/bundled 2026-04-14T21:02:53,391 copying build/lib/flashinfer/data/spdlog/include/spdlog/fmt/bundled/chrono.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/fmt/bundled 2026-04-14T21:02:53,394 copying build/lib/flashinfer/data/spdlog/include/spdlog/fmt/bundled/args.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/fmt/bundled 2026-04-14T21:02:53,397 copying build/lib/flashinfer/data/spdlog/include/spdlog/fmt/bundled/core.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/fmt/bundled 2026-04-14T21:02:53,400 copying build/lib/flashinfer/data/spdlog/include/spdlog/fmt/bundled/compile.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/fmt/bundled 2026-04-14T21:02:53,403 copying build/lib/flashinfer/data/spdlog/include/spdlog/fmt/bundled/xchar.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/fmt/bundled 2026-04-14T21:02:53,406 copying build/lib/flashinfer/data/spdlog/include/spdlog/fmt/bundled/color.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/fmt/bundled 2026-04-14T21:02:53,408 copying build/lib/flashinfer/data/spdlog/include/spdlog/fmt/bundled/fmt.license.rst -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/fmt/bundled 2026-04-14T21:02:53,410 copying build/lib/flashinfer/data/spdlog/include/spdlog/fmt/bundled/os.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/fmt/bundled 2026-04-14T21:02:53,412 copying build/lib/flashinfer/data/spdlog/include/spdlog/fmt/bundled/std.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/fmt/bundled 2026-04-14T21:02:53,415 copying build/lib/flashinfer/data/spdlog/include/spdlog/fmt/bundled/format-inl.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/fmt/bundled 2026-04-14T21:02:53,418 copying build/lib/flashinfer/data/spdlog/include/spdlog/fmt/bundled/ranges.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/fmt/bundled 2026-04-14T21:02:53,421 copying build/lib/flashinfer/data/spdlog/include/spdlog/fmt/bundled/format.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/fmt/bundled 2026-04-14T21:02:53,425 copying build/lib/flashinfer/data/spdlog/include/spdlog/fmt/bundled/locale.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/fmt/bundled 2026-04-14T21:02:53,427 copying build/lib/flashinfer/data/spdlog/include/spdlog/fmt/bundled/ostream.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/fmt/bundled 2026-04-14T21:02:53,429 copying build/lib/flashinfer/data/spdlog/include/spdlog/fmt/bundled/printf.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/fmt/bundled 2026-04-14T21:02:53,432 copying build/lib/flashinfer/data/spdlog/include/spdlog/fmt/xchar.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/fmt 2026-04-14T21:02:53,433 copying build/lib/flashinfer/data/spdlog/include/spdlog/fmt/std.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/fmt 2026-04-14T21:02:53,435 copying build/lib/flashinfer/data/spdlog/include/spdlog/fmt/ranges.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/fmt 2026-04-14T21:02:53,437 copying build/lib/flashinfer/data/spdlog/include/spdlog/fmt/fmt.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/fmt 2026-04-14T21:02:53,439 copying build/lib/flashinfer/data/spdlog/include/spdlog/fmt/ostr.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/fmt 2026-04-14T21:02:53,441 creating build/bdist.linux-armv7l/wheel/flashinfer/data/spdlog/include/spdlog/sinks 2026-04-14T21:02:53,442 copying build/lib/flashinfer/data/spdlog/include/spdlog/sinks/null_sink.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/sinks 2026-04-14T21:02:53,444 copying build/lib/flashinfer/data/spdlog/include/spdlog/sinks/ostream_sink.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/sinks 2026-04-14T21:02:53,446 copying build/lib/flashinfer/data/spdlog/include/spdlog/sinks/ansicolor_sink-inl.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/sinks 2026-04-14T21:02:53,448 copying build/lib/flashinfer/data/spdlog/include/spdlog/sinks/wincolor_sink-inl.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/sinks 2026-04-14T21:02:53,451 copying build/lib/flashinfer/data/spdlog/include/spdlog/sinks/wincolor_sink.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/sinks 2026-04-14T21:02:53,452 copying build/lib/flashinfer/data/spdlog/include/spdlog/sinks/base_sink.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/sinks 2026-04-14T21:02:53,454 copying build/lib/flashinfer/data/spdlog/include/spdlog/sinks/sink.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/sinks 2026-04-14T21:02:53,456 copying build/lib/flashinfer/data/spdlog/include/spdlog/sinks/mongo_sink.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/sinks 2026-04-14T21:02:53,458 copying build/lib/flashinfer/data/spdlog/include/spdlog/sinks/android_sink.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/sinks 2026-04-14T21:02:53,460 copying build/lib/flashinfer/data/spdlog/include/spdlog/sinks/basic_file_sink.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/sinks 2026-04-14T21:02:53,462 copying build/lib/flashinfer/data/spdlog/include/spdlog/sinks/ringbuffer_sink.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/sinks 2026-04-14T21:02:53,464 copying build/lib/flashinfer/data/spdlog/include/spdlog/sinks/qt_sinks.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/sinks 2026-04-14T21:02:53,466 copying build/lib/flashinfer/data/spdlog/include/spdlog/sinks/stdout_sinks.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/sinks 2026-04-14T21:02:53,468 copying build/lib/flashinfer/data/spdlog/include/spdlog/sinks/ansicolor_sink.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/sinks 2026-04-14T21:02:53,470 copying build/lib/flashinfer/data/spdlog/include/spdlog/sinks/dup_filter_sink.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/sinks 2026-04-14T21:02:53,472 copying build/lib/flashinfer/data/spdlog/include/spdlog/sinks/sink-inl.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/sinks 2026-04-14T21:02:53,473 copying build/lib/flashinfer/data/spdlog/include/spdlog/sinks/rotating_file_sink-inl.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/sinks 2026-04-14T21:02:53,476 copying build/lib/flashinfer/data/spdlog/include/spdlog/sinks/tcp_sink.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/sinks 2026-04-14T21:02:53,477 copying build/lib/flashinfer/data/spdlog/include/spdlog/sinks/systemd_sink.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/sinks 2026-04-14T21:02:53,480 copying build/lib/flashinfer/data/spdlog/include/spdlog/sinks/syslog_sink.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/sinks 2026-04-14T21:02:53,482 copying build/lib/flashinfer/data/spdlog/include/spdlog/sinks/win_eventlog_sink.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/sinks 2026-04-14T21:02:53,484 copying build/lib/flashinfer/data/spdlog/include/spdlog/sinks/dist_sink.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/sinks 2026-04-14T21:02:53,486 copying build/lib/flashinfer/data/spdlog/include/spdlog/sinks/rotating_file_sink.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/sinks 2026-04-14T21:02:53,488 copying build/lib/flashinfer/data/spdlog/include/spdlog/sinks/udp_sink.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/sinks 2026-04-14T21:02:53,490 copying build/lib/flashinfer/data/spdlog/include/spdlog/sinks/stdout_color_sinks.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/sinks 2026-04-14T21:02:53,492 copying build/lib/flashinfer/data/spdlog/include/spdlog/sinks/callback_sink.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/sinks 2026-04-14T21:02:53,494 copying build/lib/flashinfer/data/spdlog/include/spdlog/sinks/kafka_sink.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/sinks 2026-04-14T21:02:53,496 copying build/lib/flashinfer/data/spdlog/include/spdlog/sinks/daily_file_sink.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/sinks 2026-04-14T21:02:53,498 copying build/lib/flashinfer/data/spdlog/include/spdlog/sinks/stdout_color_sinks-inl.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/sinks 2026-04-14T21:02:53,500 copying build/lib/flashinfer/data/spdlog/include/spdlog/sinks/stdout_sinks-inl.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/sinks 2026-04-14T21:02:53,502 copying build/lib/flashinfer/data/spdlog/include/spdlog/sinks/hourly_file_sink.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/sinks 2026-04-14T21:02:53,504 copying build/lib/flashinfer/data/spdlog/include/spdlog/sinks/basic_file_sink-inl.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/sinks 2026-04-14T21:02:53,506 copying build/lib/flashinfer/data/spdlog/include/spdlog/sinks/msvc_sink.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/sinks 2026-04-14T21:02:53,508 copying build/lib/flashinfer/data/spdlog/include/spdlog/sinks/base_sink-inl.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/sinks 2026-04-14T21:02:53,510 copying build/lib/flashinfer/data/spdlog/include/spdlog/spdlog-inl.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog 2026-04-14T21:02:53,511 copying build/lib/flashinfer/data/spdlog/include/spdlog/logger.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog 2026-04-14T21:02:53,514 copying build/lib/flashinfer/data/spdlog/include/spdlog/version.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog 2026-04-14T21:02:53,515 copying build/lib/flashinfer/data/spdlog/include/spdlog/common.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog 2026-04-14T21:02:53,518 copying build/lib/flashinfer/data/spdlog/include/spdlog/pattern_formatter.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog 2026-04-14T21:02:53,520 copying build/lib/flashinfer/data/spdlog/include/spdlog/fwd.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog 2026-04-14T21:02:53,521 copying build/lib/flashinfer/data/spdlog/include/spdlog/logger-inl.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog 2026-04-14T21:02:53,524 creating build/bdist.linux-armv7l/wheel/flashinfer/data/spdlog/include/spdlog/cfg 2026-04-14T21:02:53,525 copying build/lib/flashinfer/data/spdlog/include/spdlog/cfg/helpers-inl.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/cfg 2026-04-14T21:02:53,527 copying build/lib/flashinfer/data/spdlog/include/spdlog/cfg/helpers.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/cfg 2026-04-14T21:02:53,529 copying build/lib/flashinfer/data/spdlog/include/spdlog/cfg/argv.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/cfg 2026-04-14T21:02:53,531 copying build/lib/flashinfer/data/spdlog/include/spdlog/cfg/env.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog/cfg 2026-04-14T21:02:53,532 copying build/lib/flashinfer/data/spdlog/include/spdlog/spdlog.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog 2026-04-14T21:02:53,535 copying build/lib/flashinfer/data/spdlog/include/spdlog/async_logger.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog 2026-04-14T21:02:53,537 copying build/lib/flashinfer/data/spdlog/include/spdlog/stopwatch.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/include/spdlog 2026-04-14T21:02:53,539 creating build/bdist.linux-armv7l/wheel/flashinfer/data/spdlog/scripts 2026-04-14T21:02:53,540 copying build/lib/flashinfer/data/spdlog/scripts/extract_version.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/spdlog/scripts 2026-04-14T21:02:53,542 copying build/lib/flashinfer/data/build_backend.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data 2026-04-14T21:02:53,545 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass 2026-04-14T21:02:53,546 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/test 2026-04-14T21:02:53,548 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/test/utils 2026-04-14T21:02:53,549 copying build/lib/flashinfer/data/cutlass/test/utils/test_sharding.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/test/utils 2026-04-14T21:02:53,552 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/test/python 2026-04-14T21:02:53,554 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/test/python/pycute 2026-04-14T21:02:53,555 copying build/lib/flashinfer/data/cutlass/test/python/pycute/test_complement.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/test/python/pycute 2026-04-14T21:02:53,557 copying build/lib/flashinfer/data/cutlass/test/python/pycute/test_int_tuple.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/test/python/pycute 2026-04-14T21:02:53,559 copying build/lib/flashinfer/data/cutlass/test/python/pycute/test_composition.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/test/python/pycute 2026-04-14T21:02:53,561 copying build/lib/flashinfer/data/cutlass/test/python/pycute/test_typing.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/test/python/pycute 2026-04-14T21:02:53,564 copying build/lib/flashinfer/data/cutlass/test/python/pycute/run_all_tests.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/test/python/pycute 2026-04-14T21:02:53,566 copying build/lib/flashinfer/data/cutlass/test/python/pycute/test_left_inverse.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/test/python/pycute 2026-04-14T21:02:53,567 copying build/lib/flashinfer/data/cutlass/test/python/pycute/test_right_inverse.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/test/python/pycute 2026-04-14T21:02:53,569 copying build/lib/flashinfer/data/cutlass/test/python/pycute/test_coalesce.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/test/python/pycute 2026-04-14T21:02:53,571 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/test/python/cutlass 2026-04-14T21:02:53,573 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/test/python/cutlass/gemm 2026-04-14T21:02:53,574 copying build/lib/flashinfer/data/cutlass/test/python/cutlass/gemm/gemm_f32_sm80.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/test/python/cutlass/gemm 2026-04-14T21:02:53,577 copying build/lib/flashinfer/data/cutlass/test/python/cutlass/gemm/gemm_f64_sm90.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/test/python/cutlass/gemm 2026-04-14T21:02:53,579 copying build/lib/flashinfer/data/cutlass/test/python/cutlass/gemm/gemm_s8_sm80.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/test/python/cutlass/gemm 2026-04-14T21:02:53,581 copying build/lib/flashinfer/data/cutlass/test/python/cutlass/gemm/gemm_s8_sm90.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/test/python/cutlass/gemm 2026-04-14T21:02:53,583 copying build/lib/flashinfer/data/cutlass/test/python/cutlass/gemm/run_all_tests.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/test/python/cutlass/gemm 2026-04-14T21:02:53,585 copying build/lib/flashinfer/data/cutlass/test/python/cutlass/gemm/gemm_mixed_sm80.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/test/python/cutlass/gemm 2026-04-14T21:02:53,586 copying build/lib/flashinfer/data/cutlass/test/python/cutlass/gemm/gemm_batched.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/test/python/cutlass/gemm 2026-04-14T21:02:53,589 copying build/lib/flashinfer/data/cutlass/test/python/cutlass/gemm/gemm_f16_sm90.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/test/python/cutlass/gemm 2026-04-14T21:02:53,591 copying build/lib/flashinfer/data/cutlass/test/python/cutlass/gemm/gemm_f64_sm80.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/test/python/cutlass/gemm 2026-04-14T21:02:53,593 copying build/lib/flashinfer/data/cutlass/test/python/cutlass/gemm/gemm_testbed.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/test/python/cutlass/gemm 2026-04-14T21:02:53,595 copying build/lib/flashinfer/data/cutlass/test/python/cutlass/gemm/gemm_f8_sm90.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/test/python/cutlass/gemm 2026-04-14T21:02:53,598 copying build/lib/flashinfer/data/cutlass/test/python/cutlass/gemm/gemm_f16_sm80.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/test/python/cutlass/gemm 2026-04-14T21:02:53,600 copying build/lib/flashinfer/data/cutlass/test/python/cutlass/gemm/utils.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/test/python/cutlass/gemm 2026-04-14T21:02:53,602 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/test/python/cutlass/interface 2026-04-14T21:02:53,603 copying build/lib/flashinfer/data/cutlass/test/python/cutlass/interface/gemm_interface.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/test/python/cutlass/interface 2026-04-14T21:02:53,606 copying build/lib/flashinfer/data/cutlass/test/python/cutlass/interface/conv2d_interface.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/test/python/cutlass/interface 2026-04-14T21:02:53,608 copying build/lib/flashinfer/data/cutlass/test/python/cutlass/interface/evt_interface.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/test/python/cutlass/interface 2026-04-14T21:02:53,611 copying build/lib/flashinfer/data/cutlass/test/python/cutlass/interface/utils.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/test/python/cutlass/interface 2026-04-14T21:02:53,612 copying build/lib/flashinfer/data/cutlass/test/python/cutlass/installation.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/test/python/cutlass 2026-04-14T21:02:53,615 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/test/python/cutlass/evt 2026-04-14T21:02:53,616 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/test/python/cutlass/evt/utils 2026-04-14T21:02:53,617 copying build/lib/flashinfer/data/cutlass/test/python/cutlass/evt/utils/evt_testbed.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/test/python/cutlass/evt/utils 2026-04-14T21:02:53,619 copying build/lib/flashinfer/data/cutlass/test/python/cutlass/evt/run_all_tests.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/test/python/cutlass/evt 2026-04-14T21:02:53,621 copying build/lib/flashinfer/data/cutlass/test/python/cutlass/evt/evt_load_sm80_90.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/test/python/cutlass/evt 2026-04-14T21:02:53,624 copying build/lib/flashinfer/data/cutlass/test/python/cutlass/evt/evt_store_sm80_90.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/test/python/cutlass/evt 2026-04-14T21:02:53,626 copying build/lib/flashinfer/data/cutlass/test/python/cutlass/evt/evt_layout_sm80_90.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/test/python/cutlass/evt 2026-04-14T21:02:53,628 copying build/lib/flashinfer/data/cutlass/test/python/cutlass/evt/evt_mixed_sm80_90.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/test/python/cutlass/evt 2026-04-14T21:02:53,630 copying build/lib/flashinfer/data/cutlass/test/python/cutlass/evt/evt_compute_sm80_90.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/test/python/cutlass/evt 2026-04-14T21:02:53,633 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/test/python/cutlass/emit 2026-04-14T21:02:53,634 copying build/lib/flashinfer/data/cutlass/test/python/cutlass/emit/pytorch.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/test/python/cutlass/emit 2026-04-14T21:02:53,637 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/test/python/cutlass/conv2d 2026-04-14T21:02:53,638 copying build/lib/flashinfer/data/cutlass/test/python/cutlass/conv2d/conv2d_test_utils.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/test/python/cutlass/conv2d 2026-04-14T21:02:53,641 copying build/lib/flashinfer/data/cutlass/test/python/cutlass/conv2d/conv2d_sm80.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/test/python/cutlass/conv2d 2026-04-14T21:02:53,643 copying build/lib/flashinfer/data/cutlass/test/python/cutlass/conv2d/run_all_tests.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/test/python/cutlass/conv2d 2026-04-14T21:02:53,645 copying build/lib/flashinfer/data/cutlass/test/python/cutlass/conv2d/conv2d_problem_sizes.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/test/python/cutlass/conv2d 2026-04-14T21:02:53,648 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/test/examples 2026-04-14T21:02:53,650 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/test/examples/CuTeDSL 2026-04-14T21:02:53,651 copying build/lib/flashinfer/data/cutlass/test/examples/CuTeDSL/conftest.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/test/examples/CuTeDSL 2026-04-14T21:02:53,654 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/test/examples/CuTeDSL/sm_100a 2026-04-14T21:02:53,655 copying build/lib/flashinfer/data/cutlass/test/examples/CuTeDSL/sm_100a/conftest.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/test/examples/CuTeDSL/sm_100a 2026-04-14T21:02:53,657 copying build/lib/flashinfer/data/cutlass/test/examples/CuTeDSL/sm_100a/test_rmsnorm.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/test/examples/CuTeDSL/sm_100a 2026-04-14T21:02:53,659 copying build/lib/flashinfer/data/cutlass/test/examples/CuTeDSL/sm_100a/test_dense_gemm_persistent_prefetch.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/test/examples/CuTeDSL/sm_100a 2026-04-14T21:02:53,661 copying build/lib/flashinfer/data/cutlass/test/examples/CuTeDSL/sm_100a/test_dense_blockscaled_gemm_persistent_prefetch.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/test/examples/CuTeDSL/sm_100a 2026-04-14T21:02:53,663 copying build/lib/flashinfer/data/cutlass/test/examples/CuTeDSL/sm_100a/test_tutorial_gemm.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/test/examples/CuTeDSL/sm_100a 2026-04-14T21:02:53,666 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/test/unit 2026-04-14T21:02:53,667 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/test/unit/gemm 2026-04-14T21:02:53,669 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/test/unit/gemm/device 2026-04-14T21:02:53,670 copying build/lib/flashinfer/data/cutlass/test/unit/gemm/device/simt_sm50.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/test/unit/gemm/device 2026-04-14T21:02:53,673 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/python 2026-04-14T21:02:53,674 copying build/lib/flashinfer/data/cutlass/python/setup_pycute.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python 2026-04-14T21:02:53,677 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/python/cutlass_cppgen 2026-04-14T21:02:53,679 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/python/cutlass_cppgen/epilogue 2026-04-14T21:02:53,680 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/epilogue/__init__.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen/epilogue 2026-04-14T21:02:53,682 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/epilogue/evt_ops.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen/epilogue 2026-04-14T21:02:53,684 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/epilogue/epilogue.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen/epilogue 2026-04-14T21:02:53,687 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/python/cutlass_cppgen/utils 2026-04-14T21:02:53,688 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/utils/__init__.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen/utils 2026-04-14T21:02:53,691 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/utils/profiler.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen/utils 2026-04-14T21:02:53,693 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/utils/check.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen/utils 2026-04-14T21:02:53,695 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/utils/lazy_import.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen/utils 2026-04-14T21:02:53,697 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/utils/datatypes.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen/utils 2026-04-14T21:02:53,700 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/__init__.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen 2026-04-14T21:02:53,702 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/python/cutlass_cppgen/backend 2026-04-14T21:02:53,704 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/python/cutlass_cppgen/backend/utils 2026-04-14T21:02:53,705 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/utils/__init__.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen/backend/utils 2026-04-14T21:02:53,707 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/utils/device.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen/backend/utils 2026-04-14T21:02:53,709 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/__init__.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen/backend 2026-04-14T21:02:53,711 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/memory_manager.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen/backend 2026-04-14T21:02:53,713 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/operation.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen/backend 2026-04-14T21:02:53,716 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/conv2d_operation.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen/backend 2026-04-14T21:02:53,718 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/library.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen/backend 2026-04-14T21:02:53,721 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/epilogue.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen/backend 2026-04-14T21:02:53,723 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/compiler.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen/backend 2026-04-14T21:02:53,726 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/gemm_operation.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen/backend 2026-04-14T21:02:53,730 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt 2026-04-14T21:02:53,731 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/__init__.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt 2026-04-14T21:02:53,734 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/backend 2026-04-14T21:02:53,735 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/backend/__init__.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/backend 2026-04-14T21:02:53,737 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/backend/sm80_emitter.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/backend 2026-04-14T21:02:53,739 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/backend/sm100_emitter.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/backend 2026-04-14T21:02:53,741 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/backend/emitter_base.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/backend 2026-04-14T21:02:53,743 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/backend/sm90_emitter.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/backend 2026-04-14T21:02:53,745 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/backend/sm90_nodes.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/backend 2026-04-14T21:02:53,747 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/backend/sm80_nodes.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/backend 2026-04-14T21:02:53,750 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/backend/sm100_nodes.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/backend 2026-04-14T21:02:53,752 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/passes 2026-04-14T21:02:53,754 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/passes/pass_shape_type_propagation.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/passes 2026-04-14T21:02:53,756 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/passes/pass_fix_element_d.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/passes 2026-04-14T21:02:53,757 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/passes/__init__.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/passes 2026-04-14T21:02:53,759 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/passes/util.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/passes 2026-04-14T21:02:53,761 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/passes/pass_argument_type.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/passes 2026-04-14T21:02:53,763 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/passes/smem_size_calculator.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/passes 2026-04-14T21:02:53,767 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/passes/pass_layout_elimination.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/passes 2026-04-14T21:02:53,769 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/passes/pass_get_impl.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/passes 2026-04-14T21:02:53,772 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/passes/graph_drawer.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/passes 2026-04-14T21:02:53,774 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/passes/pass_dag_2_tree.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/passes 2026-04-14T21:02:53,776 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/passes/pass_manager.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/passes 2026-04-14T21:02:53,778 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/passes/pass_no_op_elimination.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/passes 2026-04-14T21:02:53,780 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/passes/pass_preprocess_red.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/passes 2026-04-14T21:02:53,783 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/ir 2026-04-14T21:02:53,784 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/ir/__init__.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/ir 2026-04-14T21:02:53,786 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/ir/load_nodes.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/ir 2026-04-14T21:02:53,789 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/ir/compute_nodes.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/ir 2026-04-14T21:02:53,791 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/ir/store_nodes.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/ir 2026-04-14T21:02:53,793 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/ir/node.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/ir 2026-04-14T21:02:53,796 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/ir/dag_ir.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/ir 2026-04-14T21:02:53,798 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/ir/layout_algorithm.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/ir 2026-04-14T21:02:53,801 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/ir/layout_nodes.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/ir 2026-04-14T21:02:53,803 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/ir/tensor.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/ir 2026-04-14T21:02:53,806 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/epilogue.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt 2026-04-14T21:02:53,808 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/frontend 2026-04-14T21:02:53,810 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/frontend/__init__.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/frontend 2026-04-14T21:02:53,812 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/frontend/frontend_base.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/frontend 2026-04-14T21:02:53,814 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/frontend/python_ast.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/frontend 2026-04-14T21:02:53,817 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/frontend.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen/backend 2026-04-14T21:02:53,819 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/c_types.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen/backend 2026-04-14T21:02:53,821 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/type_hint.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen/backend 2026-04-14T21:02:53,823 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/reduction_operation.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen/backend 2026-04-14T21:02:53,826 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/backend/arguments.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen/backend 2026-04-14T21:02:53,828 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/swizzle.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen 2026-04-14T21:02:53,830 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/shape.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen 2026-04-14T21:02:53,833 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/library_defaults.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen 2026-04-14T21:02:53,836 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/python/cutlass_cppgen/op 2026-04-14T21:02:53,837 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/op/__init__.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen/op 2026-04-14T21:02:53,839 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/op/gemm.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen/op 2026-04-14T21:02:53,842 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/op/gemm_grouped.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen/op 2026-04-14T21:02:53,844 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/op/op.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen/op 2026-04-14T21:02:53,847 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/op/conv.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen/op 2026-04-14T21:02:53,850 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/python/cutlass_cppgen/emit 2026-04-14T21:02:53,851 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/emit/__init__.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen/emit 2026-04-14T21:02:53,853 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/emit/pytorch.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen/emit 2026-04-14T21:02:53,856 copying build/lib/flashinfer/data/cutlass/python/cutlass_cppgen/emit/common.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_cppgen/emit 2026-04-14T21:02:53,859 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/python/pycute 2026-04-14T21:02:53,861 copying build/lib/flashinfer/data/cutlass/python/pycute/__init__.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/pycute 2026-04-14T21:02:53,863 copying build/lib/flashinfer/data/cutlass/python/pycute/swizzle.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/pycute 2026-04-14T21:02:53,865 copying build/lib/flashinfer/data/cutlass/python/pycute/layout.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/pycute 2026-04-14T21:02:53,868 copying build/lib/flashinfer/data/cutlass/python/pycute/typing.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/pycute 2026-04-14T21:02:53,870 copying build/lib/flashinfer/data/cutlass/python/pycute/int_tuple.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/pycute 2026-04-14T21:02:53,873 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/python/cutlass_library 2026-04-14T21:02:53,874 copying build/lib/flashinfer/data/cutlass/python/cutlass_library/generator.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_library 2026-04-14T21:02:53,888 copying build/lib/flashinfer/data/cutlass/python/cutlass_library/rank_k_operation.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_library 2026-04-14T21:02:53,891 copying build/lib/flashinfer/data/cutlass/python/cutlass_library/trmm_operation.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_library 2026-04-14T21:02:53,893 copying build/lib/flashinfer/data/cutlass/python/cutlass_library/emit_kernel_listing.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_library 2026-04-14T21:02:53,896 copying build/lib/flashinfer/data/cutlass/python/cutlass_library/__init__.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_library 2026-04-14T21:02:53,898 copying build/lib/flashinfer/data/cutlass/python/cutlass_library/conv3x_emitter.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_library 2026-04-14T21:02:53,901 copying build/lib/flashinfer/data/cutlass/python/cutlass_library/conv2d_operation.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_library 2026-04-14T21:02:53,903 copying build/lib/flashinfer/data/cutlass/python/cutlass_library/manifest.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_library 2026-04-14T21:02:53,906 copying build/lib/flashinfer/data/cutlass/python/cutlass_library/sm100_shapes.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_library 2026-04-14T21:02:53,908 copying build/lib/flashinfer/data/cutlass/python/cutlass_library/library.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_library 2026-04-14T21:02:53,911 copying build/lib/flashinfer/data/cutlass/python/cutlass_library/sm90_utils.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_library 2026-04-14T21:02:53,914 copying build/lib/flashinfer/data/cutlass/python/cutlass_library/heuristics.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_library 2026-04-14T21:02:53,917 copying build/lib/flashinfer/data/cutlass/python/cutlass_library/sm100_utils.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_library 2026-04-14T21:02:53,919 copying build/lib/flashinfer/data/cutlass/python/cutlass_library/gemm_operation.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_library 2026-04-14T21:02:53,923 copying build/lib/flashinfer/data/cutlass/python/cutlass_library/sm90_shapes.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_library 2026-04-14T21:02:53,925 copying build/lib/flashinfer/data/cutlass/python/cutlass_library/heuristics_provider.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_library 2026-04-14T21:02:53,927 copying build/lib/flashinfer/data/cutlass/python/cutlass_library/symm_operation.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_library 2026-04-14T21:02:53,930 copying build/lib/flashinfer/data/cutlass/python/cutlass_library/rank_2k_operation.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_library 2026-04-14T21:02:53,932 copying build/lib/flashinfer/data/cutlass/python/cutlass_library/conv3d_operation.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/cutlass_library 2026-04-14T21:02:53,934 copying build/lib/flashinfer/data/cutlass/python/setup_library.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python 2026-04-14T21:02:53,937 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/python/CuTeDSL 2026-04-14T21:02:53,939 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/python/CuTeDSL/cutlass 2026-04-14T21:02:53,940 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils 2026-04-14T21:02:53,941 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils/hopper_helpers.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils 2026-04-14T21:02:53,944 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils/__init__.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils 2026-04-14T21:02:53,946 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils/print_latex.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils 2026-04-14T21:02:53,949 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils/grouped_gemm_tile_scheduler_helper.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils 2026-04-14T21:02:53,951 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils/distributed.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils 2026-04-14T21:02:53,953 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils/tensormap_manager.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils 2026-04-14T21:02:53,955 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils/grouped_gemm_persistent_tile_scheduler.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils 2026-04-14T21:02:53,958 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils/layout.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils 2026-04-14T21:02:53,960 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils/blockscaled_layout.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils 2026-04-14T21:02:53,962 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils/tmem_allocator.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils 2026-04-14T21:02:53,965 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils/tensor_helpers.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils 2026-04-14T21:02:53,967 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils/hardware_info.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils 2026-04-14T21:02:53,969 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils/gemm 2026-04-14T21:02:53,970 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils/gemm/sm100.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils/gemm 2026-04-14T21:02:53,973 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils/gemm/__init__.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils/gemm 2026-04-14T21:02:53,975 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils/dynamic_persistent_tile_scheduler.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils 2026-04-14T21:02:53,977 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils/blackwell_helpers.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils 2026-04-14T21:02:53,980 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils/static_persistent_tile_scheduler.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils 2026-04-14T21:02:53,983 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils/mixed_input_helpers.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils 2026-04-14T21:02:53,986 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils/smem_allocator.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils 2026-04-14T21:02:53,988 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/__init__.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass 2026-04-14T21:02:53,991 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl 2026-04-14T21:02:53,993 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/utils 2026-04-14T21:02:53,994 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/utils/__init__.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/utils 2026-04-14T21:02:53,996 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/utils/tree_utils.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/utils 2026-04-14T21:02:53,998 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/utils/timer.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/utils 2026-04-14T21:02:54,000 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/utils/numpy.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/utils 2026-04-14T21:02:54,002 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/utils/logger.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/utils 2026-04-14T21:02:54,004 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/utils/stacktrace.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/utils 2026-04-14T21:02:54,006 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/__init__.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl 2026-04-14T21:02:54,009 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/runtime 2026-04-14T21:02:54,009 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/runtime/cuda.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/runtime 2026-04-14T21:02:54,012 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/runtime/__init__.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/runtime 2026-04-14T21:02:54,014 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/runtime/tensor_descriptor.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/runtime 2026-04-14T21:02:54,016 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/runtime/device_tensor.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/runtime 2026-04-14T21:02:54,019 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/runtime/dlpack_types.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/runtime 2026-04-14T21:02:54,021 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/runtime/stream_adapter.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/runtime 2026-04-14T21:02:54,023 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/runtime/jit_arg_adapters.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/runtime 2026-04-14T21:02:54,026 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/tvm_ffi_builder 2026-04-14T21:02:54,027 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/tvm_ffi_builder/call_provider.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/tvm_ffi_builder 2026-04-14T21:02:54,030 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/tvm_ffi_builder/mlir_builder.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/tvm_ffi_builder 2026-04-14T21:02:54,032 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/tvm_ffi_builder/__init__.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/tvm_ffi_builder 2026-04-14T21:02:54,034 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/tvm_ffi_builder/spec.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/tvm_ffi_builder 2026-04-14T21:02:54,036 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/tvm_ffi_builder/tvm_ffi_builder.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/tvm_ffi_builder 2026-04-14T21:02:54,040 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/env_manager.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl 2026-04-14T21:02:54,042 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/arch.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl 2026-04-14T21:02:54,045 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/dsl.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl 2026-04-14T21:02:54,048 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/compiler.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl 2026-04-14T21:02:54,050 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/common.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl 2026-04-14T21:02:54,053 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/export 2026-04-14T21:02:54,054 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/export/__init__.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/export 2026-04-14T21:02:54,056 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/export/external_binary_module.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/export 2026-04-14T21:02:54,059 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/export/export.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/export 2026-04-14T21:02:54,061 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/export/c_header_generator.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/export 2026-04-14T21:02:54,064 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/typing.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl 2026-04-14T21:02:54,067 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/ast_helpers.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl 2026-04-14T21:02:54,070 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/cache_helpers.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl 2026-04-14T21:02:54,073 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/_mlir_helpers 2026-04-14T21:02:54,074 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/_mlir_helpers/__init__.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/_mlir_helpers 2026-04-14T21:02:54,076 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/_mlir_helpers/op.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/_mlir_helpers 2026-04-14T21:02:54,078 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/_mlir_helpers/lru_cache_ir.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/_mlir_helpers 2026-04-14T21:02:54,080 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/_mlir_helpers/gpu.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/_mlir_helpers 2026-04-14T21:02:54,082 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/_mlir_helpers/arith.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/_mlir_helpers 2026-04-14T21:02:54,084 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/version_info.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl 2026-04-14T21:02:54,086 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/jit_executor.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl 2026-04-14T21:02:54,089 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/ast_preprocessor.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl 2026-04-14T21:02:54,093 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute 2026-04-14T21:02:54,094 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/runtime.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute 2026-04-14T21:02:54,097 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/ffi.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute 2026-04-14T21:02:54,100 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/experimental 2026-04-14T21:02:54,101 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/experimental/pipeline.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/experimental 2026-04-14T21:02:54,103 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/experimental/__init__.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/experimental 2026-04-14T21:02:54,105 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/experimental/memory.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/experimental 2026-04-14T21:02:54,107 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/experimental/algorithm.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/experimental 2026-04-14T21:02:54,110 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/experimental/core.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/experimental 2026-04-14T21:02:54,112 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/experimental/math.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/experimental 2026-04-14T21:02:54,114 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/experimental/utils.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/experimental 2026-04-14T21:02:54,117 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/arch 2026-04-14T21:02:54,118 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/arch/tmem.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/arch 2026-04-14T21:02:54,120 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/arch/__init__.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/arch 2026-04-14T21:02:54,122 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/arch/numeric_conversion.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/arch 2026-04-14T21:02:54,125 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/arch/smem.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/arch 2026-04-14T21:02:54,127 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/arch/nvvm_wrappers.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/arch 2026-04-14T21:02:54,131 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/arch/mbar.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/arch 2026-04-14T21:02:54,133 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/arch/elect.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/arch 2026-04-14T21:02:54,135 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/arch/clc.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/arch 2026-04-14T21:02:54,137 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/__init__.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute 2026-04-14T21:02:54,140 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/testing.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute 2026-04-14T21:02:54,143 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/_tvm_ffi_args_spec_converter.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute 2026-04-14T21:02:54,146 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu 2026-04-14T21:02:54,147 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/warp 2026-04-14T21:02:54,148 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/warp/copy.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/warp 2026-04-14T21:02:54,151 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/warp/__init__.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/warp 2026-04-14T21:02:54,152 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/warp/mma.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/warp 2026-04-14T21:02:54,154 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/__init__.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu 2026-04-14T21:02:54,156 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/helpers.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu 2026-04-14T21:02:54,159 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/common.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu 2026-04-14T21:02:54,161 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/tcgen05 2026-04-14T21:02:54,162 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/tcgen05/copy.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/tcgen05 2026-04-14T21:02:54,165 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/tcgen05/__init__.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/tcgen05 2026-04-14T21:02:54,167 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/tcgen05/helpers.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/tcgen05 2026-04-14T21:02:54,169 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/tcgen05/mma.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/tcgen05 2026-04-14T21:02:54,173 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/cpasync 2026-04-14T21:02:54,174 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/cpasync/copy.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/cpasync 2026-04-14T21:02:54,177 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/cpasync/__init__.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/cpasync 2026-04-14T21:02:54,179 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/cpasync/helpers.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/cpasync 2026-04-14T21:02:54,182 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/warpgroup 2026-04-14T21:02:54,183 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/warpgroup/__init__.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/warpgroup 2026-04-14T21:02:54,185 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/warpgroup/helpers.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/warpgroup 2026-04-14T21:02:54,187 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/warpgroup/mma.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/warpgroup 2026-04-14T21:02:54,190 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/export 2026-04-14T21:02:54,191 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/export/__init__.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/export 2026-04-14T21:02:54,193 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/export/load.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/export 2026-04-14T21:02:54,195 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/export/export.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/export 2026-04-14T21:02:54,197 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/export/aot_config.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/export 2026-04-14T21:02:54,199 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/export/c_header_generator.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/export 2026-04-14T21:02:54,202 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/tuple.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute 2026-04-14T21:02:54,205 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/algorithm.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute 2026-04-14T21:02:54,208 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/typing.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute 2026-04-14T21:02:54,210 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/core.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute 2026-04-14T21:02:54,216 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/tensor.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute 2026-04-14T21:02:54,219 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/atom.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute 2026-04-14T21:02:54,222 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/math.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute 2026-04-14T21:02:54,225 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/python/CuTeDSL/cutlass/jax 2026-04-14T21:02:54,226 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/jax/ffi.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/jax 2026-04-14T21:02:54,229 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/jax/__init__.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/jax 2026-04-14T21:02:54,231 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/jax/testing.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/jax 2026-04-14T21:02:54,234 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/jax/primitive.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/jax 2026-04-14T21:02:54,236 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/jax/compile.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/jax 2026-04-14T21:02:54,238 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/jax/types.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/jax 2026-04-14T21:02:54,241 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/torch.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass 2026-04-14T21:02:54,244 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/python/CuTeDSL/cutlass/pipeline 2026-04-14T21:02:54,245 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/pipeline/sm100.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/pipeline 2026-04-14T21:02:54,248 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/pipeline/sm90.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/pipeline 2026-04-14T21:02:54,251 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/pipeline/__init__.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/pipeline 2026-04-14T21:02:54,253 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/pipeline/helpers.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/pipeline 2026-04-14T21:02:54,257 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cutlass_dsl 2026-04-14T21:02:54,258 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cutlass_dsl/cutlass.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/cutlass_dsl 2026-04-14T21:02:54,262 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cutlass_dsl/tvm_ffi_provider.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/cutlass_dsl 2026-04-14T21:02:54,265 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cutlass_dsl/__init__.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/cutlass_dsl 2026-04-14T21:02:54,267 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cutlass_dsl/cutlass_ast_decorators.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/cutlass_dsl 2026-04-14T21:02:54,270 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cutlass_dsl/cuda_stream_adapter.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/cutlass_dsl 2026-04-14T21:02:54,272 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/cutlass_dsl/cuda_jit_executor.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass/cutlass_dsl 2026-04-14T21:02:54,274 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/cutlass/impl_utils.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL/cutlass 2026-04-14T21:02:54,276 copying build/lib/flashinfer/data/cutlass/python/CuTeDSL/prep_editable_install.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/CuTeDSL 2026-04-14T21:02:54,279 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/python/docs_src 2026-04-14T21:02:54,280 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/python/docs_src/source 2026-04-14T21:02:54,282 copying build/lib/flashinfer/data/cutlass/python/docs_src/source/conf.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python/docs_src/source 2026-04-14T21:02:54,284 copying build/lib/flashinfer/data/cutlass/python/setup_cutlass.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/python 2026-04-14T21:02:54,287 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/tools 2026-04-14T21:02:54,288 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/tools/util 2026-04-14T21:02:54,290 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/tools/util/include 2026-04-14T21:02:54,292 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/tools/util/include/cutlass 2026-04-14T21:02:54,294 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/tools/util/include/cutlass/util 2026-04-14T21:02:54,295 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/host_uncompress.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util 2026-04-14T21:02:54,298 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/exceptions.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util 2026-04-14T21:02:54,300 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/host_tensor_planar_complex.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util 2026-04-14T21:02:54,303 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/device_nhwc_padding.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util 2026-04-14T21:02:54,306 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference 2026-04-14T21:02:54,307 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/device 2026-04-14T21:02:54,309 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/device/gemm.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/device 2026-04-14T21:02:54,311 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/device/tensor_fill.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/device 2026-04-14T21:02:54,314 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/device/tensor_relu.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/device 2026-04-14T21:02:54,317 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/device/tensor_foreach.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/device 2026-04-14T21:02:54,319 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/device/tensor_reduce.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/device 2026-04-14T21:02:54,322 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/device/gemm_planar_complex.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/device 2026-04-14T21:02:54,325 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/device/rank_2k_complex.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/device 2026-04-14T21:02:54,327 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/device/convolution.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/device 2026-04-14T21:02:54,330 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/device/tensor_compare.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/device 2026-04-14T21:02:54,333 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/device/thread 2026-04-14T21:02:54,334 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/device/thread/gemm.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/device/thread 2026-04-14T21:02:54,337 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/device/kernel 2026-04-14T21:02:54,338 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/device/kernel/gemm.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/device/kernel 2026-04-14T21:02:54,340 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/device/kernel/tensor_elementwise.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/device/kernel 2026-04-14T21:02:54,343 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/device/kernel/tensor_foreach.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/device/kernel 2026-04-14T21:02:54,345 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/device/gemm_complex.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/device 2026-04-14T21:02:54,347 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/device/gett.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/device 2026-04-14T21:02:54,351 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host 2026-04-14T21:02:54,352 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host/gemm.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host 2026-04-14T21:02:54,355 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host/tensor_elementwise.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host 2026-04-14T21:02:54,357 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host/tensor_fill.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host 2026-04-14T21:02:54,360 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host/rank_k_complex.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host 2026-04-14T21:02:54,363 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host/symm.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host 2026-04-14T21:02:54,365 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host/trmm.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host 2026-04-14T21:02:54,368 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host/tensor_foreach.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host 2026-04-14T21:02:54,370 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host/symm_complex.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host 2026-04-14T21:02:54,373 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host/tensor_reduce.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host 2026-04-14T21:02:54,375 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host/rank_2k.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host 2026-04-14T21:02:54,377 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host/tensor_reduce.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host 2026-04-14T21:02:54,379 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host/tensor_copy.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host 2026-04-14T21:02:54,382 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host/gemm_planar_complex.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host 2026-04-14T21:02:54,384 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host/trmm_complex.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host 2026-04-14T21:02:54,386 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host/rank_2k_complex.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host 2026-04-14T21:02:54,389 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host/convolution.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host 2026-04-14T21:02:54,391 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host/tensor_compare.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host 2026-04-14T21:02:54,394 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host/tensor_compare.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host 2026-04-14T21:02:54,396 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host/conv.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host 2026-04-14T21:02:54,399 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host/error_metrics.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host 2026-04-14T21:02:54,401 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host/tensor_norm.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host 2026-04-14T21:02:54,403 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host/gemm_complex.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host 2026-04-14T21:02:54,405 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host/gett.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host 2026-04-14T21:02:54,408 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host/tensor_fill.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host 2026-04-14T21:02:54,411 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/detail 2026-04-14T21:02:54,412 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/detail/linear_to_coordinate.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/detail 2026-04-14T21:02:54,414 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/detail/inner_product.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/detail 2026-04-14T21:02:54,417 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/print_error.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util 2026-04-14T21:02:54,419 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/packed_stride.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util 2026-04-14T21:02:54,422 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/tensor_view_io.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util 2026-04-14T21:02:54,424 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/device_memory.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util 2026-04-14T21:02:54,426 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/cublas_wrappers.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util 2026-04-14T21:02:54,429 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/command_line.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util 2026-04-14T21:02:54,431 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/device_dump.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util 2026-04-14T21:02:54,434 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/gett_commandline.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util 2026-04-14T21:02:54,436 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/host_tensor.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util 2026-04-14T21:02:54,439 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/distribution.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util 2026-04-14T21:02:54,441 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/debug.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util 2026-04-14T21:02:54,444 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/device_nhwc_to_nchw.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util 2026-04-14T21:02:54,446 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/device_nhwc_pooling.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util 2026-04-14T21:02:54,449 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/helper_cuda.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util 2026-04-14T21:02:54,451 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/index_sequence.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util 2026-04-14T21:02:54,452 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/device_utils.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util 2026-04-14T21:02:54,454 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/type_traits.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util 2026-04-14T21:02:54,457 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/device_layernorm.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util 2026-04-14T21:02:54,459 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/host_reorder.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util 2026-04-14T21:02:54,462 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/device_rmsnorm.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util 2026-04-14T21:02:54,464 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/device_nchw_to_nhwc.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util 2026-04-14T21:02:54,467 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/GPU_Clock.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util 2026-04-14T21:02:54,468 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/mixed_dtype_utils.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util 2026-04-14T21:02:54,471 copying build/lib/flashinfer/data/cutlass/tools/util/include/cutlass/util/device_groupnorm.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/include/cutlass/util 2026-04-14T21:02:54,474 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/tools/util/scripts 2026-04-14T21:02:54,475 copying build/lib/flashinfer/data/cutlass/tools/util/scripts/split_test_cmake.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/tools/util/scripts 2026-04-14T21:02:54,479 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/examples 2026-04-14T21:02:54,480 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/examples/44_multi_gemm_ir_and_codegen 2026-04-14T21:02:54,482 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/examples/44_multi_gemm_ir_and_codegen/ir_gen 2026-04-14T21:02:54,484 copying build/lib/flashinfer/data/cutlass/examples/44_multi_gemm_ir_and_codegen/ir_gen/gen_sample.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/44_multi_gemm_ir_and_codegen/ir_gen 2026-04-14T21:02:54,487 copying build/lib/flashinfer/data/cutlass/examples/44_multi_gemm_ir_and_codegen/ir_gen/gen_device.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/44_multi_gemm_ir_and_codegen/ir_gen 2026-04-14T21:02:54,489 copying build/lib/flashinfer/data/cutlass/examples/44_multi_gemm_ir_and_codegen/ir_gen/gen_verify.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/44_multi_gemm_ir_and_codegen/ir_gen 2026-04-14T21:02:54,491 copying build/lib/flashinfer/data/cutlass/examples/44_multi_gemm_ir_and_codegen/ir_gen/gen_ir.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/44_multi_gemm_ir_and_codegen/ir_gen 2026-04-14T21:02:54,494 copying build/lib/flashinfer/data/cutlass/examples/44_multi_gemm_ir_and_codegen/ir_gen/gen_kernel.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/44_multi_gemm_ir_and_codegen/ir_gen 2026-04-14T21:02:54,496 copying build/lib/flashinfer/data/cutlass/examples/44_multi_gemm_ir_and_codegen/ir_gen/replace_fix_impl_header.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/44_multi_gemm_ir_and_codegen/ir_gen 2026-04-14T21:02:54,498 copying build/lib/flashinfer/data/cutlass/examples/44_multi_gemm_ir_and_codegen/ir_gen/gen_cmake.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/44_multi_gemm_ir_and_codegen/ir_gen 2026-04-14T21:02:54,501 copying build/lib/flashinfer/data/cutlass/examples/44_multi_gemm_ir_and_codegen/ir_gen/gen_all_code.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/44_multi_gemm_ir_and_codegen/ir_gen 2026-04-14T21:02:54,503 copying build/lib/flashinfer/data/cutlass/examples/44_multi_gemm_ir_and_codegen/ir_gen/gen_threadblock.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/44_multi_gemm_ir_and_codegen/ir_gen 2026-04-14T21:02:54,506 copying build/lib/flashinfer/data/cutlass/examples/44_multi_gemm_ir_and_codegen/ir_gen/gen_turing_and_volta.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/44_multi_gemm_ir_and_codegen/ir_gen 2026-04-14T21:02:54,508 copying build/lib/flashinfer/data/cutlass/examples/44_multi_gemm_ir_and_codegen/ir_gen/gen_customized_epilogue.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/44_multi_gemm_ir_and_codegen/ir_gen 2026-04-14T21:02:54,511 copying build/lib/flashinfer/data/cutlass/examples/44_multi_gemm_ir_and_codegen/ir_gen/helper.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/44_multi_gemm_ir_and_codegen/ir_gen 2026-04-14T21:02:54,513 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/examples/40_cutlass_py 2026-04-14T21:02:54,514 copying build/lib/flashinfer/data/cutlass/examples/40_cutlass_py/conv2d.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/40_cutlass_py 2026-04-14T21:02:54,517 copying build/lib/flashinfer/data/cutlass/examples/40_cutlass_py/gemm.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/40_cutlass_py 2026-04-14T21:02:54,519 copying build/lib/flashinfer/data/cutlass/examples/40_cutlass_py/gemm_grouped.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/40_cutlass_py 2026-04-14T21:02:54,522 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/examples/40_cutlass_py/customizable 2026-04-14T21:02:54,523 copying build/lib/flashinfer/data/cutlass/examples/40_cutlass_py/customizable/conv2d.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/40_cutlass_py/customizable 2026-04-14T21:02:54,526 copying build/lib/flashinfer/data/cutlass/examples/40_cutlass_py/customizable/gemm.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/40_cutlass_py/customizable 2026-04-14T21:02:54,528 copying build/lib/flashinfer/data/cutlass/examples/40_cutlass_py/customizable/gemm_grouped.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/40_cutlass_py/customizable 2026-04-14T21:02:54,532 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/examples/python 2026-04-14T21:02:54,533 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/examples/python/CuTeDSL 2026-04-14T21:02:54,535 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/examples/python/CuTeDSL/experimental 2026-04-14T21:02:54,537 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/examples/python/CuTeDSL/experimental/ampere 2026-04-14T21:02:54,538 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/experimental/ampere/memcpy_simt_universal_copy.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/experimental/ampere 2026-04-14T21:02:54,542 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/examples/python/CuTeDSL/experimental/blackwell 2026-04-14T21:02:54,543 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/experimental/blackwell/dense_gemm_cute_pipeline.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/experimental/blackwell 2026-04-14T21:02:54,546 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/experimental/blackwell/dense_gemm_2sm.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/experimental/blackwell 2026-04-14T21:02:54,549 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/experimental/blackwell/dense_block_scaled_gemm.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/experimental/blackwell 2026-04-14T21:02:54,552 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/experimental/blackwell/dense_gemm_ptr_array.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/experimental/blackwell 2026-04-14T21:02:54,555 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/experimental/blackwell/dense_gemm.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/experimental/blackwell 2026-04-14T21:02:54,560 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/examples/python/CuTeDSL/ampere 2026-04-14T21:02:54,561 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/ampere/flash_attention_v2.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/ampere 2026-04-14T21:02:54,564 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/ampere/call_from_jit.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/ampere 2026-04-14T21:02:54,567 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/ampere/tensorop_gemm.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/ampere 2026-04-14T21:02:54,570 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/ampere/cooperative_launch.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/ampere 2026-04-14T21:02:54,572 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/ampere/elementwise_add.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/ampere 2026-04-14T21:02:54,575 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/ampere/elementwise_add_autotune.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/ampere 2026-04-14T21:02:54,578 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/ampere/sgemm.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/ampere 2026-04-14T21:02:54,581 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/ampere/hstu_attention.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/ampere 2026-04-14T21:02:54,583 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/ampere/elementwise_apply.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/ampere 2026-04-14T21:02:54,586 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/ampere/dynamic_smem_size.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/ampere 2026-04-14T21:02:54,588 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/ampere/call_bypass_dlpack.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/ampere 2026-04-14T21:02:54,590 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/ampere/smem_allocator.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/ampere 2026-04-14T21:02:54,593 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/ampere/inline_ptx.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/ampere 2026-04-14T21:02:54,596 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/examples/python/CuTeDSL/utils 2026-04-14T21:02:54,597 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/utils/__init__.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/utils 2026-04-14T21:02:54,598 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/utils/fmha_helpers.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/utils 2026-04-14T21:02:54,601 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/utils/sparse_utils.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/utils 2026-04-14T21:02:54,604 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/utils/test_sparse_utils.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/utils 2026-04-14T21:02:54,607 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell_geforce 2026-04-14T21:02:54,608 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell_geforce/dense_gemm.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell_geforce 2026-04-14T21:02:54,612 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/examples/python/CuTeDSL/cute 2026-04-14T21:02:54,613 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/cute/print_latex.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/cute 2026-04-14T21:02:54,616 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/examples/python/CuTeDSL/cute/tvm_ffi 2026-04-14T21:02:54,617 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/cute/tvm_ffi/aot_use_in_torch.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/cute/tvm_ffi 2026-04-14T21:02:54,619 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/cute/tvm_ffi/error_reporting.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/cute/tvm_ffi 2026-04-14T21:02:54,621 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/cute/tvm_ffi/compile_with_fake_tensor.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/cute/tvm_ffi 2026-04-14T21:02:54,623 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/cute/tvm_ffi/ampere_gemm_with_fake_tensor.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/cute/tvm_ffi 2026-04-14T21:02:54,625 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/cute/tvm_ffi/jit_and_use_in_jax.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/cute/tvm_ffi 2026-04-14T21:02:54,627 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/cute/tvm_ffi/aot_export.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/cute/tvm_ffi 2026-04-14T21:02:54,629 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/cute/tvm_ffi/jit_and_use_in_torch.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/cute/tvm_ffi 2026-04-14T21:02:54,631 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/cute/tvm_ffi/aot_use_in_jax.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/cute/tvm_ffi 2026-04-14T21:02:54,633 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/cute/torch_fake_tensor.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/cute 2026-04-14T21:02:54,635 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/examples/python/CuTeDSL/cute/ffi 2026-04-14T21:02:54,636 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/cute/ffi/jit_argument.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/cute/ffi 2026-04-14T21:02:54,640 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/examples/python/CuTeDSL/cute/export 2026-04-14T21:02:54,641 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/cute/export/load_in_python.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/cute/export 2026-04-14T21:02:54,643 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/cute/export/export_to_c.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/cute/export 2026-04-14T21:02:54,645 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/examples/python/CuTeDSL/hopper 2026-04-14T21:02:54,646 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/hopper/cta_norm.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/hopper 2026-04-14T21:02:54,649 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/hopper/fmha.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/hopper 2026-04-14T21:02:54,653 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/hopper/dense_gemm_persistent.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/hopper 2026-04-14T21:02:54,656 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/hopper/dense_gemm.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/hopper 2026-04-14T21:02:54,660 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/examples/python/CuTeDSL/distributed 2026-04-14T21:02:54,662 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/distributed/all_reduce_simple.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/distributed 2026-04-14T21:02:54,664 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/distributed/distributed_gemm_reduce_scatter_blackwell.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/distributed 2026-04-14T21:02:54,669 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/distributed/distributed_gemm_all_reduce_blackwell.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/distributed 2026-04-14T21:02:54,673 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/distributed/all_reduce_tma.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/distributed 2026-04-14T21:02:54,676 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/distributed/all_reduce_one_shot_lamport.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/distributed 2026-04-14T21:02:54,679 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/distributed/all_reduce_two_shot_multimem.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/distributed 2026-04-14T21:02:54,681 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/distributed/distributed_all_gather_gemm_blackwell.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/distributed 2026-04-14T21:02:54,685 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/examples/python/CuTeDSL/jax 2026-04-14T21:02:54,686 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/jax/cutlass_call_basic.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/jax 2026-04-14T21:02:54,689 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/jax/elementwise_apply_example.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/jax 2026-04-14T21:02:54,691 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/jax/cutlass_call_export.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/jax 2026-04-14T21:02:54,693 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/jax/cutlass_call_sharding.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/jax 2026-04-14T21:02:54,697 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell 2026-04-14T21:02:54,698 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/fmha_bwd.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell 2026-04-14T21:02:54,702 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/grouped_blockscaled_gemm.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell 2026-04-14T21:02:54,707 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/mixed_input_gemm 2026-04-14T21:02:54,709 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/mixed_input_gemm/mixed_input_host_utils.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/mixed_input_gemm 2026-04-14T21:02:54,711 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/mixed_input_gemm/grouped_mixed_input_gemm_acc_scale.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/mixed_input_gemm 2026-04-14T21:02:54,715 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/mixed_input_gemm/mixed_input_gemm.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/mixed_input_gemm 2026-04-14T21:02:54,719 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/mixed_input_gemm/grouped_mixed_input_gemm.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/mixed_input_gemm 2026-04-14T21:02:54,724 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/epilogue 2026-04-14T21:02:54,726 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/epilogue/activation_custom_epilogue_dense_gemm.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/epilogue 2026-04-14T21:02:54,728 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/epilogue/common_efc.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/epilogue 2026-04-14T21:02:54,732 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/epilogue/custom_epilogue_dense_gemm.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/epilogue 2026-04-14T21:02:54,735 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/epilogue/synthetic_custom_epilogue_dense_gemm.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/epilogue 2026-04-14T21:02:54,738 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/epilogue/common_dense_gemm_efc.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/epilogue 2026-04-14T21:02:54,741 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/dense_gemm_software_pipeline.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell 2026-04-14T21:02:54,745 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/dense_blockscaled_gemm_persistent.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell 2026-04-14T21:02:54,751 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/tutorial_gemm 2026-04-14T21:02:54,752 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/tutorial_gemm/fp16_gemm_0.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/tutorial_gemm 2026-04-14T21:02:54,754 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/tutorial_gemm/nvfp4_gemm_1.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/tutorial_gemm 2026-04-14T21:02:54,758 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/tutorial_gemm/fp16_gemm_1.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/tutorial_gemm 2026-04-14T21:02:54,761 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/tutorial_gemm/nvfp4_gemm_0.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/tutorial_gemm 2026-04-14T21:02:54,764 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/tutorial_gemm/utils.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/tutorial_gemm 2026-04-14T21:02:54,766 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/mla.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell 2026-04-14T21:02:54,774 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/dense_gemm_alpha_beta_persistent.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell 2026-04-14T21:02:54,778 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/dense_blockscaled_gemm_persistent_amax.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell 2026-04-14T21:02:54,782 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/grouped_gemm.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell 2026-04-14T21:02:54,787 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/fmha.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell 2026-04-14T21:02:54,792 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/blockwise_gemm 2026-04-14T21:02:54,793 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/blockwise_gemm/masked_grouped_gemm.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/blockwise_gemm 2026-04-14T21:02:54,797 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/blockwise_gemm/contiguous_grouped_gemm.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/blockwise_gemm 2026-04-14T21:02:54,802 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/blockwise_gemm/blockwise_gemm.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/blockwise_gemm 2026-04-14T21:02:54,807 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/mla 2026-04-14T21:02:54,808 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/mla/mla_decode_fp16.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/mla 2026-04-14T21:02:54,814 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/mla/mla_decode_fp8.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/mla 2026-04-14T21:02:54,820 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/mla/mla_helpers.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/mla 2026-04-14T21:02:54,823 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/dense_blockscaled_gemm_persistent_prefetch.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell 2026-04-14T21:02:54,828 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/dense_gemm_persistent_prefetch.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell 2026-04-14T21:02:54,832 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/reduce.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell 2026-04-14T21:02:54,835 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/rmsnorm.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell 2026-04-14T21:02:54,838 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/sm103_dense_blockscaled_gemm_persistent.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell 2026-04-14T21:02:54,842 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/programmatic_dependent_launch.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell 2026-04-14T21:02:54,845 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/dense_gemm_persistent.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell 2026-04-14T21:02:54,849 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/mamba2_ssd 2026-04-14T21:02:54,850 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/mamba2_ssd/mamba2_ssd.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/mamba2_ssd 2026-04-14T21:02:54,855 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/mamba2_ssd/mamba2_ssd_tile_scheduler.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/mamba2_ssd 2026-04-14T21:02:54,857 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/mamba2_ssd/mamba2_ssd_reference.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/mamba2_ssd 2026-04-14T21:02:54,860 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/mixed_input_fmha 2026-04-14T21:02:54,861 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/mixed_input_fmha/prefill_helpers.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/mixed_input_fmha 2026-04-14T21:02:54,864 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/mixed_input_fmha/mixed_input_fmha_prefill_d512.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/mixed_input_fmha 2026-04-14T21:02:54,868 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/mixed_input_fmha/mixed_input_fmha_decode.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/mixed_input_fmha 2026-04-14T21:02:54,872 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/mixed_input_fmha/mixed_input_fmha_prefill_d256.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/mixed_input_fmha 2026-04-14T21:02:54,875 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/dense_gemm_persistent_dynamic.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell 2026-04-14T21:02:54,878 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/dense_gemm.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell 2026-04-14T21:02:54,883 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/examples/python/CuTeDSL/helpers 2026-04-14T21:02:54,884 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/helpers/__init__.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/helpers 2026-04-14T21:02:54,886 copying build/lib/flashinfer/data/cutlass/examples/python/CuTeDSL/helpers/fmha_helpers.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/python/CuTeDSL/helpers 2026-04-14T21:02:54,890 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/examples/41_fused_multi_head_attention 2026-04-14T21:02:54,891 copying build/lib/flashinfer/data/cutlass/examples/41_fused_multi_head_attention/fmha_backward_test.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/41_fused_multi_head_attention 2026-04-14T21:02:54,893 copying build/lib/flashinfer/data/cutlass/examples/41_fused_multi_head_attention/piped_subprocess.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/examples/41_fused_multi_head_attention 2026-04-14T21:02:54,896 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/include 2026-04-14T21:02:54,898 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/include/cute 2026-04-14T21:02:54,899 copying build/lib/flashinfer/data/cutlass/include/cute/swizzle.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute 2026-04-14T21:02:54,903 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/include/cute/arch 2026-04-14T21:02:54,904 copying build/lib/flashinfer/data/cutlass/include/cute/arch/mma_sm90_desc.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/arch 2026-04-14T21:02:54,906 copying build/lib/flashinfer/data/cutlass/include/cute/arch/mma_sm70.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/arch 2026-04-14T21:02:54,909 copying build/lib/flashinfer/data/cutlass/include/cute/arch/mma_sm90_gmma_ext.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/arch 2026-04-14T21:02:54,951 copying build/lib/flashinfer/data/cutlass/include/cute/arch/copy.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/arch 2026-04-14T21:02:54,953 copying build/lib/flashinfer/data/cutlass/include/cute/arch/mma_sm120_sparse.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/arch 2026-04-14T21:02:54,957 copying build/lib/flashinfer/data/cutlass/include/cute/arch/config.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/arch 2026-04-14T21:02:54,959 copying build/lib/flashinfer/data/cutlass/include/cute/arch/copy_sm90_tma.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/arch 2026-04-14T21:02:54,962 copying build/lib/flashinfer/data/cutlass/include/cute/arch/cluster_sm90.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/arch 2026-04-14T21:02:54,964 copying build/lib/flashinfer/data/cutlass/include/cute/arch/tmem_allocator_sm100.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/arch 2026-04-14T21:02:54,966 copying build/lib/flashinfer/data/cutlass/include/cute/arch/util.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/arch 2026-04-14T21:02:54,968 copying build/lib/flashinfer/data/cutlass/include/cute/arch/copy_sm100.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/arch 2026-04-14T21:02:54,976 copying build/lib/flashinfer/data/cutlass/include/cute/arch/mma_sm80.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/arch 2026-04-14T21:02:54,980 copying build/lib/flashinfer/data/cutlass/include/cute/arch/mma.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/arch 2026-04-14T21:02:54,982 copying build/lib/flashinfer/data/cutlass/include/cute/arch/mma_sm90_gmma_sparse.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/arch 2026-04-14T21:02:55,001 copying build/lib/flashinfer/data/cutlass/include/cute/arch/mma_sm61.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/arch 2026-04-14T21:02:55,003 copying build/lib/flashinfer/data/cutlass/include/cute/arch/copy_sm80.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/arch 2026-04-14T21:02:55,005 copying build/lib/flashinfer/data/cutlass/include/cute/arch/mma_sm100_umma.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/arch 2026-04-14T21:02:55,009 copying build/lib/flashinfer/data/cutlass/include/cute/arch/mma_sm100_desc.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/arch 2026-04-14T21:02:55,012 copying build/lib/flashinfer/data/cutlass/include/cute/arch/copy_sm50.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/arch 2026-04-14T21:02:55,014 copying build/lib/flashinfer/data/cutlass/include/cute/arch/copy_sm100_tma.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/arch 2026-04-14T21:02:55,017 copying build/lib/flashinfer/data/cutlass/include/cute/arch/mma_sm120.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/arch 2026-04-14T21:02:55,022 copying build/lib/flashinfer/data/cutlass/include/cute/arch/mma_sm75.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/arch 2026-04-14T21:02:55,024 copying build/lib/flashinfer/data/cutlass/include/cute/arch/mma_sm100.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/arch 2026-04-14T21:02:55,026 copying build/lib/flashinfer/data/cutlass/include/cute/arch/cluster_sm100.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/arch 2026-04-14T21:02:55,029 copying build/lib/flashinfer/data/cutlass/include/cute/arch/copy_sm90_desc.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/arch 2026-04-14T21:02:55,031 copying build/lib/flashinfer/data/cutlass/include/cute/arch/mma_sm90.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/arch 2026-04-14T21:02:55,038 copying build/lib/flashinfer/data/cutlass/include/cute/arch/mma_sm90_gmma_sparse_ext.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/arch 2026-04-14T21:02:55,094 copying build/lib/flashinfer/data/cutlass/include/cute/arch/copy_sm90.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/arch 2026-04-14T21:02:55,096 copying build/lib/flashinfer/data/cutlass/include/cute/arch/copy_sm75.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/arch 2026-04-14T21:02:55,099 copying build/lib/flashinfer/data/cutlass/include/cute/arch/mma_sm89.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/arch 2026-04-14T21:02:55,101 copying build/lib/flashinfer/data/cutlass/include/cute/arch/simd_sm100.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/arch 2026-04-14T21:02:55,103 copying build/lib/flashinfer/data/cutlass/include/cute/arch/mma_sm90_gmma.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/arch 2026-04-14T21:02:55,122 copying build/lib/flashinfer/data/cutlass/include/cute/pointer_sparse.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute 2026-04-14T21:02:55,125 copying build/lib/flashinfer/data/cutlass/include/cute/config.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute 2026-04-14T21:02:55,127 copying build/lib/flashinfer/data/cutlass/include/cute/stride.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute 2026-04-14T21:02:55,130 copying build/lib/flashinfer/data/cutlass/include/cute/pointer_base.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute 2026-04-14T21:02:55,133 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/include/cute/numeric 2026-04-14T21:02:55,135 copying build/lib/flashinfer/data/cutlass/include/cute/numeric/integer_sequence.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/numeric 2026-04-14T21:02:55,138 copying build/lib/flashinfer/data/cutlass/include/cute/numeric/integral_ratio.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/numeric 2026-04-14T21:02:55,140 copying build/lib/flashinfer/data/cutlass/include/cute/numeric/numeric_types.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/numeric 2026-04-14T21:02:55,143 copying build/lib/flashinfer/data/cutlass/include/cute/numeric/arithmetic_tuple.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/numeric 2026-04-14T21:02:55,146 copying build/lib/flashinfer/data/cutlass/include/cute/numeric/integral_constant.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/numeric 2026-04-14T21:02:55,149 copying build/lib/flashinfer/data/cutlass/include/cute/numeric/complex.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/numeric 2026-04-14T21:02:55,151 copying build/lib/flashinfer/data/cutlass/include/cute/numeric/math.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/numeric 2026-04-14T21:02:55,154 copying build/lib/flashinfer/data/cutlass/include/cute/numeric/real.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/numeric 2026-04-14T21:02:55,157 copying build/lib/flashinfer/data/cutlass/include/cute/numeric/int.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/numeric 2026-04-14T21:02:55,160 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/include/cute/container 2026-04-14T21:02:55,161 copying build/lib/flashinfer/data/cutlass/include/cute/container/array_aligned.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/container 2026-04-14T21:02:55,164 copying build/lib/flashinfer/data/cutlass/include/cute/container/array_subbyte.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/container 2026-04-14T21:02:55,167 copying build/lib/flashinfer/data/cutlass/include/cute/container/cuda_types.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/container 2026-04-14T21:02:55,170 copying build/lib/flashinfer/data/cutlass/include/cute/container/type_list.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/container 2026-04-14T21:02:55,172 copying build/lib/flashinfer/data/cutlass/include/cute/container/bit_field.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/container 2026-04-14T21:02:55,175 copying build/lib/flashinfer/data/cutlass/include/cute/container/alignment.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/container 2026-04-14T21:02:55,177 copying build/lib/flashinfer/data/cutlass/include/cute/container/tuple.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/container 2026-04-14T21:02:55,180 copying build/lib/flashinfer/data/cutlass/include/cute/container/array.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/container 2026-04-14T21:02:55,183 copying build/lib/flashinfer/data/cutlass/include/cute/tensor_zip.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute 2026-04-14T21:02:55,186 copying build/lib/flashinfer/data/cutlass/include/cute/swizzle_layout.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute 2026-04-14T21:02:55,189 copying build/lib/flashinfer/data/cutlass/include/cute/layout.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute 2026-04-14T21:02:55,194 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/include/cute/algorithm 2026-04-14T21:02:55,195 copying build/lib/flashinfer/data/cutlass/include/cute/algorithm/copy.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/algorithm 2026-04-14T21:02:55,199 copying build/lib/flashinfer/data/cutlass/include/cute/algorithm/prefetch.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/algorithm 2026-04-14T21:02:55,201 copying build/lib/flashinfer/data/cutlass/include/cute/algorithm/axpby.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/algorithm 2026-04-14T21:02:55,204 copying build/lib/flashinfer/data/cutlass/include/cute/algorithm/cooperative_copy.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/algorithm 2026-04-14T21:02:55,207 copying build/lib/flashinfer/data/cutlass/include/cute/algorithm/gemm.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/algorithm 2026-04-14T21:02:55,210 copying build/lib/flashinfer/data/cutlass/include/cute/algorithm/tensor_reduce.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/algorithm 2026-04-14T21:02:55,212 copying build/lib/flashinfer/data/cutlass/include/cute/algorithm/tuple_algorithms.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/algorithm 2026-04-14T21:02:55,216 copying build/lib/flashinfer/data/cutlass/include/cute/algorithm/clear.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/algorithm 2026-04-14T21:02:55,218 copying build/lib/flashinfer/data/cutlass/include/cute/algorithm/tensor_algorithms.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/algorithm 2026-04-14T21:02:55,221 copying build/lib/flashinfer/data/cutlass/include/cute/algorithm/prefer.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/algorithm 2026-04-14T21:02:55,223 copying build/lib/flashinfer/data/cutlass/include/cute/algorithm/fill.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/algorithm 2026-04-14T21:02:55,226 copying build/lib/flashinfer/data/cutlass/include/cute/algorithm/cooperative_gemm.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/algorithm 2026-04-14T21:02:55,229 copying build/lib/flashinfer/data/cutlass/include/cute/algorithm/functional.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/algorithm 2026-04-14T21:02:55,231 copying build/lib/flashinfer/data/cutlass/include/cute/underscore.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute 2026-04-14T21:02:55,234 copying build/lib/flashinfer/data/cutlass/include/cute/int_tuple.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute 2026-04-14T21:02:55,238 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/include/cute/atom 2026-04-14T21:02:55,240 copying build/lib/flashinfer/data/cutlass/include/cute/atom/mma_traits_sm90_gmma_ext.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/atom 2026-04-14T21:02:55,257 copying build/lib/flashinfer/data/cutlass/include/cute/atom/mma_atom.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/atom 2026-04-14T21:02:55,260 copying build/lib/flashinfer/data/cutlass/include/cute/atom/mma_traits_sm90_gmma_sparse.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/atom 2026-04-14T21:02:55,268 copying build/lib/flashinfer/data/cutlass/include/cute/atom/mma_traits_sm70.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/atom 2026-04-14T21:02:55,270 copying build/lib/flashinfer/data/cutlass/include/cute/atom/mma_traits_sm120_sparse.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/atom 2026-04-14T21:02:55,272 copying build/lib/flashinfer/data/cutlass/include/cute/atom/mma_traits_sm61.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/atom 2026-04-14T21:02:55,274 copying build/lib/flashinfer/data/cutlass/include/cute/atom/copy_traits_sm75.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/atom 2026-04-14T21:02:55,277 copying build/lib/flashinfer/data/cutlass/include/cute/atom/mma_traits_sm90_gmma_sparse_ext.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/atom 2026-04-14T21:02:55,288 copying build/lib/flashinfer/data/cutlass/include/cute/atom/mma_traits_sm90.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/atom 2026-04-14T21:02:55,290 copying build/lib/flashinfer/data/cutlass/include/cute/atom/mma_traits_sm89.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/atom 2026-04-14T21:02:55,292 copying build/lib/flashinfer/data/cutlass/include/cute/atom/copy_traits_sm100.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/atom 2026-04-14T21:02:55,297 copying build/lib/flashinfer/data/cutlass/include/cute/atom/copy_traits_sm90.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/atom 2026-04-14T21:02:55,299 copying build/lib/flashinfer/data/cutlass/include/cute/atom/copy_traits_sm100_im2col.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/atom 2026-04-14T21:02:55,302 copying build/lib/flashinfer/data/cutlass/include/cute/atom/copy_traits_sm80.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/atom 2026-04-14T21:02:55,304 copying build/lib/flashinfer/data/cutlass/include/cute/atom/copy_traits_sm50.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/atom 2026-04-14T21:02:55,306 copying build/lib/flashinfer/data/cutlass/include/cute/atom/mma_traits_sm80.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/atom 2026-04-14T21:02:55,309 copying build/lib/flashinfer/data/cutlass/include/cute/atom/partitioner.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/atom 2026-04-14T21:02:55,311 copying build/lib/flashinfer/data/cutlass/include/cute/atom/mma_traits_sm90_gmma.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/atom 2026-04-14T21:02:55,318 copying build/lib/flashinfer/data/cutlass/include/cute/atom/copy_traits.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/atom 2026-04-14T21:02:55,321 copying build/lib/flashinfer/data/cutlass/include/cute/atom/copy_atom.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/atom 2026-04-14T21:02:55,324 copying build/lib/flashinfer/data/cutlass/include/cute/atom/mma_traits_sm120.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/atom 2026-04-14T21:02:55,326 copying build/lib/flashinfer/data/cutlass/include/cute/atom/copy_traits_sm90_im2col.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/atom 2026-04-14T21:02:55,329 copying build/lib/flashinfer/data/cutlass/include/cute/atom/mma_traits.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/atom 2026-04-14T21:02:55,331 copying build/lib/flashinfer/data/cutlass/include/cute/atom/mma_traits_sm75.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/atom 2026-04-14T21:02:55,334 copying build/lib/flashinfer/data/cutlass/include/cute/atom/mma_traits_sm100.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/atom 2026-04-14T21:02:55,339 copying build/lib/flashinfer/data/cutlass/include/cute/atom/copy_traits_sm100_tma.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/atom 2026-04-14T21:02:55,342 copying build/lib/flashinfer/data/cutlass/include/cute/atom/copy_traits_sm90_tma_swizzle.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/atom 2026-04-14T21:02:55,344 copying build/lib/flashinfer/data/cutlass/include/cute/atom/copy_traits_sm90_tma.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/atom 2026-04-14T21:02:55,348 copying build/lib/flashinfer/data/cutlass/include/cute/layout_composed.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute 2026-04-14T21:02:55,351 copying build/lib/flashinfer/data/cutlass/include/cute/tensor.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute 2026-04-14T21:02:55,353 copying build/lib/flashinfer/data/cutlass/include/cute/pointer_flagged.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute 2026-04-14T21:02:55,355 copying build/lib/flashinfer/data/cutlass/include/cute/pointer_swizzle.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute 2026-04-14T21:02:55,357 copying build/lib/flashinfer/data/cutlass/include/cute/tensor_impl.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute 2026-04-14T21:02:55,361 copying build/lib/flashinfer/data/cutlass/include/cute/pointer.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute 2026-04-14T21:02:55,364 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/include/cute/util 2026-04-14T21:02:55,365 copying build/lib/flashinfer/data/cutlass/include/cute/util/type_traits.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/util 2026-04-14T21:02:55,368 copying build/lib/flashinfer/data/cutlass/include/cute/util/print_latex.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/util 2026-04-14T21:02:55,371 copying build/lib/flashinfer/data/cutlass/include/cute/util/print.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/util 2026-04-14T21:02:55,374 copying build/lib/flashinfer/data/cutlass/include/cute/util/print_tensor.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/util 2026-04-14T21:02:55,377 copying build/lib/flashinfer/data/cutlass/include/cute/util/print_svg.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/util 2026-04-14T21:02:55,379 copying build/lib/flashinfer/data/cutlass/include/cute/util/debug.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cute/util 2026-04-14T21:02:55,384 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/include/cutlass 2026-04-14T21:02:55,386 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/include/cutlass/platform 2026-04-14T21:02:55,388 copying build/lib/flashinfer/data/cutlass/include/cutlass/platform/platform.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/platform 2026-04-14T21:02:55,391 copying build/lib/flashinfer/data/cutlass/include/cutlass/uint128.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass 2026-04-14T21:02:55,394 copying build/lib/flashinfer/data/cutlass/include/cutlass/tensor_ref_planar_complex.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass 2026-04-14T21:02:55,396 copying build/lib/flashinfer/data/cutlass/include/cutlass/kernel_hardware_info.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass 2026-04-14T21:02:55,399 copying build/lib/flashinfer/data/cutlass/include/cutlass/array.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass 2026-04-14T21:02:55,404 copying build/lib/flashinfer/data/cutlass/include/cutlass/cutlass.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass 2026-04-14T21:02:55,407 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/include/cutlass/epilogue 2026-04-14T21:02:55,410 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-14T21:02:55,411 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/epilogue_planar_complex.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-14T21:02:55,414 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/epilogue_with_visitor_callbacks.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-14T21:02:55,417 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/direct_store_epilogue_iterator.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-14T21:02:55,420 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/default_epilogue_with_broadcast.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-14T21:02:55,423 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/default_epilogue_planar_complex.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-14T21:02:55,425 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/predicated_tile_iterator_affine.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-14T21:02:55,429 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/default_thread_map_wmma_tensor_op.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-14T21:02:55,431 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/epilogue_workspace.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-14T21:02:55,434 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/epilogue_base.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-14T21:02:55,437 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/default_epilogue_with_absmax.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-14T21:02:55,440 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/predicated_tile_iterator_blas3.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-14T21:02:55,444 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/default_epilogue_direct_store.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-14T21:02:55,446 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/predicated_tile_iterator_affine_layout_params.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-14T21:02:55,449 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/epilogue_depthwise.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-14T21:02:55,452 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/epilogue_with_scaling_factor.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-14T21:02:55,455 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/epilogue_streamk_with_broadcast.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-14T21:02:55,458 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/default_thread_map_volta_tensor_op.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-14T21:02:55,460 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/output_iterator_parameter.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-14T21:02:55,463 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/predicated_tile_iterator_params.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-14T21:02:55,466 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/default_epilogue_wmma_tensor_op.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-14T21:02:55,470 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/epilogue_base_streamk.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-14T21:02:55,473 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/default_epilogue_complex_tensor_op_blas3.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-14T21:02:55,477 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/predicated_tile_iterator_direct_conv.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-14T21:02:55,480 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/interleaved_epilogue.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-14T21:02:55,484 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/shared_load_iterator.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-14T21:02:55,487 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/epilogue_with_reduction.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-14T21:02:55,491 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/epilogue_direct_store.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-14T21:02:55,495 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/epilogue_with_broadcast.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-14T21:02:55,499 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/predicated_tile_iterator.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-14T21:02:55,503 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/default_epilogue_simt.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-14T21:02:55,507 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/default_epilogue_complex_tensor_op.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-14T21:02:55,510 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/shared_load_iterator_mixed.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-14T21:02:55,514 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/epilogue_smem_accumulator.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-14T21:02:55,518 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/fusion 2026-04-14T21:02:55,519 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/fusion/visitor_2x.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/fusion 2026-04-14T21:02:55,523 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/fusion/visitor_load.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/fusion 2026-04-14T21:02:55,526 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/fusion/visitors.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/fusion 2026-04-14T21:02:55,529 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/fusion/visitor_store.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/fusion 2026-04-14T21:02:55,532 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/fusion/visitor_compute.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/fusion 2026-04-14T21:02:55,535 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/epilogue_with_visitor.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-14T21:02:55,538 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/epilogue_with_absmax.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-14T21:02:55,542 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/shared_load_iterator_pitch_linear.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-14T21:02:55,545 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/output_tile_thread_map.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-14T21:02:55,548 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/default_epilogue_tensor_op_blas3.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-14T21:02:55,552 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/epilogue_visitor_with_softmax.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-14T21:02:55,555 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/predicated_tile_iterator_strided_dgrad.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-14T21:02:55,559 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/default_epilogue_tensor_op.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-14T21:02:55,562 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/default_epilogue_with_reduction.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-14T21:02:55,566 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/epilogue_gemm_k_reduction.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-14T21:02:55,568 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/epilogue.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-14T21:02:55,572 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/predicated_tile_iterator_conv.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-14T21:02:55,575 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/predicated_tile_iterator_predicates.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-14T21:02:55,578 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/default_epilogue_volta_tensor_op.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-14T21:02:55,581 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/default_thread_map_tensor_op.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-14T21:02:55,584 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/default_thread_map_simt.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/threadblock 2026-04-14T21:02:55,588 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/include/cutlass/epilogue/warp 2026-04-14T21:02:55,589 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/warp/fragment_iterator_volta_tensor_op.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/warp 2026-04-14T21:02:55,593 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/warp/tile_iterator_tensor_op_mixed.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/warp 2026-04-14T21:02:55,597 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/warp/fragment_iterator_wmma_tensor_op.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/warp 2026-04-14T21:02:55,599 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/warp/fragment_iterator_complex_tensor_op.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/warp 2026-04-14T21:02:55,602 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/warp/wmma_tensor_op_policy.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/warp 2026-04-14T21:02:55,605 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/warp/tile_iterator_tensor_op.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/warp 2026-04-14T21:02:55,608 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/warp/tile_iterator_wmma_tensor_op.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/warp 2026-04-14T21:02:55,611 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/warp/fragment_iterator_gaussian_complex_tensor_op.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/warp 2026-04-14T21:02:55,614 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/warp/tile_iterator_simt.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/warp 2026-04-14T21:02:55,617 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/warp/fragment_iterator_tensor_op.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/warp 2026-04-14T21:02:55,621 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/warp/simt_policy.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/warp 2026-04-14T21:02:55,623 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/warp/fragment_iterator_simt.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/warp 2026-04-14T21:02:55,626 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/warp/tensor_op_policy.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/warp 2026-04-14T21:02:55,629 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/warp/tile_iterator_volta_tensor_op.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/warp 2026-04-14T21:02:55,632 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/warp/volta_tensor_op_policy.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/warp 2026-04-14T21:02:55,636 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/include/cutlass/epilogue/thread 2026-04-14T21:02:55,637 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/thread/linear_combination_drelu.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/thread 2026-04-14T21:02:55,641 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/thread/linear_combination_silu.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/thread 2026-04-14T21:02:55,643 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/thread/activation.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/thread 2026-04-14T21:02:55,646 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/thread/linear_combination_relu.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/thread 2026-04-14T21:02:55,649 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/thread/linear_combination_bias_relu.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/thread 2026-04-14T21:02:55,653 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/thread/detail.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/thread 2026-04-14T21:02:55,655 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/thread/linear_combination_dgelu.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/thread 2026-04-14T21:02:55,658 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/thread/linear_combination_leaky_relu.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/thread 2026-04-14T21:02:55,661 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/thread/linear_combination_relu0.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/thread 2026-04-14T21:02:55,664 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/thread/linear_combination_tensor_broadcast.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/thread 2026-04-14T21:02:55,667 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/thread/linear_combination_params.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/thread 2026-04-14T21:02:55,670 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/thread/linear_combination_sigmoid.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/thread 2026-04-14T21:02:55,672 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/thread/linear_combination_generic_with_scaling.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/thread 2026-04-14T21:02:55,674 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/thread/linear_combination.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/thread 2026-04-14T21:02:55,677 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/thread/scale_type.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/thread 2026-04-14T21:02:55,679 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/thread/linear_combination_with_elementwise.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/thread 2026-04-14T21:02:55,681 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/thread/linear_combination_planar_complex.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/thread 2026-04-14T21:02:55,684 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/thread/linear_combination_generic.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/thread 2026-04-14T21:02:55,686 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/thread/linear_combination_residual_block.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/thread 2026-04-14T21:02:55,689 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/thread/linear_combination_gelu.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/thread 2026-04-14T21:02:55,691 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/thread/reduction_op.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/thread 2026-04-14T21:02:55,693 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/thread/linear_combination_bias_elementwise.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/thread 2026-04-14T21:02:55,695 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/thread/linear_combination_hardswish.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/thread 2026-04-14T21:02:55,697 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/thread/linear_combination_clamp.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/thread 2026-04-14T21:02:55,700 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/thread/conversion_op.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/thread 2026-04-14T21:02:55,703 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/include/cutlass/epilogue/collective 2026-04-14T21:02:55,704 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/collective/sm90_epilogue_tma_warpspecialized.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/collective 2026-04-14T21:02:55,707 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/collective/sm70_epilogue_vectorized.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/collective 2026-04-14T21:02:55,710 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/collective/sm90_epilogue_array_tma_warpspecialized.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/collective 2026-04-14T21:02:55,713 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/collective/sm100_epilogue_array_planar_complex_tma_warpspecialized.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/collective 2026-04-14T21:02:55,717 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/collective/detail.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/collective 2026-04-14T21:02:55,720 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/collective/collective_epilogue.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/collective 2026-04-14T21:02:55,722 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/collective/sm100_epilogue_planar_complex_tma_warpspecialized.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/collective 2026-04-14T21:02:55,726 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/collective/sm70_epilogue_vectorized_array.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/collective 2026-04-14T21:02:55,729 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/collective/collective_builder.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/collective 2026-04-14T21:02:55,732 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/collective/sm100_epilogue_array_tma_warpspecialized.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/collective 2026-04-14T21:02:55,736 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/collective/sm90_epilogue_tma_warpspecialized_bias_elementwise.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/collective 2026-04-14T21:02:55,740 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/include/cutlass/epilogue/collective/builders 2026-04-14T21:02:55,741 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/collective/builders/sm120_common.inl -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/collective/builders 2026-04-14T21:02:55,744 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/collective/builders/sm90_common.inl -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/collective/builders 2026-04-14T21:02:55,746 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/collective/builders/sm90_builder.inl -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/collective/builders 2026-04-14T21:02:55,750 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/collective/builders/sm103_builder.inl -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/collective/builders 2026-04-14T21:02:55,753 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/collective/builders/sm100_builder.inl -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/collective/builders 2026-04-14T21:02:55,760 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/collective/builders/sm120_builder.inl -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/collective/builders 2026-04-14T21:02:55,763 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/collective/sm100_epilogue_nosmem.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/collective 2026-04-14T21:02:55,767 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/collective/sm100_epilogue_array_nosmem.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/collective 2026-04-14T21:02:55,772 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/collective/epilogue_tensor_broadcast.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/collective 2026-04-14T21:02:55,775 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/collective/sm100_epilogue_array_planar_complex_nosmem.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/collective 2026-04-14T21:02:55,779 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/collective/default_epilogue_array.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/collective 2026-04-14T21:02:55,782 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/collective/default_epilogue.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/collective 2026-04-14T21:02:55,786 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/collective/sm100_epilogue_tma_warpspecialized.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/collective 2026-04-14T21:02:55,791 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/include/cutlass/epilogue/fusion 2026-04-14T21:02:55,792 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/fusion/sm90_visitor_compute_tma_warpspecialized.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/fusion 2026-04-14T21:02:55,796 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/fusion/sm90_callbacks_tma_warpspecialized.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/fusion 2026-04-14T21:02:55,802 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/fusion/callbacks.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/fusion 2026-04-14T21:02:55,805 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/fusion/sm90_visitor_tma_warpspecialized.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/fusion 2026-04-14T21:02:55,809 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/fusion/sm90_visitor_load_tma_warpspecialized.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/fusion 2026-04-14T21:02:55,813 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/fusion/operations.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/fusion 2026-04-14T21:02:55,816 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/fusion/sm90_visitor_topk_softmax.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/fusion 2026-04-14T21:02:55,820 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/fusion/sm120_callbacks_tma_warpspecialized.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/fusion 2026-04-14T21:02:55,824 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/fusion/sm100_visitor_store_tma_warpspecialized.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/fusion 2026-04-14T21:02:55,828 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/fusion/sm90_visitor_store_tma_warpspecialized.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/fusion 2026-04-14T21:02:55,832 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/fusion/sm100_callbacks_tma_warpspecialized.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/fusion 2026-04-14T21:02:55,837 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/fusion/sm100_visitor_compute_tma_warpspecialized.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/fusion 2026-04-14T21:02:55,840 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/fusion/sm120_visitor_store_tma_warpspecialized.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue/fusion 2026-04-14T21:02:55,844 copying build/lib/flashinfer/data/cutlass/include/cutlass/epilogue/dispatch_policy.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/epilogue 2026-04-14T21:02:55,848 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/include/cutlass/experimental 2026-04-14T21:02:55,850 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/include/cutlass/experimental/distributed 2026-04-14T21:02:55,852 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/include/cutlass/experimental/distributed/schedules 2026-04-14T21:02:55,854 copying build/lib/flashinfer/data/cutlass/include/cutlass/experimental/distributed/schedules/dist_gemm_base_schedule.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/experimental/distributed/schedules 2026-04-14T21:02:55,857 copying build/lib/flashinfer/data/cutlass/include/cutlass/experimental/distributed/schedules/dist_gemm_1d_schedules.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/experimental/distributed/schedules 2026-04-14T21:02:55,861 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/include/cutlass/experimental/distributed/device 2026-04-14T21:02:55,863 copying build/lib/flashinfer/data/cutlass/include/cutlass/experimental/distributed/device/detail.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/experimental/distributed/device 2026-04-14T21:02:55,866 copying build/lib/flashinfer/data/cutlass/include/cutlass/experimental/distributed/device/full_barrier.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/experimental/distributed/device 2026-04-14T21:02:55,868 copying build/lib/flashinfer/data/cutlass/include/cutlass/experimental/distributed/device/dist_gemm_universal_wrapper.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/experimental/distributed/device 2026-04-14T21:02:55,872 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/include/cutlass/experimental/distributed/kernel 2026-04-14T21:02:55,873 copying build/lib/flashinfer/data/cutlass/include/cutlass/experimental/distributed/kernel/detail.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/experimental/distributed/kernel 2026-04-14T21:02:55,876 copying build/lib/flashinfer/data/cutlass/include/cutlass/experimental/distributed/kernel/full_barrier.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/experimental/distributed/kernel 2026-04-14T21:02:55,878 copying build/lib/flashinfer/data/cutlass/include/cutlass/experimental/distributed/kernel/dist_gemm_kernel_wrapper.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/experimental/distributed/kernel 2026-04-14T21:02:55,881 copying build/lib/flashinfer/data/cutlass/include/cutlass/uint256.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass 2026-04-14T21:02:55,885 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/include/cutlass/arch 2026-04-14T21:02:55,887 copying build/lib/flashinfer/data/cutlass/include/cutlass/arch/cache_operation.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/arch 2026-04-14T21:02:55,889 copying build/lib/flashinfer/data/cutlass/include/cutlass/arch/mma_sm70.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/arch 2026-04-14T21:02:55,893 copying build/lib/flashinfer/data/cutlass/include/cutlass/arch/mma_sm50.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/arch 2026-04-14T21:02:55,896 copying build/lib/flashinfer/data/cutlass/include/cutlass/arch/mma_sparse_sm89.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/arch 2026-04-14T21:02:55,898 copying build/lib/flashinfer/data/cutlass/include/cutlass/arch/mma.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/arch 2026-04-14T21:02:55,901 copying build/lib/flashinfer/data/cutlass/include/cutlass/arch/wmma_sm75.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/arch 2026-04-14T21:02:55,904 copying build/lib/flashinfer/data/cutlass/include/cutlass/arch/wmma.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/arch 2026-04-14T21:02:55,907 copying build/lib/flashinfer/data/cutlass/include/cutlass/arch/config.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/arch 2026-04-14T21:02:55,910 copying build/lib/flashinfer/data/cutlass/include/cutlass/arch/arch.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/arch 2026-04-14T21:02:55,912 copying build/lib/flashinfer/data/cutlass/include/cutlass/arch/wmma_sm70.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/arch 2026-04-14T21:02:55,915 copying build/lib/flashinfer/data/cutlass/include/cutlass/arch/mma_sm75.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/arch 2026-04-14T21:02:55,918 copying build/lib/flashinfer/data/cutlass/include/cutlass/arch/memory.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/arch 2026-04-14T21:02:55,921 copying build/lib/flashinfer/data/cutlass/include/cutlass/arch/memory_sm80.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/arch 2026-04-14T21:02:55,924 copying build/lib/flashinfer/data/cutlass/include/cutlass/arch/grid_dependency_control.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/arch 2026-04-14T21:02:55,927 copying build/lib/flashinfer/data/cutlass/include/cutlass/arch/simd_sm61.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/arch 2026-04-14T21:02:55,930 copying build/lib/flashinfer/data/cutlass/include/cutlass/arch/wmma_sm72.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/arch 2026-04-14T21:02:55,932 copying build/lib/flashinfer/data/cutlass/include/cutlass/arch/memory_sm75.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/arch 2026-04-14T21:02:55,935 copying build/lib/flashinfer/data/cutlass/include/cutlass/arch/synclog.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/arch 2026-04-14T21:02:55,938 copying build/lib/flashinfer/data/cutlass/include/cutlass/arch/mma_sm80.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/arch 2026-04-14T21:02:55,941 copying build/lib/flashinfer/data/cutlass/include/cutlass/arch/mma_sm60.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/arch 2026-04-14T21:02:55,943 copying build/lib/flashinfer/data/cutlass/include/cutlass/arch/reg_reconfig.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/arch 2026-04-14T21:02:55,945 copying build/lib/flashinfer/data/cutlass/include/cutlass/arch/mma_sm89.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/arch 2026-04-14T21:02:55,948 copying build/lib/flashinfer/data/cutlass/include/cutlass/arch/mma_sm61.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/arch 2026-04-14T21:02:55,950 copying build/lib/flashinfer/data/cutlass/include/cutlass/arch/barrier.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/arch 2026-04-14T21:02:55,953 copying build/lib/flashinfer/data/cutlass/include/cutlass/arch/mma_sm90.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/arch 2026-04-14T21:02:55,955 copying build/lib/flashinfer/data/cutlass/include/cutlass/arch/simd_sm60.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/arch 2026-04-14T21:02:55,957 copying build/lib/flashinfer/data/cutlass/include/cutlass/arch/mma_sparse_sm80.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/arch 2026-04-14T21:02:55,960 copying build/lib/flashinfer/data/cutlass/include/cutlass/arch/mma_sm100.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/arch 2026-04-14T21:02:55,962 copying build/lib/flashinfer/data/cutlass/include/cutlass/arch/simd.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/arch 2026-04-14T21:02:55,965 copying build/lib/flashinfer/data/cutlass/include/cutlass/tensor_view_planar_complex.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass 2026-04-14T21:02:55,967 copying build/lib/flashinfer/data/cutlass/include/cutlass/numeric_size.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass 2026-04-14T21:02:55,969 copying build/lib/flashinfer/data/cutlass/include/cutlass/blas3_types.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass 2026-04-14T21:02:55,972 copying build/lib/flashinfer/data/cutlass/include/cutlass/kernel_hardware_info.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass 2026-04-14T21:02:55,974 copying build/lib/flashinfer/data/cutlass/include/cutlass/subbyte_reference.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass 2026-04-14T21:02:55,977 copying build/lib/flashinfer/data/cutlass/include/cutlass/tensor_ref.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass 2026-04-14T21:02:55,979 copying build/lib/flashinfer/data/cutlass/include/cutlass/floating_point_nvrtc.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass 2026-04-14T21:02:55,981 copying build/lib/flashinfer/data/cutlass/include/cutlass/real.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass 2026-04-14T21:02:55,983 copying build/lib/flashinfer/data/cutlass/include/cutlass/numeric_conversion.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass 2026-04-14T21:02:55,990 copying build/lib/flashinfer/data/cutlass/include/cutlass/semaphore.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass 2026-04-14T21:02:55,992 copying build/lib/flashinfer/data/cutlass/include/cutlass/tensor_coord.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass 2026-04-14T21:02:55,994 copying build/lib/flashinfer/data/cutlass/include/cutlass/float_subbyte.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass 2026-04-14T21:02:55,997 copying build/lib/flashinfer/data/cutlass/include/cutlass/aligned_buffer.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass 2026-04-14T21:02:55,999 copying build/lib/flashinfer/data/cutlass/include/cutlass/block_striped.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass 2026-04-14T21:02:56,002 copying build/lib/flashinfer/data/cutlass/include/cutlass/constants.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass 2026-04-14T21:02:56,005 copying build/lib/flashinfer/data/cutlass/include/cutlass/functional.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass 2026-04-14T21:02:56,008 copying build/lib/flashinfer/data/cutlass/include/cutlass/matrix.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass 2026-04-14T21:02:56,015 copying build/lib/flashinfer/data/cutlass/include/cutlass/coord.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass 2026-04-14T21:02:56,017 copying build/lib/flashinfer/data/cutlass/include/cutlass/exmy_base.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass 2026-04-14T21:02:56,021 copying build/lib/flashinfer/data/cutlass/include/cutlass/cluster_launch.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass 2026-04-14T21:02:56,023 copying build/lib/flashinfer/data/cutlass/include/cutlass/integer_subbyte.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass 2026-04-14T21:02:56,026 copying build/lib/flashinfer/data/cutlass/include/cutlass/version.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass 2026-04-14T21:02:56,029 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/include/cutlass/layout 2026-04-14T21:02:56,030 copying build/lib/flashinfer/data/cutlass/include/cutlass/layout/vector.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/layout 2026-04-14T21:02:56,032 copying build/lib/flashinfer/data/cutlass/include/cutlass/layout/matrix.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/layout 2026-04-14T21:02:56,035 copying build/lib/flashinfer/data/cutlass/include/cutlass/layout/tensor_op_multiplicand_sm70.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/layout 2026-04-14T21:02:56,038 copying build/lib/flashinfer/data/cutlass/include/cutlass/layout/tensor_op_multiplicand_sm75.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/layout 2026-04-14T21:02:56,041 copying build/lib/flashinfer/data/cutlass/include/cutlass/layout/tensor_op_multiplicand_sm80.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/layout 2026-04-14T21:02:56,044 copying build/lib/flashinfer/data/cutlass/include/cutlass/layout/tensor.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/layout 2026-04-14T21:02:56,047 copying build/lib/flashinfer/data/cutlass/include/cutlass/layout/pitch_linear.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/layout 2026-04-14T21:02:56,049 copying build/lib/flashinfer/data/cutlass/include/cutlass/layout/permute.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/layout 2026-04-14T21:02:56,052 copying build/lib/flashinfer/data/cutlass/include/cutlass/layout/layout.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/layout 2026-04-14T21:02:56,054 copying build/lib/flashinfer/data/cutlass/include/cutlass/device_kernel.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass 2026-04-14T21:02:56,057 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/include/cutlass/gemm 2026-04-14T21:02:56,058 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/gemm.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm 2026-04-14T21:02:56,061 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-14T21:02:56,062 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock/mma_with_reduction_multistage.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-14T21:02:56,065 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock/default_mma_core_wmma.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-14T21:02:56,068 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock/ell_mma_pipelined.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-14T21:02:56,071 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock/default_multistage_mma_complex_core_sm80.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-14T21:02:56,074 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock/mma_planar_complex_pipelined.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-14T21:02:56,077 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock/default_gemv_core.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-14T21:02:56,079 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock/default_mma_core_sparse_sm80.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-14T21:02:56,083 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock/threadblock_swizzle.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-14T21:02:56,085 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock/default_mma_core_with_access_size.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-14T21:02:56,088 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock/mma_layernorm_mainloop_fusion_multistage.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-14T21:02:56,091 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock/default_mma_core_sm70.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-14T21:02:56,094 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock/gemv.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-14T21:02:56,097 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock/default_multistage_trmm_complex.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-14T21:02:56,100 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock/default_mma_core_simt.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-14T21:02:56,103 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock/default_mma_with_reduction.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-14T21:02:56,105 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock/mma_singlestage.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-14T21:02:56,108 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock/mma_softmax_mainloop_fusion_multistage.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-14T21:02:56,110 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock/ell_mma_multistage.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-14T21:02:56,113 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock/default_trmm.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-14T21:02:56,116 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock/default_mma_multistage_blockwise.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-14T21:02:56,119 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock/mma_pipelined.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-14T21:02:56,121 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock/mma_planar_complex_multistage.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-14T21:02:56,124 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock/mma_multistage.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-14T21:02:56,127 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock/default_mma_core_sm80.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-14T21:02:56,131 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock/default_mma_core_sm75.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-14T21:02:56,134 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock/threadblock_swizzle_streamk.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-14T21:02:56,137 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock/default_ell_mma.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-14T21:02:56,140 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock/default_multistage_mma_complex.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-14T21:02:56,143 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock/mma_planar_complex_base.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-14T21:02:56,145 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock/mma_multistage_blockwise.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-14T21:02:56,148 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock/index_remat.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-14T21:02:56,150 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock/default_multistage_mma_complex_core.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-14T21:02:56,153 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock/default_sparse_mma.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-14T21:02:56,155 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock/default_mma_core_with_reduction.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-14T21:02:56,157 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock/default_mma_layernorm_mainloop_fusion.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-14T21:02:56,160 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock/default_mma.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-14T21:02:56,163 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock/default_mma_core.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-14T21:02:56,165 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock/mma_sparse_multistage.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-14T21:02:56,168 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock/mma_sparse_base.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-14T21:02:56,170 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock/mma_blas3_multistage.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-14T21:02:56,173 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock/mma_base.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-14T21:02:56,176 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock/default_mma_softmax_mainloop_fusion.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-14T21:02:56,178 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock/default_mma_planar_complex_multistage.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-14T21:02:56,181 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/threadblock/default_mma_planar_complex_pipelined.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/threadblock 2026-04-14T21:02:56,184 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/include/cutlass/gemm/warp 2026-04-14T21:02:56,185 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp/mma_complex_tensor_op.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/warp 2026-04-14T21:02:56,188 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp/scale_bias_tile_iterator.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/warp 2026-04-14T21:02:56,191 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp/mma_simt.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/warp 2026-04-14T21:02:56,194 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp/default_mma_complex_tensor_op.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/warp 2026-04-14T21:02:56,196 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp/mma_tensor_op_fragment_iterator.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/warp 2026-04-14T21:02:56,199 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp/mma_tensor_op_tile_iterator_wmma.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/warp 2026-04-14T21:02:56,202 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp/mma_tensor_op_policy.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/warp 2026-04-14T21:02:56,204 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp/mma.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/warp 2026-04-14T21:02:56,206 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp/mma_tensor_op_tile_iterator.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/warp 2026-04-14T21:02:56,212 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp/mma_gaussian_complex_tensor_op.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/warp 2026-04-14T21:02:56,215 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp/mma_with_reduction_tensor_op.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/warp 2026-04-14T21:02:56,217 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp/default_mma_tensor_op.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/warp 2026-04-14T21:02:56,220 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp/default_mma_tensor_op_sm80.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/warp 2026-04-14T21:02:56,223 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp/mma_tensor_op.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/warp 2026-04-14T21:02:56,225 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp/mma_tensor_op_wmma.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/warp 2026-04-14T21:02:56,228 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp/mma_tensor_op_sm70.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/warp 2026-04-14T21:02:56,231 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp/default_mma_wmma_tensor_op.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/warp 2026-04-14T21:02:56,233 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp/mma_gaussian_complex_tensor_op_tile_iterator_sm80.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/warp 2026-04-14T21:02:56,236 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp/mma_tensor_op_tile_iterator_sparse.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/warp 2026-04-14T21:02:56,238 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp/mma_tensor_op_tile_iterator_sm80.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/warp 2026-04-14T21:02:56,243 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp/default_mma_with_reduction_tensor_op.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/warp 2026-04-14T21:02:56,245 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp/layernorm_scale_bias_transform.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/warp 2026-04-14T21:02:56,247 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp/mma_mixed_input_tensor_op.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/warp 2026-04-14T21:02:56,249 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp/mma_simt_policy.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/warp 2026-04-14T21:02:56,251 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp/mma_sparse_tensor_op.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/warp 2026-04-14T21:02:56,254 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp/mma_tensor_op_fast_f32.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/warp 2026-04-14T21:02:56,256 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp/mma_planar_complex.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/warp 2026-04-14T21:02:56,258 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp/mma_tensor_op_tile_access_iterator.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/warp 2026-04-14T21:02:56,260 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp/tile_iterator_planar_complex.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/warp 2026-04-14T21:02:56,263 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp/mma_simt_tile_iterator.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/warp 2026-04-14T21:02:56,266 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp/default_mma_sparse_tensor_op.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/warp 2026-04-14T21:02:56,268 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp/mma_tensor_op_tile_iterator_sm70.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/warp 2026-04-14T21:02:56,272 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp/mma_complex_tensor_op_fast_f32.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/warp 2026-04-14T21:02:56,275 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp/softmax_scale_bias_transform.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/warp 2026-04-14T21:02:56,277 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/warp/mma_complex_tensor_op_tile_iterator_sm80.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/warp 2026-04-14T21:02:56,282 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/include/cutlass/gemm/device 2026-04-14T21:02:56,283 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/device/gemm_with_k_reduction.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/device 2026-04-14T21:02:56,286 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/device/gemm.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/device 2026-04-14T21:02:56,288 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/device/gemm_universal_base.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/device 2026-04-14T21:02:56,291 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/device/gemm_universal_with_absmax.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/device 2026-04-14T21:02:56,294 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/device/rank_k.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/device 2026-04-14T21:02:56,296 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/device/gemv_blockscaled.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/device 2026-04-14T21:02:56,298 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/device/symm.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/device 2026-04-14T21:02:56,301 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/device/trmm.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/device 2026-04-14T21:02:56,304 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/device/base_grouped.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/device 2026-04-14T21:02:56,306 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/device/gemm_layernorm_mainloop_fusion.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/device 2026-04-14T21:02:56,308 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/device/gemv.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/device 2026-04-14T21:02:56,311 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/device/gemm_sparse_universal_with_absmax.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/device 2026-04-14T21:02:56,313 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/device/rank_2k.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/device 2026-04-14T21:02:56,315 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/device/gemm_batched.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/device 2026-04-14T21:02:56,318 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/device/gemm_blockwise.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/device 2026-04-14T21:02:56,320 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/device/gemm_splitk_parallel.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/device 2026-04-14T21:02:56,323 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/device/gemm_universal_adapter.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/device 2026-04-14T21:02:56,326 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/device/gemm_sparse_universal.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/device 2026-04-14T21:02:56,328 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/device/ell_gemm.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/device 2026-04-14T21:02:56,338 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/device/gemm_grouped.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/device 2026-04-14T21:02:56,340 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/device/rank_2k_grouped.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/device 2026-04-14T21:02:56,363 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/device/gemm_complex.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/device 2026-04-14T21:02:56,365 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/device/gemm_array.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/device 2026-04-14T21:02:56,368 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/device/gemm_sparse_with_absmax.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/device 2026-04-14T21:02:56,371 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/device/gemm_universal.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/device 2026-04-14T21:02:56,373 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/device/gemm_sparse_with_visitor.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/device 2026-04-14T21:02:56,376 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/device/gemm_universal_with_broadcast.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/device 2026-04-14T21:02:56,380 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/device/default_gemm_configuration.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/device 2026-04-14T21:02:56,384 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/device/gemm_universal_streamk_with_broadcast.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/device 2026-04-14T21:02:56,387 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/device/gemm_sparse.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/device 2026-04-14T21:02:56,390 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/group_array_problem_shape.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm 2026-04-14T21:02:56,394 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/include/cutlass/gemm/thread 2026-04-14T21:02:56,396 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/thread/mma_sm50.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/thread 2026-04-14T21:02:56,399 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/thread/mma.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/thread 2026-04-14T21:02:56,402 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/thread/mma_sm60.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/thread 2026-04-14T21:02:56,406 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/thread/mma_sm61.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/thread 2026-04-14T21:02:56,411 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-14T21:02:56,412 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/sm120_blockscaled_mma_array_tma.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-14T21:02:56,417 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/sm100_blockscaled_mma_warpspecialized.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-14T21:02:56,422 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/sm120_mma_tma_blockwise_scaling.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-14T21:02:56,426 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/sm90_mma_multistage_gmma_rs_warpspecialized.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-14T21:02:56,430 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/sm120_sparse_mma_tma.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-14T21:02:56,435 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/sm90_mma_array_tma_gmma_ss_warpspecialized_fp8.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-14T21:02:56,439 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/sm100_blockscaled_mma_mixed_tma_cpasync_warpspecialized.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-14T21:02:56,444 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/sm103_blockscaled_mma_warpspecialized.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-14T21:02:56,449 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/collective_mma_decl.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-14T21:02:56,452 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/sm90_mma_tma_gmma_ss_warpspecialized.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-14T21:02:56,456 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/sm90_mma_tma_gmma_rs_warpspecialized_mixed_input.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-14T21:02:56,461 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/sm120_mma_tma.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-14T21:02:56,465 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/sm100_mma_array_warpspecialized.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-14T21:02:56,469 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/sm100_mma_array_warpspecialized_interleaved_complex_emulated.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-14T21:02:56,474 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/sm90_mma_tma_gmma_rs_warpspecialized.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-14T21:02:56,478 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/sm90_mma_array_tma_gmma_ss_warpspecialized.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-14T21:02:56,481 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/sm100_mma_array_warpspecialized_planar_complex.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-14T21:02:56,484 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/sm100_mma_warpspecialized_emulated.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-14T21:02:56,488 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/collective_builder.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-14T21:02:56,490 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/sm70_mma_twostage.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-14T21:02:56,493 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/fp8_accumulation.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-14T21:02:56,496 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/sm90_mma_multistage_gmma_ss_warpspecialized.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-14T21:02:56,499 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/sm100_mma_array_warpspecialized_rcggemm.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-14T21:02:56,503 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/sm120_blockscaled_sparse_mma_tma.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-14T21:02:56,507 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/sm120_blockscaled_mma_tma.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-14T21:02:56,510 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/sm100_mma_warpspecialized_blockwise_scaling.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-14T21:02:56,514 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/sm120_mma_array_tma_blockwise_scaling.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-14T21:02:56,517 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/sm100_blockscaled_mma_array_warpspecialized.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-14T21:02:56,521 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/sm80_mma_array_multistage.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-14T21:02:56,524 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/sm100_mma_cpasync_warpspecialized.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-14T21:02:56,527 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/sm100_mma_array_warpspecialized_interleaved_complex_tf32.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-14T21:02:56,531 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/sm90_mma_array_tma_gmma_rs_warpspecialized_mixed_input.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-14T21:02:56,535 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/sm90_mma_tma_gmma_ss.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-14T21:02:56,538 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/sm90_mma_tma_gmma_ss_warpspecialized_fp8_blockwise_scaling.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-14T21:02:56,542 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/include/cutlass/gemm/collective/builders 2026-04-14T21:02:56,544 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/builders/sm100_interleaved_complex_umma_builder.inl -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective/builders 2026-04-14T21:02:56,547 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/builders/sm100_blockwise_umma_builder.inl -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective/builders 2026-04-14T21:02:56,550 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/builders/sm90_sparse_config.inl -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective/builders 2026-04-14T21:02:56,553 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/builders/sm100_9xBF16_umma_builder.inl -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective/builders 2026-04-14T21:02:56,556 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/builders/sm100_blockscaled_sparse_umma_builder.inl -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective/builders 2026-04-14T21:02:56,558 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/builders/sm120_blockscaled_sparse_mma_builder.inl -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective/builders 2026-04-14T21:02:56,561 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/builders/sm120_common.inl -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective/builders 2026-04-14T21:02:56,564 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/builders/sm120_blockscaled_mma_builder.inl -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective/builders 2026-04-14T21:02:56,567 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/builders/sm100_9xBF16_interleaved_complex_umma_builder.inl -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective/builders 2026-04-14T21:02:56,570 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/builders/sm120_blockwise_mma_builder.inl -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective/builders 2026-04-14T21:02:56,572 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/builders/sm100_mixed_input_umma_builder.inl -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective/builders 2026-04-14T21:02:56,575 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/builders/sm90_sparse_gmma_builder.inl -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective/builders 2026-04-14T21:02:56,578 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/builders/sm100_umma_builder.inl -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective/builders 2026-04-14T21:02:56,581 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/builders/sm1xx_common.inl -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective/builders 2026-04-14T21:02:56,585 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/builders/sm90_common.inl -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective/builders 2026-04-14T21:02:56,588 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/builders/sm120_mma_builder.inl -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective/builders 2026-04-14T21:02:56,591 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/builders/sm90_gmma_builder.inl -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective/builders 2026-04-14T21:02:56,595 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/builders/sm100_blockscaled_umma_builder.inl -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective/builders 2026-04-14T21:02:56,597 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/builders/sm120_sparse_mma_builder.inl -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective/builders 2026-04-14T21:02:56,600 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/builders/sm100_simt_builder.inl -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective/builders 2026-04-14T21:02:56,603 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/builders/sm100_mixed_tma_cpasync_umma_builder.inl -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective/builders 2026-04-14T21:02:56,606 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/builders/sm100_sparse_umma_builder.inl -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective/builders 2026-04-14T21:02:56,609 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/builders/sm1xx_sparse_config.inl -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective/builders 2026-04-14T21:02:56,612 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/builders/sm100_cpasync_umma_builder.inl -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective/builders 2026-04-14T21:02:56,615 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/builders/sm100_planar_complex_umma_builder.inl -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective/builders 2026-04-14T21:02:56,617 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/builders/sm100_blockscaled_mixed_tma_cpasync_umma_builder.inl -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective/builders 2026-04-14T21:02:56,620 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/builders/sm103_blockscaled_umma_builder.inl -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective/builders 2026-04-14T21:02:56,623 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/builders/sm100_common.inl -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective/builders 2026-04-14T21:02:56,626 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/builders/sm100_pipeline_carveout.inl -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective/builders 2026-04-14T21:02:56,629 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/sm90_mma_array_tma_gmma_ss_warpspecialized_fp8_blockwise_scaling.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-14T21:02:56,632 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/sm90_sparse_mma_tma_gmma_ss_warpspecialized.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-14T21:02:56,635 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/sm80_mma_multistage.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-14T21:02:56,638 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/sm103_blockscaled_mma_array_warpspecialized.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-14T21:02:56,642 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/sm100_mma_mixed_tma_cpasync_warpspecialized.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-14T21:02:56,645 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/sm100_sparse_mma_warpspecialized.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-14T21:02:56,649 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/sm100_mma_warpspecialized_interleaved_complex_emulated.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-14T21:02:56,653 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/sm100_mma_warpspecialized_mixed_input.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-14T21:02:56,657 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/sm100_mma_warpspecialized_planar_complex.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-14T21:02:56,660 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/sm100_mma_array_warpspecialized_blockwise_scaling.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-14T21:02:56,664 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/sm100_mma_warpspecialized.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-14T21:02:56,668 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/sm100_mma_warpspecialized_interleaved_complex_tf32.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-14T21:02:56,671 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/collective_mma.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-14T21:02:56,674 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/sm100_mma_array_warpspecialized_emulated.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-14T21:02:56,678 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/sm90_sparse_mma_tma_gmma_ss_warpspecialized_fp8.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-14T21:02:56,681 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/sm90_mma_tma_gmma_ss_warpspecialized_fp8.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-14T21:02:56,684 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/sm100_blockscaled_sparse_mma_warpspecialized.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-14T21:02:56,688 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/collective_builder_decl.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-14T21:02:56,691 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/collective/sm100_blockscaled_mma_array_warpspecialized_rcggemm.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/collective 2026-04-14T21:02:56,695 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/dispatch_policy.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm 2026-04-14T21:02:56,699 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/gemm_enumerated_types.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm 2026-04-14T21:02:56,703 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-14T21:02:56,705 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/gemm_with_k_reduction.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-14T21:02:56,708 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/sm100_gemm_array_tma_warpspecialized_input_transform.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-14T21:02:56,712 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/sm90_gemm_tma_warpspecialized_pingpong.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-14T21:02:56,715 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_gemm_grouped_softmax_mainloop_fusion.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-14T21:02:56,717 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/gemm.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-14T21:02:56,720 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/static_tile_scheduler.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-14T21:02:56,722 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/sm100_tile_scheduler.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-14T21:02:56,725 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/sm90_gemm_tma_warpspecialized.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-14T21:02:56,728 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/sm90_tile_scheduler_group.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-14T21:02:56,730 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_rank_k_universal.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-14T21:02:56,733 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_gemm_universal.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-14T21:02:56,735 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/gemm_universal.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-14T21:02:56,737 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_gemm_sparse_universal_with_absmax.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-14T21:02:56,739 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_gemm_universal_with_visitor.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-14T21:02:56,742 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_symm.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-14T21:02:56,744 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/gemm_planar_complex_array.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-14T21:02:56,746 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/gemm_grouped_per_group_scale.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-14T21:02:56,749 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_gemm_grouped.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-14T21:02:56,751 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/gemm_streamk_with_fused_epilogue.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-14T21:02:56,754 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/sm90_gemm_warpspecialized_cooperative.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-14T21:02:56,757 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_gemm_with_broadcast.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-14T21:02:56,759 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_symm_universal.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-14T21:02:56,761 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/gemv_blockscaled.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-14T21:02:56,764 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_gemm_with_reduction.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-14T21:02:56,767 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/sm70_gemm.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-14T21:02:56,769 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_gemm.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-14T21:02:56,772 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_gemm_splitk_parallel.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-14T21:02:56,775 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/sm103_blockscaled_gemm_array_tma_warpspecialized.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-14T21:02:56,778 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/gemm_layernorm_mainloop_fusion.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-14T21:02:56,781 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/sparse_gemm_with_absmax.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-14T21:02:56,783 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_gemm_streamk_with_broadcast.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-14T21:02:56,786 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_gemm_sparse_with_visitor.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-14T21:02:56,788 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/gemv.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-14T21:02:56,791 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/gemm_universal_streamk.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-14T21:02:56,794 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/sm90_gemm_array_tma_warpspecialized_cooperative.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-14T21:02:56,797 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/rank_2k_grouped_problem_visitor.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-14T21:02:56,799 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_rank_2k_grouped.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-14T21:02:56,802 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_trmm_universal.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-14T21:02:56,804 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/sm100_gemm_array_tma_warpspecialized_mma_transform.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-14T21:02:56,807 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_trmm_complex.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-14T21:02:56,810 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/gemm_transpose_operands.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-14T21:02:56,812 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_gemm_sparse_with_absmax.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-14T21:02:56,814 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/gemm_sparse_universal_with_absmax.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-14T21:02:56,817 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_gemm_grouped_per_group_scale.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-14T21:02:56,820 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/rank_2k_transpose_operands.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-14T21:02:56,822 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/sm100_gemm_cpasync_warpspecialized.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-14T21:02:56,825 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/gemm_params.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-14T21:02:56,827 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_gemm_with_absmax.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-14T21:02:56,830 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/gemm_universal_decl.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-14T21:02:56,831 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/sm90_gemm_tma_warpspecialized_cooperative.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-14T21:02:56,835 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/sm103_blockscaled_gemm_tma_warpspecialized.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-14T21:02:56,838 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/gemm_pipelined.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-14T21:02:56,840 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_trmm.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-14T21:02:56,843 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/tile_scheduler.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-14T21:02:56,845 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/sm100_gemm_tma_warpspecialized.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-14T21:02:56,848 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/sm90_gemm_warpspecialized.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-14T21:02:56,851 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/sm100_static_tile_scheduler.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-14T21:02:56,853 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/gemm_planar_complex.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-14T21:02:56,856 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/params_universal_base.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-14T21:02:56,858 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/tile_scheduler_detail.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-14T21:02:56,860 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/sm100_tile_scheduler_group.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-14T21:02:56,862 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/gemm_batched.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-14T21:02:56,865 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/sm120_gemm_tma_warpspecialized_cooperative_asymmetric_dma.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-14T21:02:56,868 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/gemv_batched_strided.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-14T21:02:56,870 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_gemm_planar_complex_universal.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-14T21:02:56,872 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/gemm_blockwise.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-14T21:02:56,875 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_gemm_complex.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-14T21:02:56,877 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/gemm_splitk_parallel.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-14T21:02:56,879 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/gemm_grouped_problem_visitor.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-14T21:02:56,882 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_gemm_layernorm_mainloop_fusion.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-14T21:02:56,884 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/gemm_with_absmax.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-14T21:02:56,887 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/sm90_tile_scheduler_stream_k.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-14T21:02:56,890 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/sm100_sparse_gemm_tma_warpspecialized.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-14T21:02:56,893 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/sm100_gemm_tma_warpspecialized_mixed_input_transform.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-14T21:02:56,896 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_symm_complex.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-14T21:02:56,898 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/sm100_gemm_tma_warpspecialized_input_transform.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-14T21:02:56,901 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_rank_k_complex.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-14T21:02:56,904 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/gemm_grouped_softmax_mainloop_fusion.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-14T21:02:56,906 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/rank_2k_universal.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-14T21:02:56,909 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/gemm_universal_blockwise.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-14T21:02:56,911 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_ell_gemm.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-14T21:02:56,914 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_rank_k.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-14T21:02:56,916 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/gemm_sparse_universal.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-14T21:02:56,919 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/gemm_universal_with_visitor.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-14T21:02:56,921 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/symm_universal.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-14T21:02:56,924 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_rank_2k.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-14T21:02:56,926 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/ell_gemm.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-14T21:02:56,929 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/sm90_gemm_tma.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-14T21:02:56,931 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/gemm_grouped.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-14T21:02:56,933 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/gemm_with_fused_epilogue.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-14T21:02:56,936 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_rank_2k_universal.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-14T21:02:56,939 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/rank_2k_grouped.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-14T21:02:56,941 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/sm70_gemm_array.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-14T21:02:56,943 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_gemm_with_k_reduction.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-14T21:02:56,946 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/sm100_gemm_array_tma_warpspecialized.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-14T21:02:56,949 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/sm100_tile_scheduler_stream_k.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-14T21:02:56,952 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/tile_scheduler_params.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-14T21:02:56,955 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/sm100_gemm_mixed_tma_cpasync_warpspecialized.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-14T21:02:56,959 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/sm90_gemm_array_tma_warpspecialized_pingpong.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-14T21:02:56,962 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/gemm_array.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-14T21:02:56,965 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/sm90_gemm_warpspecialized_pingpong.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-14T21:02:56,968 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/sparse_gemm.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-14T21:02:56,970 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/gemm_universal_with_visitor_streamk.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-14T21:02:56,974 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/gemm_universal.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-14T21:02:56,977 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_gemm_sparse.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-14T21:02:56,979 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/sm90_tile_scheduler.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-14T21:02:56,982 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_rank_2k_complex.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-14T21:02:56,985 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/sm100_gemm_tma_warpspecialized_mma_transform.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-14T21:02:56,988 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_gemm_sparse_universal.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-14T21:02:56,991 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/grouped_problem_visitor.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-14T21:02:56,994 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/trmm_universal.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-14T21:02:56,997 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/sparse_gemm_with_visitor.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-14T21:02:56,999 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/rank_k_universal.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-14T21:02:57,002 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_gemv.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-14T21:02:57,004 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm/kernel/params_sparse_base.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/gemm/kernel 2026-04-14T21:02:57,007 copying build/lib/flashinfer/data/cutlass/include/cutlass/workspace.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass 2026-04-14T21:02:57,011 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/include/cutlass/transform 2026-04-14T21:02:57,013 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/include/cutlass/transform/threadblock 2026-04-14T21:02:57,014 copying build/lib/flashinfer/data/cutlass/include/cutlass/transform/threadblock/regular_tile_iterator_tensor_op_sm70.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/transform/threadblock 2026-04-14T21:02:57,018 copying build/lib/flashinfer/data/cutlass/include/cutlass/transform/threadblock/regular_tile_iterator_pitch_linear_2dthreadtile.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/transform/threadblock 2026-04-14T21:02:57,021 copying build/lib/flashinfer/data/cutlass/include/cutlass/transform/threadblock/predicated_tile_iterator_2dthreadtile.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/transform/threadblock 2026-04-14T21:02:57,024 copying build/lib/flashinfer/data/cutlass/include/cutlass/transform/threadblock/ell_iterator.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/transform/threadblock 2026-04-14T21:02:57,027 copying build/lib/flashinfer/data/cutlass/include/cutlass/transform/threadblock/predicated_scale_bias_vector_iterator.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/transform/threadblock 2026-04-14T21:02:57,029 copying build/lib/flashinfer/data/cutlass/include/cutlass/transform/threadblock/predicated_tile_access_iterator_2dthreadtile.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/transform/threadblock 2026-04-14T21:02:57,032 copying build/lib/flashinfer/data/cutlass/include/cutlass/transform/threadblock/regular_scale_bias_vector_access_iterator.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/transform/threadblock 2026-04-14T21:02:57,035 copying build/lib/flashinfer/data/cutlass/include/cutlass/transform/threadblock/regular_tile_iterator_tensor_op.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/transform/threadblock 2026-04-14T21:02:57,038 copying build/lib/flashinfer/data/cutlass/include/cutlass/transform/threadblock/predicated_tile_access_iterator_triangular_matrix.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/transform/threadblock 2026-04-14T21:02:57,042 copying build/lib/flashinfer/data/cutlass/include/cutlass/transform/threadblock/regular_tile_access_iterator.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/transform/threadblock 2026-04-14T21:02:57,044 copying build/lib/flashinfer/data/cutlass/include/cutlass/transform/threadblock/predicated_tile_access_iterator_params.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/transform/threadblock 2026-04-14T21:02:57,047 copying build/lib/flashinfer/data/cutlass/include/cutlass/transform/threadblock/regular_tile_access_iterator_pitch_linear_direct_conv.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/transform/threadblock 2026-04-14T21:02:57,050 copying build/lib/flashinfer/data/cutlass/include/cutlass/transform/threadblock/regular_tile_iterator_pitch_linear.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/transform/threadblock 2026-04-14T21:02:57,053 copying build/lib/flashinfer/data/cutlass/include/cutlass/transform/threadblock/predicated_tile_iterator.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/transform/threadblock 2026-04-14T21:02:57,057 copying build/lib/flashinfer/data/cutlass/include/cutlass/transform/threadblock/vector_iterator.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/transform/threadblock 2026-04-14T21:02:57,059 copying build/lib/flashinfer/data/cutlass/include/cutlass/transform/threadblock/ell_predicated_tile_access_iterator.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/transform/threadblock 2026-04-14T21:02:57,062 copying build/lib/flashinfer/data/cutlass/include/cutlass/transform/threadblock/predicated_vector_access_iterator.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/transform/threadblock 2026-04-14T21:02:57,065 copying build/lib/flashinfer/data/cutlass/include/cutlass/transform/threadblock/regular_tile_access_iterator_tensor_op_sm80.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/transform/threadblock 2026-04-14T21:02:57,069 copying build/lib/flashinfer/data/cutlass/include/cutlass/transform/threadblock/regular_tile_access_iterator_tensor_op.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/transform/threadblock 2026-04-14T21:02:57,072 copying build/lib/flashinfer/data/cutlass/include/cutlass/transform/threadblock/ell_predicated_tile_iterator.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/transform/threadblock 2026-04-14T21:02:57,075 copying build/lib/flashinfer/data/cutlass/include/cutlass/transform/threadblock/predicated_scale_bias_vector_access_iterator.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/transform/threadblock 2026-04-14T21:02:57,078 copying build/lib/flashinfer/data/cutlass/include/cutlass/transform/threadblock/predicated_tile_access_iterator.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/transform/threadblock 2026-04-14T21:02:57,082 copying build/lib/flashinfer/data/cutlass/include/cutlass/transform/threadblock/regular_tile_iterator.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/transform/threadblock 2026-04-14T21:02:57,085 copying build/lib/flashinfer/data/cutlass/include/cutlass/transform/threadblock/regular_tile_access_iterator_pitch_linear.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/transform/threadblock 2026-04-14T21:02:57,088 copying build/lib/flashinfer/data/cutlass/include/cutlass/transform/threadblock/predicated_tile_iterator_triangular_matrix.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/transform/threadblock 2026-04-14T21:02:57,091 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/include/cutlass/transform/warp 2026-04-14T21:02:57,093 copying build/lib/flashinfer/data/cutlass/include/cutlass/transform/warp/vector_fragment_iterator.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/transform/warp 2026-04-14T21:02:57,096 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/include/cutlass/transform/device 2026-04-14T21:02:57,098 copying build/lib/flashinfer/data/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/transform/device 2026-04-14T21:02:57,101 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/include/cutlass/transform/thread 2026-04-14T21:02:57,102 copying build/lib/flashinfer/data/cutlass/include/cutlass/transform/thread/transpose.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/transform/thread 2026-04-14T21:02:57,104 copying build/lib/flashinfer/data/cutlass/include/cutlass/transform/thread/unary_op.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/transform/thread 2026-04-14T21:02:57,107 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/include/cutlass/transform/collective 2026-04-14T21:02:57,109 copying build/lib/flashinfer/data/cutlass/include/cutlass/transform/collective/sm90_wgmma_transpose.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/transform/collective 2026-04-14T21:02:57,112 copying build/lib/flashinfer/data/cutlass/include/cutlass/transform/pitch_linear_thread_map.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/transform 2026-04-14T21:02:57,116 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/include/cutlass/transform/kernel 2026-04-14T21:02:57,117 copying build/lib/flashinfer/data/cutlass/include/cutlass/transform/kernel/filter_format_transformer.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/transform/kernel 2026-04-14T21:02:57,120 copying build/lib/flashinfer/data/cutlass/include/cutlass/transform/kernel/sparse_gemm_compressor.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/transform/kernel 2026-04-14T21:02:57,123 copying build/lib/flashinfer/data/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/transform/kernel 2026-04-14T21:02:57,126 copying build/lib/flashinfer/data/cutlass/include/cutlass/wmma_array.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass 2026-04-14T21:02:57,129 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/include/cutlass/conv 2026-04-14T21:02:57,131 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-14T21:02:57,133 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock/conv2d_dgrad_filter_tile_access_iterator_optimized.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-14T21:02:57,135 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock/conv2d_dgrad_output_gradient_tile_access_iterator_analytic.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-14T21:02:57,138 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock/conv2d_tile_iterator.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-14T21:02:57,140 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock/conv3d_dgrad_filter_tile_access_iterator_analytic.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-14T21:02:57,142 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock/implicit_gemm_fprop_fusion_multistage.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-14T21:02:57,145 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock/conv3d_fprop_filter_tile_access_iterator_optimized.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-14T21:02:57,148 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock/conv3d_fprop_filter_tile_access_iterator_analytic.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-14T21:02:57,150 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock/predicated_scale_bias_vector_iterator.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-14T21:02:57,152 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock/threadblock_swizzle.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-14T21:02:57,154 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock/conv2d_dgrad_output_gradient_tile_access_iterator_optimized.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-14T21:02:57,157 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock/conv3d_dgrad_output_gradient_tile_access_iterator_optimized.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-14T21:02:57,159 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock/conv3d_wgrad_activation_tile_access_iterator_optimized.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-14T21:02:57,162 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock/conv2d_fprop_activation_tile_access_iterator_fixed_channels.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-14T21:02:57,164 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock/depthwise_fprop_pipelined.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-14T21:02:57,167 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock/conv2d_fprop_activation_tile_access_iterator_few_channels.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-14T21:02:57,169 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock/conv3d_wgrad_activation_tile_access_iterator_analytic.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-14T21:02:57,171 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock/depthwise_fprop_direct_conv_multistage.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-14T21:02:57,174 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock/conv3d_fprop_activation_tile_access_iterator_optimized.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-14T21:02:57,176 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock/conv2d_fprop_activation_tile_access_iterator_optimized.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-14T21:02:57,179 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock/conv2d_fprop_filter_tile_access_iterator_few_channels.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-14T21:02:57,181 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock/depthwise_mma_core_with_lane_access_size.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-14T21:02:57,184 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock/depthwise_mma_base.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-14T21:02:57,186 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock/conv3d_params.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-14T21:02:57,189 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock/implicit_gemm_pipelined.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-14T21:02:57,191 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock/conv2d_fprop_filter_tile_access_iterator_analytic.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-14T21:02:57,193 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock/conv2d_fprop_filter_tile_access_iterator_fixed_channels.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-14T21:02:57,196 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock/depthwise_fprop_filter_tile_access_iterator_direct_conv_optimized.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-14T21:02:57,198 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock/conv2d_wgrad_output_gradient_tile_access_iterator_analytic.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-14T21:02:57,200 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock/depthwise_fprop_activation_tile_access_iterator_direct_conv_fixed_stride_dilation.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-14T21:02:57,202 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock/conv3d_wgrad_output_gradient_tile_access_iterator_analytic.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-14T21:02:57,205 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock/depthwise_direct_conv_params.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-14T21:02:57,207 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock/conv2d_fprop_activation_tile_access_iterator_analytic.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-14T21:02:57,209 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock/conv3d_fprop_activation_tile_access_iterator_analytic.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-14T21:02:57,212 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock/conv2d_wgrad_activation_tile_access_iterator_analytic.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-14T21:02:57,214 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock/conv2d_params.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-14T21:02:57,217 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock/predicated_scale_bias_vector_access_iterator.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-14T21:02:57,219 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock/conv2d_wgrad_activation_tile_access_iterator_optimized.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-14T21:02:57,222 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock/implicit_gemm_multistage.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-14T21:02:57,225 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock/conv2d_wgrad_output_gradient_tile_access_iterator_optimized.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-14T21:02:57,227 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock/conv3d_wgrad_output_gradient_tile_access_iterator_optimized.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-14T21:02:57,229 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock/conv3d_dgrad_filter_tile_access_iterator_optimized.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-14T21:02:57,232 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock/depthwise_fprop_activation_tile_access_iterator_direct_conv_optimized.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-14T21:02:57,234 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock/conv2d_fprop_filter_tile_access_iterator_optimized.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-14T21:02:57,237 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock/conv3d_dgrad_output_gradient_tile_access_iterator_analytic.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-14T21:02:57,239 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock/implicit_gemm_wgrad_fusion_multistage.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-14T21:02:57,242 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/threadblock/conv2d_dgrad_filter_tile_access_iterator_analytic.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/threadblock 2026-04-14T21:02:57,245 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/include/cutlass/conv/warp 2026-04-14T21:02:57,246 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/warp/mma_depthwise_simt_tile_iterator.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/warp 2026-04-14T21:02:57,249 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/warp/scale_bias_relu_transform.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/warp 2026-04-14T21:02:57,251 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/warp/mma_depthwise_simt.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/warp 2026-04-14T21:02:57,254 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/detail.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv 2026-04-14T21:02:57,257 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/include/cutlass/conv/device 2026-04-14T21:02:57,258 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/device/direct_convolution.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/device 2026-04-14T21:02:57,260 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/device/implicit_gemm_convolution_fusion.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/device 2026-04-14T21:02:57,263 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/device/implicit_gemm_convolution.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/device 2026-04-14T21:02:57,265 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/device/conv_universal_adapter.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/device 2026-04-14T21:02:57,268 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/conv2d_problem_size.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv 2026-04-14T21:02:57,270 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/convolution.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv 2026-04-14T21:02:57,273 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/include/cutlass/conv/thread 2026-04-14T21:02:57,274 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/thread/depthwise_mma.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/thread 2026-04-14T21:02:57,277 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/include/cutlass/conv/collective 2026-04-14T21:02:57,278 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/collective/detail.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/collective 2026-04-14T21:02:57,281 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/collective/sm100_implicit_gemm_umma_warpspecialized.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/collective 2026-04-14T21:02:57,284 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/collective/sm90_implicit_gemm_gmma_ss_warpspecialized.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/collective 2026-04-14T21:02:57,286 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/collective/collective_builder.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/collective 2026-04-14T21:02:57,288 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/collective/collective_conv.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/collective 2026-04-14T21:02:57,291 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/include/cutlass/conv/collective/builders 2026-04-14T21:02:57,292 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/collective/builders/sm100_umma_builder.inl -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/collective/builders 2026-04-14T21:02:57,294 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/collective/builders/sm90_common.inl -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/collective/builders 2026-04-14T21:02:57,297 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/collective/builders/sm90_gmma_builder.inl -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/collective/builders 2026-04-14T21:02:57,299 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/collective/builders/sm100_common.inl -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/collective/builders 2026-04-14T21:02:57,302 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/dispatch_policy.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv 2026-04-14T21:02:57,304 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/conv3d_problem_size.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv 2026-04-14T21:02:57,307 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/include/cutlass/conv/kernel 2026-04-14T21:02:57,308 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/kernel/default_deconv3d_with_broadcast.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/kernel 2026-04-14T21:02:57,311 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/kernel/default_deconv2d.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/kernel 2026-04-14T21:02:57,313 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/kernel/direct_convolution.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/kernel 2026-04-14T21:02:57,316 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/kernel/implicit_gemm_convolution_fusion.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/kernel 2026-04-14T21:02:57,319 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/kernel/default_conv2d_fprop_with_reduction.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/kernel 2026-04-14T21:02:57,321 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/kernel/implicit_gemm_convolution.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/kernel 2026-04-14T21:02:57,324 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/kernel/sm100_implicit_gemm_tma_warpspecialized.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/kernel 2026-04-14T21:02:57,327 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/kernel/default_conv2d_fprop_with_broadcast.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/kernel 2026-04-14T21:02:57,329 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/kernel/default_conv2d_wgrad.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/kernel 2026-04-14T21:02:57,332 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/kernel/implicit_gemm_convolution_with_absmax.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/kernel 2026-04-14T21:02:57,335 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/kernel/conv_universal.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/kernel 2026-04-14T21:02:57,336 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/kernel/default_conv3d_dgrad.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/kernel 2026-04-14T21:02:57,339 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/kernel/default_conv2d.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/kernel 2026-04-14T21:02:57,341 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/kernel/default_conv3d_fprop.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/kernel 2026-04-14T21:02:57,344 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/kernel/default_deconv3d.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/kernel 2026-04-14T21:02:57,346 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/kernel/default_conv3d_fprop_fusion.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/kernel 2026-04-14T21:02:57,349 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/kernel/implicit_gemm_convolution_with_fused_epilogue.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/kernel 2026-04-14T21:02:57,351 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/kernel/implicit_gemm_convolution_strided_dgrad.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/kernel 2026-04-14T21:02:57,353 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/kernel/default_conv2d_wgrad_fusion.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/kernel 2026-04-14T21:02:57,356 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/kernel/sm90_implicit_gemm_tma_warpspecialized.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/kernel 2026-04-14T21:02:57,358 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/kernel/default_conv2d_fprop_with_absmax.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/kernel 2026-04-14T21:02:57,360 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/kernel/default_deconv2d_with_broadcast.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/kernel 2026-04-14T21:02:57,362 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/kernel/default_conv2d_fprop.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/kernel 2026-04-14T21:02:57,365 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/kernel/default_conv2d_group_fprop.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/kernel 2026-04-14T21:02:57,368 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/kernel/default_conv3d_wgrad.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/kernel 2026-04-14T21:02:57,370 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/kernel/default_conv3d_fprop_with_broadcast.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/kernel 2026-04-14T21:02:57,373 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/kernel/default_depthwise_fprop.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/kernel 2026-04-14T21:02:57,375 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/kernel/default_conv2d_dgrad.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/kernel 2026-04-14T21:02:57,378 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/kernel/default_conv2d_fprop_fusion.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv/kernel 2026-04-14T21:02:57,381 copying build/lib/flashinfer/data/cutlass/include/cutlass/conv/convnd_problem_shape.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/conv 2026-04-14T21:02:57,383 copying build/lib/flashinfer/data/cutlass/include/cutlass/complex.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass 2026-04-14T21:02:57,386 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/include/cutlass/thread 2026-04-14T21:02:57,387 copying build/lib/flashinfer/data/cutlass/include/cutlass/thread/matrix.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/thread 2026-04-14T21:02:57,390 copying build/lib/flashinfer/data/cutlass/include/cutlass/half.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass 2026-04-14T21:02:57,392 copying build/lib/flashinfer/data/cutlass/include/cutlass/array_planar_complex.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass 2026-04-14T21:02:57,394 copying build/lib/flashinfer/data/cutlass/include/cutlass/tensor_view.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass 2026-04-14T21:02:57,397 copying build/lib/flashinfer/data/cutlass/include/cutlass/numeric_types.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass 2026-04-14T21:02:57,399 copying build/lib/flashinfer/data/cutlass/include/cutlass/matrix_shape.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass 2026-04-14T21:02:57,401 copying build/lib/flashinfer/data/cutlass/include/cutlass/quaternion.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass 2026-04-14T21:02:57,404 copying build/lib/flashinfer/data/cutlass/include/cutlass/float8.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass 2026-04-14T21:02:57,407 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/include/cutlass/detail 2026-04-14T21:02:57,409 copying build/lib/flashinfer/data/cutlass/include/cutlass/detail/mainloop_fusion_helper_scale_factor.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/detail 2026-04-14T21:02:57,411 copying build/lib/flashinfer/data/cutlass/include/cutlass/detail/collective.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/detail 2026-04-14T21:02:57,413 copying build/lib/flashinfer/data/cutlass/include/cutlass/detail/sm100_mixed_dtype_blockwise_layout.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/detail 2026-04-14T21:02:57,415 copying build/lib/flashinfer/data/cutlass/include/cutlass/detail/dependent_false.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/detail 2026-04-14T21:02:57,417 copying build/lib/flashinfer/data/cutlass/include/cutlass/detail/blockwise_scale_layout.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/detail 2026-04-14T21:02:57,420 copying build/lib/flashinfer/data/cutlass/include/cutlass/detail/mma.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/detail 2026-04-14T21:02:57,421 copying build/lib/flashinfer/data/cutlass/include/cutlass/detail/layout.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/detail 2026-04-14T21:02:57,424 copying build/lib/flashinfer/data/cutlass/include/cutlass/detail/sm103_blockscaled_layout.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/detail 2026-04-14T21:02:57,427 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/include/cutlass/detail/collective 2026-04-14T21:02:57,428 copying build/lib/flashinfer/data/cutlass/include/cutlass/detail/collective/moe_stride_utils.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/detail/collective 2026-04-14T21:02:57,430 copying build/lib/flashinfer/data/cutlass/include/cutlass/detail/collective/sm103_kernel_type.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/detail/collective 2026-04-14T21:02:57,432 copying build/lib/flashinfer/data/cutlass/include/cutlass/detail/collective/mixed_input_utils.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/detail/collective 2026-04-14T21:02:57,434 copying build/lib/flashinfer/data/cutlass/include/cutlass/detail/sm100_tmem_helper.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/detail 2026-04-14T21:02:57,436 copying build/lib/flashinfer/data/cutlass/include/cutlass/detail/cluster.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/detail 2026-04-14T21:02:57,439 copying build/lib/flashinfer/data/cutlass/include/cutlass/detail/sm100_blockscaled_layout.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/detail 2026-04-14T21:02:57,441 copying build/lib/flashinfer/data/cutlass/include/cutlass/detail/helper_macros.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/detail 2026-04-14T21:02:57,443 copying build/lib/flashinfer/data/cutlass/include/cutlass/blas3.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass 2026-04-14T21:02:57,445 copying build/lib/flashinfer/data/cutlass/include/cutlass/fast_math.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass 2026-04-14T21:02:57,448 copying build/lib/flashinfer/data/cutlass/include/cutlass/tfloat32.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass 2026-04-14T21:02:57,451 copying build/lib/flashinfer/data/cutlass/include/cutlass/array_subbyte.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass 2026-04-14T21:02:57,453 copying build/lib/flashinfer/data/cutlass/include/cutlass/bfloat16.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass 2026-04-14T21:02:57,457 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/include/cutlass/reduction 2026-04-14T21:02:57,458 copying build/lib/flashinfer/data/cutlass/include/cutlass/reduction/threadblock_swizzle.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/reduction 2026-04-14T21:02:57,460 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/include/cutlass/reduction/device 2026-04-14T21:02:57,461 copying build/lib/flashinfer/data/cutlass/include/cutlass/reduction/device/tensor_reduce_affine_contiguous.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/reduction/device 2026-04-14T21:02:57,464 copying build/lib/flashinfer/data/cutlass/include/cutlass/reduction/device/tensor_reduce.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/reduction/device 2026-04-14T21:02:57,466 copying build/lib/flashinfer/data/cutlass/include/cutlass/reduction/device/tensor_reduce_affine_strided.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/reduction/device 2026-04-14T21:02:57,469 copying build/lib/flashinfer/data/cutlass/include/cutlass/reduction/device/reduce_split_k.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/reduction/device 2026-04-14T21:02:57,472 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/include/cutlass/reduction/thread 2026-04-14T21:02:57,473 copying build/lib/flashinfer/data/cutlass/include/cutlass/reduction/thread/reduction_operators.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/reduction/thread 2026-04-14T21:02:57,475 copying build/lib/flashinfer/data/cutlass/include/cutlass/reduction/thread/reduce.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/reduction/thread 2026-04-14T21:02:57,478 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/include/cutlass/reduction/kernel 2026-04-14T21:02:57,479 copying build/lib/flashinfer/data/cutlass/include/cutlass/reduction/kernel/tensor_reduce_affine_contiguous.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/reduction/kernel 2026-04-14T21:02:57,482 copying build/lib/flashinfer/data/cutlass/include/cutlass/reduction/kernel/reduce_softmax_final.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/reduction/kernel 2026-04-14T21:02:57,485 copying build/lib/flashinfer/data/cutlass/include/cutlass/reduction/kernel/tensor_reduce_affine_strided.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/reduction/kernel 2026-04-14T21:02:57,487 copying build/lib/flashinfer/data/cutlass/include/cutlass/reduction/kernel/reduce_split_k.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/reduction/kernel 2026-04-14T21:02:57,490 copying build/lib/flashinfer/data/cutlass/include/cutlass/relatively_equal.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass 2026-04-14T21:02:57,492 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm_coord.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass 2026-04-14T21:02:57,495 copying build/lib/flashinfer/data/cutlass/include/cutlass/barrier.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass 2026-04-14T21:02:57,497 copying build/lib/flashinfer/data/cutlass/include/cutlass/cuda_host_adapter.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass 2026-04-14T21:02:57,500 copying build/lib/flashinfer/data/cutlass/include/cutlass/predicate_vector.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass 2026-04-14T21:02:57,502 copying build/lib/flashinfer/data/cutlass/include/cutlass/core_io.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass 2026-04-14T21:02:57,504 copying build/lib/flashinfer/data/cutlass/include/cutlass/pitch_linear_coord.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass 2026-04-14T21:02:57,507 copying build/lib/flashinfer/data/cutlass/include/cutlass/matrix_coord.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass 2026-04-14T21:02:57,509 copying build/lib/flashinfer/data/cutlass/include/cutlass/kernel_launch.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass 2026-04-14T21:02:57,512 creating build/bdist.linux-armv7l/wheel/flashinfer/data/cutlass/include/cutlass/pipeline 2026-04-14T21:02:57,513 copying build/lib/flashinfer/data/cutlass/include/cutlass/pipeline/pipeline.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/pipeline 2026-04-14T21:02:57,515 copying build/lib/flashinfer/data/cutlass/include/cutlass/pipeline/sm90_pipeline.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/pipeline 2026-04-14T21:02:57,518 copying build/lib/flashinfer/data/cutlass/include/cutlass/pipeline/sm100_pipeline.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass/pipeline 2026-04-14T21:02:57,521 copying build/lib/flashinfer/data/cutlass/include/cutlass/trace.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass 2026-04-14T21:02:57,523 copying build/lib/flashinfer/data/cutlass/include/cutlass/gemm_coord.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/cutlass/include/cutlass 2026-04-14T21:02:57,526 creating build/bdist.linux-armv7l/wheel/flashinfer/data/include 2026-04-14T21:02:57,528 creating build/bdist.linux-armv7l/wheel/flashinfer/data/include/flashinfer 2026-04-14T21:02:57,529 copying build/lib/flashinfer/data/include/flashinfer/topk.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer 2026-04-14T21:02:57,534 copying build/lib/flashinfer/data/include/flashinfer/fastdiv.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer 2026-04-14T21:02:57,536 creating build/bdist.linux-armv7l/wheel/flashinfer/data/include/flashinfer/norm 2026-04-14T21:02:57,537 copying build/lib/flashinfer/data/include/flashinfer/norm/ln_fwd_silu_kernel.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/norm 2026-04-14T21:02:57,540 copying build/lib/flashinfer/data/include/flashinfer/norm/ln_silu_headers.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/norm 2026-04-14T21:02:57,543 copying build/lib/flashinfer/data/include/flashinfer/math.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer 2026-04-14T21:02:57,545 copying build/lib/flashinfer/data/include/flashinfer/cubin_loader.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer 2026-04-14T21:02:57,548 creating build/bdist.linux-armv7l/wheel/flashinfer/data/include/flashinfer/flat 2026-04-14T21:02:57,549 creating build/bdist.linux-armv7l/wheel/flashinfer/data/include/flashinfer/flat/ampere 2026-04-14T21:02:57,551 creating build/bdist.linux-armv7l/wheel/flashinfer/data/include/flashinfer/flat/ampere/collective 2026-04-14T21:02:57,552 copying build/lib/flashinfer/data/include/flashinfer/flat/ampere/collective/flat_collective_inverse.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/flat/ampere/collective 2026-04-14T21:02:57,555 copying build/lib/flashinfer/data/include/flashinfer/flat/ampere/collective/flat_collective_load.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/flat/ampere/collective 2026-04-14T21:02:57,557 copying build/lib/flashinfer/data/include/flashinfer/flat/unused.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/flat 2026-04-14T21:02:57,559 creating build/bdist.linux-armv7l/wheel/flashinfer/data/include/flashinfer/flat/prefill 2026-04-14T21:02:57,560 copying build/lib/flashinfer/data/include/flashinfer/flat/prefill/prefill_kernel.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/flat/prefill 2026-04-14T21:02:57,562 copying build/lib/flashinfer/data/include/flashinfer/flat/prefill/prefill_kernel_delta_rule_sm90.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/flat/prefill 2026-04-14T21:02:57,565 creating build/bdist.linux-armv7l/wheel/flashinfer/data/include/flashinfer/flat/hopper 2026-04-14T21:02:57,567 creating build/bdist.linux-armv7l/wheel/flashinfer/data/include/flashinfer/flat/hopper/device 2026-04-14T21:02:57,568 copying build/lib/flashinfer/data/include/flashinfer/flat/hopper/device/device_universal.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/flat/hopper/device 2026-04-14T21:02:57,571 creating build/bdist.linux-armv7l/wheel/flashinfer/data/include/flashinfer/flat/hopper/collective 2026-04-14T21:02:57,572 copying build/lib/flashinfer/data/include/flashinfer/flat/hopper/collective/flat_collective_store.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/flat/hopper/collective 2026-04-14T21:02:57,575 copying build/lib/flashinfer/data/include/flashinfer/flat/hopper/collective/flat_named_barriers.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/flat/hopper/collective 2026-04-14T21:02:57,577 copying build/lib/flashinfer/data/include/flashinfer/flat/hopper/collective/flat_collective_load.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/flat/hopper/collective 2026-04-14T21:02:57,579 copying build/lib/flashinfer/data/include/flashinfer/flat/hopper/collective/flat_common.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/flat/hopper/collective 2026-04-14T21:02:57,582 copying build/lib/flashinfer/data/include/flashinfer/flat/hopper/collective/flat_collective_tma_warpspecialized_delta_rule.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/flat/hopper/collective 2026-04-14T21:02:57,586 creating build/bdist.linux-armv7l/wheel/flashinfer/data/include/flashinfer/flat/hopper/kernel 2026-04-14T21:02:57,587 copying build/lib/flashinfer/data/include/flashinfer/flat/hopper/kernel/flat_kernel_tma_warpspecialized_delta_rule.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/flat/hopper/kernel 2026-04-14T21:02:57,589 copying build/lib/flashinfer/data/include/flashinfer/flat/hopper/kernel/flat_options.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/flat/hopper/kernel 2026-04-14T21:02:57,591 copying build/lib/flashinfer/data/include/flashinfer/flat/hopper/kernel/flat_kernel_builder_delta_rule.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/flat/hopper/kernel 2026-04-14T21:02:57,593 copying build/lib/flashinfer/data/include/flashinfer/flat/hopper/kernel/flat_tile_scheduler.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/flat/hopper/kernel 2026-04-14T21:02:57,595 copying build/lib/flashinfer/data/include/flashinfer/flat/type_traits.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/flat 2026-04-14T21:02:57,597 copying build/lib/flashinfer/data/include/flashinfer/flat/cute_ext.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/flat 2026-04-14T21:02:57,599 copying build/lib/flashinfer/data/include/flashinfer/flat/math.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/flat 2026-04-14T21:02:57,601 copying build/lib/flashinfer/data/include/flashinfer/flat/math_order_barrier.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/flat 2026-04-14T21:02:57,603 copying build/lib/flashinfer/data/include/flashinfer/flat/common.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/flat 2026-04-14T21:02:57,605 copying build/lib/flashinfer/data/include/flashinfer/flat/debug.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/flat 2026-04-14T21:02:57,607 copying build/lib/flashinfer/data/include/flashinfer/fp16.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer 2026-04-14T21:02:57,609 copying build/lib/flashinfer/data/include/flashinfer/vec_dtypes.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer 2026-04-14T21:02:57,612 copying build/lib/flashinfer/data/include/flashinfer/cutlass_utils.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer 2026-04-14T21:02:57,614 copying build/lib/flashinfer/data/include/flashinfer/allocator.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer 2026-04-14T21:02:57,616 copying build/lib/flashinfer/data/include/flashinfer/activation.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer 2026-04-14T21:02:57,618 copying build/lib/flashinfer/data/include/flashinfer/exception.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer 2026-04-14T21:02:57,620 copying build/lib/flashinfer/data/include/flashinfer/utils.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer 2026-04-14T21:02:57,623 copying build/lib/flashinfer/data/include/flashinfer/quantization.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer 2026-04-14T21:02:57,625 copying build/lib/flashinfer/data/include/flashinfer/norm.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer 2026-04-14T21:02:57,629 creating build/bdist.linux-armv7l/wheel/flashinfer/data/include/flashinfer/gemm 2026-04-14T21:02:57,630 copying build/lib/flashinfer/data/include/flashinfer/gemm/mxfp8_gemm_template_sm100.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/gemm 2026-04-14T21:02:57,633 copying build/lib/flashinfer/data/include/flashinfer/gemm/tgv_gemm_template.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/gemm 2026-04-14T21:02:57,635 copying build/lib/flashinfer/data/include/flashinfer/gemm/tgv_gemm_configs.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/gemm 2026-04-14T21:02:57,637 copying build/lib/flashinfer/data/include/flashinfer/gemm/bmm_fp8.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/gemm 2026-04-14T21:02:57,640 copying build/lib/flashinfer/data/include/flashinfer/gemm/gemm_groupwise_sm100.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/gemm 2026-04-14T21:02:57,642 copying build/lib/flashinfer/data/include/flashinfer/gemm/group_gemm_fp8_groupwise_sm100.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/gemm 2026-04-14T21:02:57,645 copying build/lib/flashinfer/data/include/flashinfer/gemm/fp8_gemm_cutlass_template.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/gemm 2026-04-14T21:02:57,647 copying build/lib/flashinfer/data/include/flashinfer/gemm/mxfp8_gemm_template_sm120.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/gemm 2026-04-14T21:02:57,650 copying build/lib/flashinfer/data/include/flashinfer/gemm/fp4_gemm_template_sm120.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/gemm 2026-04-14T21:02:57,652 copying build/lib/flashinfer/data/include/flashinfer/gemm/mxfp8_gemm_cutlass_template_sm120.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/gemm 2026-04-14T21:02:57,654 copying build/lib/flashinfer/data/include/flashinfer/gemm/fp8_gemm_cutlass.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/gemm 2026-04-14T21:02:57,656 copying build/lib/flashinfer/data/include/flashinfer/gemm/mxfp8_gemm_cutlass.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/gemm 2026-04-14T21:02:57,658 copying build/lib/flashinfer/data/include/flashinfer/gemm/fp4_gemm_template_sm100.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/gemm 2026-04-14T21:02:57,661 copying build/lib/flashinfer/data/include/flashinfer/gemm/group_gemm_fp8_groupwise_sm120.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/gemm 2026-04-14T21:02:57,663 copying build/lib/flashinfer/data/include/flashinfer/gemm/fp4_gemm_cutlass.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/gemm 2026-04-14T21:02:57,665 copying build/lib/flashinfer/data/include/flashinfer/gemm/group_gemm_mxfp4_groupwise_sm100.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/gemm 2026-04-14T21:02:57,668 copying build/lib/flashinfer/data/include/flashinfer/gemm/group_gemm_mxfp4_groupwise_sm120.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/gemm 2026-04-14T21:02:57,671 copying build/lib/flashinfer/data/include/flashinfer/gemm/bf16_gemm_cutlass.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/gemm 2026-04-14T21:02:57,672 copying build/lib/flashinfer/data/include/flashinfer/gemm/group_gemv.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/gemm 2026-04-14T21:02:57,675 copying build/lib/flashinfer/data/include/flashinfer/gemm/fp4_gemm_template_sm103.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/gemm 2026-04-14T21:02:57,677 copying build/lib/flashinfer/data/include/flashinfer/gemm/group_gemm.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/gemm 2026-04-14T21:02:57,680 copying build/lib/flashinfer/data/include/flashinfer/gemm/gemm_groupwise_sm120.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/gemm 2026-04-14T21:02:57,682 copying build/lib/flashinfer/data/include/flashinfer/gemm/bf16_gemm_template_sm100.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/gemm 2026-04-14T21:02:57,685 copying build/lib/flashinfer/data/include/flashinfer/gemm/fp4_gemm_cutlass_template_sm103.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/gemm 2026-04-14T21:02:57,687 copying build/lib/flashinfer/data/include/flashinfer/gemm/fp4_gemm_cutlass_template.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/gemm 2026-04-14T21:02:57,690 copying build/lib/flashinfer/data/include/flashinfer/gemm/fp8_gemm_template_sm100.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/gemm 2026-04-14T21:02:57,692 copying build/lib/flashinfer/data/include/flashinfer/gemm/mxfp8_gemm_cutlass_template.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/gemm 2026-04-14T21:02:57,694 copying build/lib/flashinfer/data/include/flashinfer/gemm/group_gemm_nvfp4_groupwise_sm120.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/gemm 2026-04-14T21:02:57,697 copying build/lib/flashinfer/data/include/flashinfer/gemm/cutlass_gemm_configs.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/gemm 2026-04-14T21:02:57,700 copying build/lib/flashinfer/data/include/flashinfer/gemm/fp4_gemm_cutlass_template_sm120.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/gemm 2026-04-14T21:02:57,702 copying build/lib/flashinfer/data/include/flashinfer/gemm/group_gemm_lora.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/gemm 2026-04-14T21:02:57,704 copying build/lib/flashinfer/data/include/flashinfer/gemm/tgv_gemm.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/gemm 2026-04-14T21:02:57,707 copying build/lib/flashinfer/data/include/flashinfer/gemm/dsv3_router_gemm.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/gemm 2026-04-14T21:02:57,710 copying build/lib/flashinfer/data/include/flashinfer/gemm/group_gemm_sm90.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/gemm 2026-04-14T21:02:57,712 copying build/lib/flashinfer/data/include/flashinfer/gemm/bf16_gemm_cutlass_template.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/gemm 2026-04-14T21:02:57,714 copying build/lib/flashinfer/data/include/flashinfer/attention_impl.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer 2026-04-14T21:02:57,716 copying build/lib/flashinfer/data/include/flashinfer/arch_condition.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer 2026-04-14T21:02:57,718 copying build/lib/flashinfer/data/include/flashinfer/sampling.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer 2026-04-14T21:02:57,724 copying build/lib/flashinfer/data/include/flashinfer/fp4_layout.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer 2026-04-14T21:02:57,726 creating build/bdist.linux-armv7l/wheel/flashinfer/data/include/flashinfer/comm 2026-04-14T21:02:57,728 copying build/lib/flashinfer/data/include/flashinfer/comm/trtllm_allreduce_fusion.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/comm 2026-04-14T21:02:57,731 copying build/lib/flashinfer/data/include/flashinfer/comm/trtllm_allreduce.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/comm 2026-04-14T21:02:57,734 copying build/lib/flashinfer/data/include/flashinfer/comm/trtllm_moe_allreduce_fusion.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/comm 2026-04-14T21:02:57,738 copying build/lib/flashinfer/data/include/flashinfer/comm/vllm_custom_all_reduce.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/comm 2026-04-14T21:02:57,741 copying build/lib/flashinfer/data/include/flashinfer/comm/trtllm_mnnvl_allreduce.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/comm 2026-04-14T21:02:57,744 copying build/lib/flashinfer/data/include/flashinfer/comm/trtllm_alltoall.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/comm 2026-04-14T21:02:57,747 copying build/lib/flashinfer/data/include/flashinfer/comm/trtllm_alltoall_prepare.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/comm 2026-04-14T21:02:57,750 copying build/lib/flashinfer/data/include/flashinfer/pos_enc.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer 2026-04-14T21:02:57,753 copying build/lib/flashinfer/data/include/flashinfer/layout.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer 2026-04-14T21:02:57,756 copying build/lib/flashinfer/data/include/flashinfer/air_top_p.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer 2026-04-14T21:02:57,758 copying build/lib/flashinfer/data/include/flashinfer/permuted_smem.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer 2026-04-14T21:02:57,761 copying build/lib/flashinfer/data/include/flashinfer/concat_mla.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer 2026-04-14T21:02:57,764 creating build/bdist.linux-armv7l/wheel/flashinfer/data/include/flashinfer/mamba 2026-04-14T21:02:57,766 copying build/lib/flashinfer/data/include/flashinfer/mamba/seq_chunk_cumsum.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/mamba 2026-04-14T21:02:57,768 copying build/lib/flashinfer/data/include/flashinfer/mamba/create_tensor_map.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/mamba 2026-04-14T21:02:57,771 copying build/lib/flashinfer/data/include/flashinfer/mamba/invoke_selective_state_update_mtp.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/mamba 2026-04-14T21:02:57,774 copying build/lib/flashinfer/data/include/flashinfer/mamba/common.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/mamba 2026-04-14T21:02:57,776 copying build/lib/flashinfer/data/include/flashinfer/mamba/kernel_selective_state_update_mtp_horizontal.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/mamba 2026-04-14T21:02:57,779 copying build/lib/flashinfer/data/include/flashinfer/mamba/selective_state_update.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/mamba 2026-04-14T21:02:57,781 copying build/lib/flashinfer/data/include/flashinfer/mamba/kernel_selective_state_update_mtp_vertical.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/mamba 2026-04-14T21:02:57,785 copying build/lib/flashinfer/data/include/flashinfer/mamba/kernel_selective_state_update_stp.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/mamba 2026-04-14T21:02:57,788 copying build/lib/flashinfer/data/include/flashinfer/mamba/kernel_selective_state_update_mtp_async_horizontal.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/mamba 2026-04-14T21:02:57,791 copying build/lib/flashinfer/data/include/flashinfer/mamba/ssu_mtp_common.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/mamba 2026-04-14T21:02:57,794 copying build/lib/flashinfer/data/include/flashinfer/mamba/kernel_selective_state_update_mtp_simple.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/mamba 2026-04-14T21:02:57,797 copying build/lib/flashinfer/data/include/flashinfer/mamba/conversion.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/mamba 2026-04-14T21:02:57,799 copying build/lib/flashinfer/data/include/flashinfer/profiler.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer 2026-04-14T21:02:57,802 copying build/lib/flashinfer/data/include/flashinfer/mma.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer 2026-04-14T21:02:57,805 copying build/lib/flashinfer/data/include/flashinfer/frag_layout_swizzle.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer 2026-04-14T21:02:57,808 creating build/bdist.linux-armv7l/wheel/flashinfer/data/include/flashinfer/attention 2026-04-14T21:02:57,809 copying build/lib/flashinfer/data/include/flashinfer/attention/state.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/attention 2026-04-14T21:02:57,811 copying build/lib/flashinfer/data/include/flashinfer/attention/mla.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/attention 2026-04-14T21:02:57,815 copying build/lib/flashinfer/data/include/flashinfer/attention/variants.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/attention 2026-04-14T21:02:57,817 copying build/lib/flashinfer/data/include/flashinfer/attention/persistent.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/attention 2026-04-14T21:02:57,820 copying build/lib/flashinfer/data/include/flashinfer/attention/decode_mla_cute_sm80.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/attention 2026-04-14T21:02:57,822 copying build/lib/flashinfer/data/include/flashinfer/attention/heap.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/attention 2026-04-14T21:02:57,825 creating build/bdist.linux-armv7l/wheel/flashinfer/data/include/flashinfer/attention/hopper 2026-04-14T21:02:57,826 copying build/lib/flashinfer/data/include/flashinfer/attention/hopper/sparse_mainloop.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/attention/hopper 2026-04-14T21:02:57,829 copying build/lib/flashinfer/data/include/flashinfer/attention/hopper/tile_scheduler.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/attention/hopper 2026-04-14T21:02:57,832 copying build/lib/flashinfer/data/include/flashinfer/attention/hopper/kernel_traits.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/attention/hopper 2026-04-14T21:02:57,835 copying build/lib/flashinfer/data/include/flashinfer/attention/hopper/variants.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/attention/hopper 2026-04-14T21:02:57,837 copying build/lib/flashinfer/data/include/flashinfer/attention/hopper/mainloop.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/attention/hopper 2026-04-14T21:02:57,840 copying build/lib/flashinfer/data/include/flashinfer/attention/hopper/utils.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/attention/hopper 2026-04-14T21:02:57,842 copying build/lib/flashinfer/data/include/flashinfer/attention/hopper/attention_updater.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/attention/hopper 2026-04-14T21:02:57,845 copying build/lib/flashinfer/data/include/flashinfer/attention/hopper/prefill_sm90.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/attention/hopper 2026-04-14T21:02:57,848 copying build/lib/flashinfer/data/include/flashinfer/attention/hopper/mainloop_mma.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/attention/hopper 2026-04-14T21:02:57,851 copying build/lib/flashinfer/data/include/flashinfer/attention/hopper/variant_helper.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/attention/hopper 2026-04-14T21:02:57,853 copying build/lib/flashinfer/data/include/flashinfer/attention/hopper/epilogue.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/attention/hopper 2026-04-14T21:02:57,856 copying build/lib/flashinfer/data/include/flashinfer/attention/hopper/named_barrier.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/attention/hopper 2026-04-14T21:02:57,858 copying build/lib/flashinfer/data/include/flashinfer/attention/hopper/default_params.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/attention/hopper 2026-04-14T21:02:57,860 creating build/bdist.linux-armv7l/wheel/flashinfer/data/include/flashinfer/attention/hopper/quantization 2026-04-14T21:02:57,862 copying build/lib/flashinfer/data/include/flashinfer/attention/hopper/quantization/kernel_traits.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/attention/hopper/quantization 2026-04-14T21:02:57,864 copying build/lib/flashinfer/data/include/flashinfer/attention/hopper/quantization/prefill_sm90.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/attention/hopper/quantization 2026-04-14T21:02:57,867 copying build/lib/flashinfer/data/include/flashinfer/attention/hopper/quantization/mainloop_mma.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/attention/hopper/quantization 2026-04-14T21:02:57,869 copying build/lib/flashinfer/data/include/flashinfer/attention/hopper/quantization/mainloop_load.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/attention/hopper/quantization 2026-04-14T21:02:57,872 copying build/lib/flashinfer/data/include/flashinfer/attention/hopper/quantization/epilogue.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/attention/hopper/quantization 2026-04-14T21:02:57,874 copying build/lib/flashinfer/data/include/flashinfer/attention/hopper/quantization/mainloop_sparse_load.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/attention/hopper/quantization 2026-04-14T21:02:57,876 copying build/lib/flashinfer/data/include/flashinfer/attention/mla_hopper.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/attention 2026-04-14T21:02:57,879 copying build/lib/flashinfer/data/include/flashinfer/attention/default_prefill_params.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/attention 2026-04-14T21:02:57,882 copying build/lib/flashinfer/data/include/flashinfer/attention/mask.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/attention 2026-04-14T21:02:57,884 copying build/lib/flashinfer/data/include/flashinfer/attention/default_decode_params.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/attention 2026-04-14T21:02:57,887 copying build/lib/flashinfer/data/include/flashinfer/attention/hopper.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/attention 2026-04-14T21:02:57,890 copying build/lib/flashinfer/data/include/flashinfer/attention/cutlass_mla.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/attention 2026-04-14T21:02:57,892 copying build/lib/flashinfer/data/include/flashinfer/attention/decode.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/attention 2026-04-14T21:02:57,895 copying build/lib/flashinfer/data/include/flashinfer/attention/cascade.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/attention 2026-04-14T21:02:57,898 copying build/lib/flashinfer/data/include/flashinfer/attention/pod.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/attention 2026-04-14T21:02:57,901 creating build/bdist.linux-armv7l/wheel/flashinfer/data/include/flashinfer/attention/blackwell 2026-04-14T21:02:57,902 copying build/lib/flashinfer/data/include/flashinfer/attention/blackwell/plan.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/attention/blackwell 2026-04-14T21:02:57,905 copying build/lib/flashinfer/data/include/flashinfer/attention/blackwell/fmha_cutlass_sm100.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/attention/blackwell 2026-04-14T21:02:57,908 creating build/bdist.linux-armv7l/wheel/flashinfer/data/include/flashinfer/attention/blackwell/device 2026-04-14T21:02:57,909 copying build/lib/flashinfer/data/include/flashinfer/attention/blackwell/device/fmha.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/attention/blackwell/device 2026-04-14T21:02:57,911 copying build/lib/flashinfer/data/include/flashinfer/attention/blackwell/device/sm100_mla.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/attention/blackwell/device 2026-04-14T21:02:57,914 creating build/bdist.linux-armv7l/wheel/flashinfer/data/include/flashinfer/attention/blackwell/common 2026-04-14T21:02:57,915 copying build/lib/flashinfer/data/include/flashinfer/attention/blackwell/common/pow_2.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/attention/blackwell/common 2026-04-14T21:02:57,918 creating build/bdist.linux-armv7l/wheel/flashinfer/data/include/flashinfer/attention/blackwell/collective 2026-04-14T21:02:57,919 copying build/lib/flashinfer/data/include/flashinfer/attention/blackwell/collective/sm100_fmha_load_cpasync_warpspecialized.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/attention/blackwell/collective 2026-04-14T21:02:57,922 copying build/lib/flashinfer/data/include/flashinfer/attention/blackwell/collective/sm100_fmha_load_tma_warpspecialized.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/attention/blackwell/collective 2026-04-14T21:02:57,924 copying build/lib/flashinfer/data/include/flashinfer/attention/blackwell/collective/sm100_fmha_fwd_epilogue_tma_warpspecialized.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/attention/blackwell/collective 2026-04-14T21:02:57,926 copying build/lib/flashinfer/data/include/flashinfer/attention/blackwell/collective/fmha_common.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/attention/blackwell/collective 2026-04-14T21:02:57,928 copying build/lib/flashinfer/data/include/flashinfer/attention/blackwell/collective/sm100_fmha_gen_epilogue_warpspecialized.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/attention/blackwell/collective 2026-04-14T21:02:57,930 copying build/lib/flashinfer/data/include/flashinfer/attention/blackwell/collective/sm100_fmha_fwd_mainloop_tma_warpspecialized.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/attention/blackwell/collective 2026-04-14T21:02:57,933 copying build/lib/flashinfer/data/include/flashinfer/attention/blackwell/collective/fmha_fusion.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/attention/blackwell/collective 2026-04-14T21:02:57,936 copying build/lib/flashinfer/data/include/flashinfer/attention/blackwell/collective/sm100_fmha_gen_mainloop_warpspecialized.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/attention/blackwell/collective 2026-04-14T21:02:57,939 creating build/bdist.linux-armv7l/wheel/flashinfer/data/include/flashinfer/attention/blackwell/kernel 2026-04-14T21:02:57,940 copying build/lib/flashinfer/data/include/flashinfer/attention/blackwell/kernel/fmha_options.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/attention/blackwell/kernel 2026-04-14T21:02:57,942 copying build/lib/flashinfer/data/include/flashinfer/attention/blackwell/kernel/sm100_mla_tile_scheduler.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/attention/blackwell/kernel 2026-04-14T21:02:57,945 copying build/lib/flashinfer/data/include/flashinfer/attention/blackwell/kernel/sm100_fmha_gen_kernel_warpspecialized.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/attention/blackwell/kernel 2026-04-14T21:02:57,948 copying build/lib/flashinfer/data/include/flashinfer/attention/blackwell/kernel/fmha_tile_scheduler.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/attention/blackwell/kernel 2026-04-14T21:02:57,950 copying build/lib/flashinfer/data/include/flashinfer/attention/blackwell/kernel/sm100_fmha_mla_reduction.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/attention/blackwell/kernel 2026-04-14T21:02:57,952 copying build/lib/flashinfer/data/include/flashinfer/attention/blackwell/kernel/gather_tensor.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/attention/blackwell/kernel 2026-04-14T21:02:57,954 copying build/lib/flashinfer/data/include/flashinfer/attention/blackwell/kernel/sm100_fmha_mla_tma_warpspecialized.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/attention/blackwell/kernel 2026-04-14T21:02:57,958 copying build/lib/flashinfer/data/include/flashinfer/attention/blackwell/kernel/sm100_fmha_fwd_kernel_tma_warpspecialized.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/attention/blackwell/kernel 2026-04-14T21:02:57,960 copying build/lib/flashinfer/data/include/flashinfer/attention/batch_pod.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/attention 2026-04-14T21:02:57,963 copying build/lib/flashinfer/data/include/flashinfer/attention/variant_helper.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/attention 2026-04-14T21:02:57,965 copying build/lib/flashinfer/data/include/flashinfer/attention/prefill.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/attention 2026-04-14T21:02:57,969 copying build/lib/flashinfer/data/include/flashinfer/attention/mla_params.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/attention 2026-04-14T21:02:57,971 copying build/lib/flashinfer/data/include/flashinfer/attention/scheduler.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/attention 2026-04-14T21:02:57,975 copying build/lib/flashinfer/data/include/flashinfer/attention/persistent_template.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/attention 2026-04-14T21:02:57,977 copying build/lib/flashinfer/data/include/flashinfer/cp_async.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer 2026-04-14T21:02:57,979 creating build/bdist.linux-armv7l/wheel/flashinfer/data/include/flashinfer/trtllm 2026-04-14T21:02:57,981 creating build/bdist.linux-armv7l/wheel/flashinfer/data/include/flashinfer/trtllm/fused_moe 2026-04-14T21:02:57,982 copying build/lib/flashinfer/data/include/flashinfer/trtllm/fused_moe/IntFastDiv.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/trtllm/fused_moe 2026-04-14T21:02:57,985 copying build/lib/flashinfer/data/include/flashinfer/trtllm/fused_moe/runner.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/trtllm/fused_moe 2026-04-14T21:02:57,987 copying build/lib/flashinfer/data/include/flashinfer/trtllm/fused_moe/RoutingCustomPolicy.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/trtllm/fused_moe 2026-04-14T21:02:57,990 copying build/lib/flashinfer/data/include/flashinfer/trtllm/fused_moe/RoutingKernel.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/trtllm/fused_moe 2026-04-14T21:02:57,993 copying build/lib/flashinfer/data/include/flashinfer/trtllm/fused_moe/RoutingKernel.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/trtllm/fused_moe 2026-04-14T21:02:57,996 copying build/lib/flashinfer/data/include/flashinfer/trtllm/fused_moe/RoutingDevKernel.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/trtllm/fused_moe 2026-04-14T21:02:57,998 copying build/lib/flashinfer/data/include/flashinfer/trtllm/fused_moe/noAuxTcKernels.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/trtllm/fused_moe 2026-04-14T21:02:58,000 copying build/lib/flashinfer/data/include/flashinfer/trtllm/fused_moe/RoutingKernelTopK.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/trtllm/fused_moe 2026-04-14T21:02:58,003 copying build/lib/flashinfer/data/include/flashinfer/trtllm/fused_moe/DevKernel.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/trtllm/fused_moe 2026-04-14T21:02:58,006 creating build/bdist.linux-armv7l/wheel/flashinfer/data/include/flashinfer/trtllm/common 2026-04-14T21:02:58,007 copying build/lib/flashinfer/data/include/flashinfer/trtllm/common/cudaUtils.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/trtllm/common 2026-04-14T21:02:58,010 copying build/lib/flashinfer/data/include/flashinfer/trtllm/common/cudaBf16Fallbacks.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/trtllm/common 2026-04-14T21:02:58,012 copying build/lib/flashinfer/data/include/flashinfer/trtllm/common/reduceKernelUtils.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/trtllm/common 2026-04-14T21:02:58,015 copying build/lib/flashinfer/data/include/flashinfer/trtllm/common/cudaBf16Wrapper.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/trtllm/common 2026-04-14T21:02:58,017 copying build/lib/flashinfer/data/include/flashinfer/trtllm/common/cudaTypeUtils.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/trtllm/common 2026-04-14T21:02:58,019 copying build/lib/flashinfer/data/include/flashinfer/trtllm/common/cudaFp8Utils.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/trtllm/common 2026-04-14T21:02:58,022 creating build/bdist.linux-armv7l/wheel/flashinfer/data/include/flashinfer/trtllm/batched_gemm 2026-04-14T21:02:58,023 copying build/lib/flashinfer/data/include/flashinfer/trtllm/batched_gemm/KernelRunner.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/trtllm/batched_gemm 2026-04-14T21:02:58,026 copying build/lib/flashinfer/data/include/flashinfer/trtllm/common.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/trtllm 2026-04-14T21:02:58,029 creating build/bdist.linux-armv7l/wheel/flashinfer/data/include/flashinfer/trtllm/fmha 2026-04-14T21:02:58,031 copying build/lib/flashinfer/data/include/flashinfer/trtllm/fmha/kernelParams.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/trtllm/fmha 2026-04-14T21:02:58,034 copying build/lib/flashinfer/data/include/flashinfer/trtllm/fmha/fmhaKernels.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/trtllm/fmha 2026-04-14T21:02:58,038 copying build/lib/flashinfer/data/include/flashinfer/trtllm/fmha/decoder_params.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/trtllm/fmha 2026-04-14T21:02:58,040 copying build/lib/flashinfer/data/include/flashinfer/trtllm/fmha/lse.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/trtllm/fmha 2026-04-14T21:02:58,042 copying build/lib/flashinfer/data/include/flashinfer/trtllm/fmha/fmhaRunnerParams.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/trtllm/fmha 2026-04-14T21:02:58,045 copying build/lib/flashinfer/data/include/flashinfer/trtllm/fmha/kernelUtils.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/trtllm/fmha 2026-04-14T21:02:58,048 copying build/lib/flashinfer/data/include/flashinfer/trtllm/fmha/fmhaReduction.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/trtllm/fmha 2026-04-14T21:02:58,050 copying build/lib/flashinfer/data/include/flashinfer/trtllm/fmha/decoder_impl_common.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/trtllm/fmha 2026-04-14T21:02:58,053 copying build/lib/flashinfer/data/include/flashinfer/trtllm/fmha/fmhaRunner.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer/trtllm/fmha 2026-04-14T21:02:58,055 copying build/lib/flashinfer/data/include/flashinfer/logging.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer 2026-04-14T21:02:58,058 copying build/lib/flashinfer/data/include/flashinfer/page.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/include/flashinfer 2026-04-14T21:02:58,061 copying build/lib/flashinfer/data/build_utils.py -> build/bdist.linux-armv7l/wheel/./flashinfer/data 2026-04-14T21:02:58,067 creating build/bdist.linux-armv7l/wheel/flashinfer/data/csrc 2026-04-14T21:02:58,068 copying build/lib/flashinfer/data/csrc/trtllm_batched_gemm_runner.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-14T21:02:58,072 copying build/lib/flashinfer/data/csrc/norm.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-14T21:02:58,075 copying build/lib/flashinfer/data/csrc/single_decode_jit_binding.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-14T21:02:58,077 copying build/lib/flashinfer/data/csrc/fp4_gemm_cutlass_sm120.jinja -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-14T21:02:58,079 copying build/lib/flashinfer/data/csrc/single_prefill_sm90_kernel_inst.jinja -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-14T21:02:58,081 copying build/lib/flashinfer/data/csrc/fmha_v2_run.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-14T21:02:58,085 copying build/lib/flashinfer/data/csrc/batch_prefill_sm90_jit_binding.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-14T21:02:58,087 copying build/lib/flashinfer/data/csrc/pod_kernel_inst.jinja -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-14T21:02:58,089 copying build/lib/flashinfer/data/csrc/tgv_gemm.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-14T21:02:58,092 copying build/lib/flashinfer/data/csrc/batch_prefill_paged_sm90_kernel_inst.jinja -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-14T21:02:58,094 copying build/lib/flashinfer/data/csrc/flashinfer_cascade_binding.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-14T21:02:58,096 copying build/lib/flashinfer/data/csrc/trtllm_alltoall_prepare.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-14T21:02:58,099 copying build/lib/flashinfer/data/csrc/single_prefill_customize_config.jinja -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-14T21:02:58,101 copying build/lib/flashinfer/data/csrc/seq_chunk_cumsum.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-14T21:02:58,104 copying build/lib/flashinfer/data/csrc/vllm_custom_all_reduce.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-14T21:02:58,106 copying build/lib/flashinfer/data/csrc/batch_prefill_fp8_paged_sm90_kernel_inst.jinja -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-14T21:02:58,109 copying build/lib/flashinfer/data/csrc/batch_mla_sm90_run.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-14T21:02:58,111 copying build/lib/flashinfer/data/csrc/flashinfer_gemm_binding.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-14T21:02:58,114 copying build/lib/flashinfer/data/csrc/rope.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-14T21:02:58,116 copying build/lib/flashinfer/data/csrc/batch_prefill_jit_binding.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-14T21:02:58,118 copying build/lib/flashinfer/data/csrc/batch_attention_paged_kernel_inst.jinja -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-14T21:02:58,120 copying build/lib/flashinfer/data/csrc/gemm_sm120_binding.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-14T21:02:58,122 copying build/lib/flashinfer/data/csrc/sampling_utils.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-14T21:02:58,125 copying build/lib/flashinfer/data/csrc/single_decode_customize_config.jinja -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-14T21:02:58,127 copying build/lib/flashinfer/data/csrc/group_gemm_sm100_binding.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-14T21:02:58,129 copying build/lib/flashinfer/data/csrc/tvm_ffi_utils.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-14T21:02:58,132 copying build/lib/flashinfer/data/csrc/trtllm_low_latency_gemm_runner.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-14T21:02:58,134 copying build/lib/flashinfer/data/csrc/selective_state_update_kernel_inst.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-14T21:02:58,136 copying build/lib/flashinfer/data/csrc/batch_mla_run.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-14T21:02:58,139 copying build/lib/flashinfer/data/csrc/batch_mla_sm90_plan.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-14T21:02:58,142 copying build/lib/flashinfer/data/csrc/batch_pod_kernel_inst.jinja -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-14T21:02:58,144 copying build/lib/flashinfer/data/csrc/flashinfer_rope_binding.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-14T21:02:58,147 copying build/lib/flashinfer/data/csrc/batch_prefill_fp8_ragged_sm90_kernel_inst.jinja -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-14T21:02:58,150 copying build/lib/flashinfer/data/csrc/gemm_sm100_binding.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-14T21:02:58,152 copying build/lib/flashinfer/data/csrc/group_gemm_sm90.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-14T21:02:58,154 copying build/lib/flashinfer/data/csrc/blackwell_fmha_plan.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-14T21:02:58,156 copying build/lib/flashinfer/data/csrc/batch_decode_mla_config.jinja -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-14T21:02:58,158 copying build/lib/flashinfer/data/csrc/selective_state_update.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-14T21:02:58,161 copying build/lib/flashinfer/data/csrc/fmhaReduction.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-14T21:02:58,163 copying build/lib/flashinfer/data/csrc/batch_decode_mla_cute_sm80.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-14T21:02:58,166 copying build/lib/flashinfer/data/csrc/rmsnorm_silu.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-14T21:02:58,168 creating build/bdist.linux-armv7l/wheel/flashinfer/data/csrc/fused_moe 2026-04-14T21:02:58,169 copying build/lib/flashinfer/data/csrc/fused_moe/noAuxTcKernels.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/fused_moe 2026-04-14T21:02:58,173 creating build/bdist.linux-armv7l/wheel/flashinfer/data/csrc/fused_moe/cutlass_backend 2026-04-14T21:02:58,174 copying build/lib/flashinfer/data/csrc/fused_moe/cutlass_backend/flashinfer_cutlass_fused_moe_binding.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/fused_moe/cutlass_backend 2026-04-14T21:02:58,177 copying build/lib/flashinfer/data/csrc/fused_moe/cutlass_backend/deepgemm_jit_setup.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/fused_moe/cutlass_backend 2026-04-14T21:02:58,179 copying build/lib/flashinfer/data/csrc/fused_moe/cutlass_backend/cutlass_fused_moe_kernels.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/fused_moe/cutlass_backend 2026-04-14T21:02:58,185 copying build/lib/flashinfer/data/csrc/fused_moe/cutlass_backend/cutlass_fused_moe_instantiation.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/fused_moe/cutlass_backend 2026-04-14T21:02:58,188 creating build/bdist.linux-armv7l/wheel/flashinfer/data/csrc/fused_moe/trtllm_backend 2026-04-14T21:02:58,189 copying build/lib/flashinfer/data/csrc/fused_moe/trtllm_backend/trtllm_fused_moe_routing_llama4.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/fused_moe/trtllm_backend 2026-04-14T21:02:58,192 copying build/lib/flashinfer/data/csrc/fused_moe/trtllm_backend/trtllm_fused_moe_routing_deepseek.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/fused_moe/trtllm_backend 2026-04-14T21:02:58,194 copying build/lib/flashinfer/data/csrc/fused_moe/trtllm_backend/trtllm_fused_moe_dev_kernel.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/fused_moe/trtllm_backend 2026-04-14T21:02:58,197 copying build/lib/flashinfer/data/csrc/fused_moe/trtllm_backend/trtllm_fused_moe_routing_common.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/fused_moe/trtllm_backend 2026-04-14T21:02:58,200 copying build/lib/flashinfer/data/csrc/fused_moe/trtllm_backend/trtllm_fused_moe_routing_custom.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/fused_moe/trtllm_backend 2026-04-14T21:02:58,203 copying build/lib/flashinfer/data/csrc/fused_moe/moeTopKFuncs.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/fused_moe 2026-04-14T21:02:58,206 copying build/lib/flashinfer/data/csrc/renorm.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-14T21:02:58,208 copying build/lib/flashinfer/data/csrc/mxfp8_gemm_cutlass.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-14T21:02:58,210 copying build/lib/flashinfer/data/csrc/cutlass_mla.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-14T21:02:58,212 copying build/lib/flashinfer/data/csrc/page.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-14T21:02:58,214 copying build/lib/flashinfer/data/csrc/single_prefill.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-14T21:02:58,216 copying build/lib/flashinfer/data/csrc/fp8_blockscale_gemm_sm90_binding.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-14T21:02:58,218 copying build/lib/flashinfer/data/csrc/batch_attention_jit_binding.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-14T21:02:58,220 copying build/lib/flashinfer/data/csrc/runtime_utils.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-14T21:02:58,222 copying build/lib/flashinfer/data/csrc/single_prefill_fp8_sm90_kernel_inst.jinja -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-14T21:02:58,224 copying build/lib/flashinfer/data/csrc/group_gemm_sm120_binding.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-14T21:02:58,225 copying build/lib/flashinfer/data/csrc/single_prefill_sm90_customize_config.jinja -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-14T21:02:58,227 copying build/lib/flashinfer/data/csrc/flashinfer_xqa_binding.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-14T21:02:58,229 copying build/lib/flashinfer/data/csrc/flashinfer_quantization_binding.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-14T21:02:58,231 copying build/lib/flashinfer/data/csrc/batch_pod_customize_config.jinja -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-14T21:02:58,233 copying build/lib/flashinfer/data/csrc/selective_state_update_dtype_inst.jinja -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-14T21:02:58,235 copying build/lib/flashinfer/data/csrc/trtllm_moe_alltoall.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-14T21:02:58,237 copying build/lib/flashinfer/data/csrc/gdn_prefill_launcher.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-14T21:02:58,240 copying build/lib/flashinfer/data/csrc/bf16_gemm_cutlass.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-14T21:02:58,242 copying build/lib/flashinfer/data/csrc/batch_pod.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-14T21:02:58,244 copying build/lib/flashinfer/data/csrc/group_gemm_mxfp4_groupwise_sm120.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-14T21:02:58,246 copying build/lib/flashinfer/data/csrc/batch_prefill_ragged_sm90_kernel_inst.jinja -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-14T21:02:58,248 copying build/lib/flashinfer/data/csrc/moe_utils_binding.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-14T21:02:58,251 copying build/lib/flashinfer/data/csrc/concat_mla.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-14T21:02:58,253 copying build/lib/flashinfer/data/csrc/flashinfer_rmsnorm_silu_binding.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-14T21:02:58,255 copying build/lib/flashinfer/data/csrc/batch_decode_kernel_inst.jinja -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-14T21:02:58,256 copying build/lib/flashinfer/data/csrc/group_gemm_sm90_kernel_inst.jinja -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-14T21:02:58,258 copying build/lib/flashinfer/data/csrc/group_gemm_nvfp4_groupwise_sm120_kernel_inst.jinja -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-14T21:02:58,260 copying build/lib/flashinfer/data/csrc/flashinfer_page_binding.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-14T21:02:58,262 copying build/lib/flashinfer/data/csrc/batch_pod_jit_binding.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-14T21:02:58,264 copying build/lib/flashinfer/data/csrc/group_gemm_mxfp4_groupwise_sm100.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-14T21:02:58,267 copying build/lib/flashinfer/data/csrc/trtllm_fused_moe_kernel_launcher.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-14T21:02:58,270 copying build/lib/flashinfer/data/csrc/flashinfer_topk_binding.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-14T21:02:58,272 copying build/lib/flashinfer/data/csrc/mxfp8_gemm_cutlass_sm120.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-14T21:02:58,275 copying build/lib/flashinfer/data/csrc/prefill_kernel_delta_rule_sm90.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-14T21:02:58,278 creating build/bdist.linux-armv7l/wheel/flashinfer/data/csrc/xqa 2026-04-14T21:02:58,279 copying build/lib/flashinfer/data/csrc/xqa/barriers.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/xqa 2026-04-14T21:02:58,282 copying build/lib/flashinfer/data/csrc/xqa/utils.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/xqa 2026-04-14T21:02:58,284 copying build/lib/flashinfer/data/csrc/xqa/tensorMap.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/xqa 2026-04-14T21:02:58,286 copying build/lib/flashinfer/data/csrc/xqa/mla_sm120.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/xqa 2026-04-14T21:02:58,289 copying build/lib/flashinfer/data/csrc/xqa/defines.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/xqa 2026-04-14T21:02:58,291 copying build/lib/flashinfer/data/csrc/xqa/platform.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/xqa 2026-04-14T21:02:58,293 copying build/lib/flashinfer/data/csrc/xqa/utils.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/xqa 2026-04-14T21:02:58,296 copying build/lib/flashinfer/data/csrc/xqa/mhaUtils.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/xqa 2026-04-14T21:02:58,299 copying build/lib/flashinfer/data/csrc/xqa/gmma.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/xqa 2026-04-14T21:02:58,301 copying build/lib/flashinfer/data/csrc/xqa/mha.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/xqa 2026-04-14T21:02:58,303 copying build/lib/flashinfer/data/csrc/xqa/mha_components.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/xqa 2026-04-14T21:02:58,306 copying build/lib/flashinfer/data/csrc/xqa/ldgsts.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/xqa 2026-04-14T21:02:58,308 copying build/lib/flashinfer/data/csrc/xqa/mha.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/xqa 2026-04-14T21:02:58,312 copying build/lib/flashinfer/data/csrc/xqa/tensorMap.cpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/xqa 2026-04-14T21:02:58,315 copying build/lib/flashinfer/data/csrc/xqa/mha_stdheaders.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/xqa 2026-04-14T21:02:58,317 copying build/lib/flashinfer/data/csrc/xqa/mha_sm90.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/xqa 2026-04-14T21:02:58,321 copying build/lib/flashinfer/data/csrc/xqa/mla_sm120.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/xqa 2026-04-14T21:02:58,323 copying build/lib/flashinfer/data/csrc/xqa/mma.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/xqa 2026-04-14T21:02:58,325 copying build/lib/flashinfer/data/csrc/xqa/xqa_wrapper.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/xqa 2026-04-14T21:02:58,327 copying build/lib/flashinfer/data/csrc/xqa/tma.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/xqa 2026-04-14T21:02:58,329 copying build/lib/flashinfer/data/csrc/xqa/hostUtils.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/xqa 2026-04-14T21:02:58,331 copying build/lib/flashinfer/data/csrc/xqa/cuda_hint.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/xqa 2026-04-14T21:02:58,333 copying build/lib/flashinfer/data/csrc/xqa/specDec.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/xqa 2026-04-14T21:02:58,335 copying build/lib/flashinfer/data/csrc/xqa/gmma_impl.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/xqa 2026-04-14T21:02:58,342 copying build/lib/flashinfer/data/csrc/bmm_fp8.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-14T21:02:58,343 copying build/lib/flashinfer/data/csrc/batch_decode_mla_plan.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-14T21:02:58,345 copying build/lib/flashinfer/data/csrc/batch_mla_config.jinja -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-14T21:02:58,347 copying build/lib/flashinfer/data/csrc/pod.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-14T21:02:58,349 copying build/lib/flashinfer/data/csrc/trtllm_fused_moe_runner.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-14T21:02:58,352 copying build/lib/flashinfer/data/csrc/batch_mla_sm90_binding.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-14T21:02:58,354 copying build/lib/flashinfer/data/csrc/batch_prefill_customize_config.jinja -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-14T21:02:58,356 copying build/lib/flashinfer/data/csrc/batch_decode_customize_config.jinja -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-14T21:02:58,359 creating build/bdist.linux-armv7l/wheel/flashinfer/data/csrc/nv_internal 2026-04-14T21:02:58,360 creating build/bdist.linux-armv7l/wheel/flashinfer/data/csrc/nv_internal/cpp 2026-04-14T21:02:58,362 creating build/bdist.linux-armv7l/wheel/flashinfer/data/csrc/nv_internal/cpp/common 2026-04-14T21:02:58,363 copying build/lib/flashinfer/data/csrc/nv_internal/cpp/common/memoryUtils.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/cpp/common 2026-04-14T21:02:58,366 copying build/lib/flashinfer/data/csrc/nv_internal/cpp/common/envUtils.cpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/cpp/common 2026-04-14T21:02:58,368 copying build/lib/flashinfer/data/csrc/nv_internal/cpp/common/tllmException.cpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/cpp/common 2026-04-14T21:02:58,370 copying build/lib/flashinfer/data/csrc/nv_internal/cpp/common/logger.cpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/cpp/common 2026-04-14T21:02:58,372 copying build/lib/flashinfer/data/csrc/nv_internal/cpp/common/stringUtils.cpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/cpp/common 2026-04-14T21:02:58,374 creating build/bdist.linux-armv7l/wheel/flashinfer/data/csrc/nv_internal/cpp/kernels 2026-04-14T21:02:58,375 copying build/lib/flashinfer/data/csrc/nv_internal/cpp/kernels/quantization.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/cpp/kernels 2026-04-14T21:02:58,378 creating build/bdist.linux-armv7l/wheel/flashinfer/data/csrc/nv_internal/tensorrt_llm 2026-04-14T21:02:58,380 creating build/bdist.linux-armv7l/wheel/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions 2026-04-14T21:02:58,382 creating build/bdist.linux-armv7l/wheel/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include 2026-04-14T21:02:58,383 creating build/bdist.linux-armv7l/wheel/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions 2026-04-14T21:02:58,385 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/interleaved_numeric_conversion.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions 2026-04-14T21:02:58,388 creating build/bdist.linux-armv7l/wheel/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/epilogue 2026-04-14T21:02:58,389 creating build/bdist.linux-armv7l/wheel/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/epilogue/thread 2026-04-14T21:02:58,391 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/epilogue/thread/fused_activations.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/epilogue/thread 2026-04-14T21:02:58,393 creating build/bdist.linux-armv7l/wheel/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/epilogue/collective 2026-04-14T21:02:58,394 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/epilogue/collective/epilogue_moe_finalize.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/epilogue/collective 2026-04-14T21:02:58,397 creating build/bdist.linux-armv7l/wheel/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/epilogue/fusion 2026-04-14T21:02:58,398 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/epilogue/fusion/sm90_visitor_allreduce_tma_warpspecialized.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/epilogue/fusion 2026-04-14T21:02:58,401 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/epilogue/fusion/sm90_visitor_scatter.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/epilogue/fusion 2026-04-14T21:02:58,404 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/tile_interleaved_layout.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions 2026-04-14T21:02:58,406 creating build/bdist.linux-armv7l/wheel/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/arch 2026-04-14T21:02:58,407 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/arch/copy_traits_sm90_multimem.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/arch 2026-04-14T21:02:58,410 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/arch/mma.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/arch 2026-04-14T21:02:58,411 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/arch/copy_red_global.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/arch 2026-04-14T21:02:58,414 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/arch/grid_dependency_control.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/arch 2026-04-14T21:02:58,416 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/arch/copy_sm90_multimem.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/arch 2026-04-14T21:02:58,418 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/system_barrier.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions 2026-04-14T21:02:58,420 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm_configs.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions 2026-04-14T21:02:58,424 creating build/bdist.linux-armv7l/wheel/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/communication 2026-04-14T21:02:58,425 creating build/bdist.linux-armv7l/wheel/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/communication/collective 2026-04-14T21:02:58,427 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/communication/collective/sm90_allreduce_nvls_warpspecialized.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/communication/collective 2026-04-14T21:02:58,429 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/compute_occupancy.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions 2026-04-14T21:02:58,432 creating build/bdist.linux-armv7l/wheel/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm 2026-04-14T21:02:58,434 creating build/bdist.linux-armv7l/wheel/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/threadblock 2026-04-14T21:02:58,435 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/threadblock/default_dq_mma_pipelined.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/threadblock 2026-04-14T21:02:58,438 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/threadblock/default_mma_bf16.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/threadblock 2026-04-14T21:02:58,440 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/threadblock/dq_mma_multistage.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/threadblock 2026-04-14T21:02:58,442 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/threadblock/dq_mma_pipelined_percol.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/threadblock 2026-04-14T21:02:58,445 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/threadblock/dq_mma_base.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/threadblock 2026-04-14T21:02:58,447 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/threadblock/dq_mma_multistage_percol.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/threadblock 2026-04-14T21:02:58,450 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/threadblock/dq_mma_pipelined_finegrained.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/threadblock 2026-04-14T21:02:58,453 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/threadblock/dq_mma_multistage_finegrained.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/threadblock 2026-04-14T21:02:58,455 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/threadblock/dq_mma_pipelined.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/threadblock 2026-04-14T21:02:58,458 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/threadblock/default_dq_mma.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/threadblock 2026-04-14T21:02:58,460 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/threadblock/default_dq_mma_multistage.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/threadblock 2026-04-14T21:02:58,463 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/threadblock/default_mma.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/threadblock 2026-04-14T21:02:58,466 creating build/bdist.linux-armv7l/wheel/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/warp 2026-04-14T21:02:58,467 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/warp/default_mma_tensor_op.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/warp 2026-04-14T21:02:58,470 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/warp/mma_tensorop_dequantizer.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/warp 2026-04-14T21:02:58,473 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/warp/mma_tensorop_compute_B_with_f16.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/warp 2026-04-14T21:02:58,476 creating build/bdist.linux-armv7l/wheel/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective 2026-04-14T21:02:58,478 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective/sm90_mma_gated_tma_gmma_ss_warpspecialized_fp8.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective 2026-04-14T21:02:58,481 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective/sm90_mma_gated_tma_gmma_ss_warpspecialized.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective 2026-04-14T21:02:58,483 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective/sm90_mma_array_tma_gmma_rs_warpspecialized_mixed_input_.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective 2026-04-14T21:02:58,487 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective/collective_builder_gated.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective 2026-04-14T21:02:58,489 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective/collective_builder_mixed_input.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective 2026-04-14T21:02:58,491 creating build/bdist.linux-armv7l/wheel/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective/builders 2026-04-14T21:02:58,493 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective/builders/sm90_gmma_builder_interleaved.inl -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective/builders 2026-04-14T21:02:58,495 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective/builders/sm90_gmma_builder_mixed_input.inl -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective/builders 2026-04-14T21:02:58,498 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective/builders/sm90_gmma_builder_gated.inl -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective/builders 2026-04-14T21:02:58,500 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective/sm90_mma_interleaved_tma_gmma_rs_warpspecialized_mixed_input.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective 2026-04-14T21:02:58,504 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective/collective_builder_interleaved.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective 2026-04-14T21:02:58,506 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective/collective_mma_interleaved.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective 2026-04-14T21:02:58,508 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective/collective_mma_array_mixed_input.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective 2026-04-14T21:02:58,510 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective/collective_mma_gated.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective 2026-04-14T21:02:58,512 creating build/bdist.linux-armv7l/wheel/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/kernel 2026-04-14T21:02:58,514 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/kernel/sm90_gemm_allreduce_tma_warpspecialized.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/kernel 2026-04-14T21:02:58,516 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/kernel/sm90_gemm_allreduce_tma_warpspecialized_pingpong.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/kernel 2026-04-14T21:02:58,519 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/kernel/moe_cutlass_kernel.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/kernel 2026-04-14T21:02:58,522 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/kernel/default_fpA_intB_traits.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/kernel 2026-04-14T21:02:58,525 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/kernel/moe_cute_util.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/kernel 2026-04-14T21:02:58,527 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/kernel/mixed_gemm_B_layout.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/kernel 2026-04-14T21:02:58,530 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/kernel/fused_moe_kernel_traits.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/kernel 2026-04-14T21:02:58,532 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/kernel/gemm_moe_problem_visitor.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/kernel 2026-04-14T21:02:58,534 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/kernel/fused_moe_kernel_routine.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/kernel 2026-04-14T21:02:58,537 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/kernel/gemm_universal_allreduce.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/kernel 2026-04-14T21:02:58,539 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/kernel/fused_moe_kernel.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/kernel 2026-04-14T21:02:58,541 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/kernel/moe_problem_visitor.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/kernel 2026-04-14T21:02:58,544 creating build/bdist.linux-armv7l/wheel/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/transform 2026-04-14T21:02:58,546 creating build/bdist.linux-armv7l/wheel/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/transform/threadblock 2026-04-14T21:02:58,547 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/transform/threadblock/fine_grained_scale_zero_iterator.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/transform/threadblock 2026-04-14T21:02:58,550 creating build/bdist.linux-armv7l/wheel/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/detail 2026-04-14T21:02:58,551 creating build/bdist.linux-armv7l/wheel/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/detail/collective 2026-04-14T21:02:58,553 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/detail/collective/mixed_input_utils.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/detail/collective 2026-04-14T21:02:58,555 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/weight_only_quant_op.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions 2026-04-14T21:02:58,558 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/epilogue_helpers.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions 2026-04-14T21:02:58,560 creating build/bdist.linux-armv7l/wheel/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/util 2026-04-14T21:02:58,561 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/util/gather_tensor.hpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/util 2026-04-14T21:02:58,564 creating build/bdist.linux-armv7l/wheel/flashinfer/data/csrc/nv_internal/tensorrt_llm/common 2026-04-14T21:02:58,565 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/common/cudaDriverWrapper.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/common 2026-04-14T21:02:58,568 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/common/envUtils.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/common 2026-04-14T21:02:58,570 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/common/cudaBf16Fallbacks.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/common 2026-04-14T21:02:58,572 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/common/reduceKernelUtils.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/common 2026-04-14T21:02:58,575 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/common/cublasMMWrapper.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/common 2026-04-14T21:02:58,577 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/common/quantTypeUtils.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/common 2026-04-14T21:02:58,579 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/common/memoryUtils.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/common 2026-04-14T21:02:58,581 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/common/workspace.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/common 2026-04-14T21:02:58,583 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/common/cudaTypeUtils.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/common 2026-04-14T21:02:58,586 creating build/bdist.linux-armv7l/wheel/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels 2026-04-14T21:02:58,588 creating build/bdist.linux-armv7l/wheel/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cuteDslKernels 2026-04-14T21:02:58,590 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cuteDslKernels/moeUtils.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cuteDslKernels 2026-04-14T21:02:58,594 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cuteDslKernels/moeUtils.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cuteDslKernels 2026-04-14T21:02:58,597 creating build/bdist.linux-armv7l/wheel/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/lora 2026-04-14T21:02:58,599 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/lora/lora.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/lora 2026-04-14T21:02:58,601 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/lora/lora.cpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/lora 2026-04-14T21:02:58,604 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/preQuantScaleKernel.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels 2026-04-14T21:02:58,607 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/delayStream.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels 2026-04-14T21:02:58,609 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/quantization.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels 2026-04-14T21:02:58,612 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/quantization.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels 2026-04-14T21:02:58,616 creating build/bdist.linux-armv7l/wheel/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels 2026-04-14T21:02:58,618 creating build/bdist.linux-armv7l/wheel/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fp8_blockscale_gemm 2026-04-14T21:02:58,620 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fp8_blockscale_gemm/fp8_blockscale_mma_utils.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fp8_blockscale_gemm 2026-04-14T21:02:58,624 creating build/bdist.linux-armv7l/wheel/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fp8_blockscale_gemm/ada_blockwise_gemm 2026-04-14T21:02:58,625 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fp8_blockscale_gemm/ada_blockwise_gemm/sm89_fp8_gemm_1d1d.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fp8_blockscale_gemm/ada_blockwise_gemm 2026-04-14T21:02:58,629 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fp8_blockscale_gemm/ada_blockwise_gemm/sm89_utils.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fp8_blockscale_gemm/ada_blockwise_gemm 2026-04-14T21:02:58,632 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fp8_blockscale_gemm/fp8_blockscale_gemm.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fp8_blockscale_gemm 2026-04-14T21:02:58,635 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fp8_blockscale_gemm/fp8_blockscale_gemm_kernel.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fp8_blockscale_gemm 2026-04-14T21:02:58,640 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fp8_blockscale_gemm/fp8_blockscale_tma_utils.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fp8_blockscale_gemm 2026-04-14T21:02:58,643 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fp8_blockscale_gemm/fp8_blockscale_gemm.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fp8_blockscale_gemm 2026-04-14T21:02:58,646 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/cutlass_type_conversion.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels 2026-04-14T21:02:58,649 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/cutlass_heuristic.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels 2026-04-14T21:02:58,652 creating build/bdist.linux-armv7l/wheel/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm 2026-04-14T21:02:58,654 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/fp16_int4_gemm_per_col.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm 2026-04-14T21:02:58,657 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/e4m3_int4_gemm_fg_scalebias_f16_out_f16.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm 2026-04-14T21:02:58,659 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/fpA_intB_gemm_template.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm 2026-04-14T21:02:58,662 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/fp16_int8_gemm_fg_scaleonly.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm 2026-04-14T21:02:58,665 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/bf16_int4_gemm_fg_scalebias.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm 2026-04-14T21:02:58,667 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/e4m3_int4_gemm_fg_scaleonly_bf16_out_bf16.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm 2026-04-14T21:02:58,670 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/fp16_int4_gemm_fg_scaleonly.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm 2026-04-14T21:02:58,672 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/e4m3_int4_gemm_fg_scalebias_bf16_out_bf16.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm 2026-04-14T21:02:58,675 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/fp16_int8_gemm_fg_scalebias.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm 2026-04-14T21:02:58,677 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/bf16_int8_gemm_fg_scaleonly.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm 2026-04-14T21:02:58,679 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/bf16_int4_gemm_fg_scaleonly.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm 2026-04-14T21:02:58,682 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/fp16_int4_gemm_fg_scalebias.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm 2026-04-14T21:02:58,684 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/bf16_int4_gemm_per_col.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm 2026-04-14T21:02:58,687 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/bf16_int8_gemm_fg_scalebias.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm 2026-04-14T21:02:58,689 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/fp16_int8_gemm_per_col.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm 2026-04-14T21:02:58,691 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/fpA_intB_gemm_template_sm90.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm 2026-04-14T21:02:58,694 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/fpA_intB_gemm.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm 2026-04-14T21:02:58,696 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/e4m3_int4_gemm_per_col_f16_out_f16.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm 2026-04-14T21:02:58,698 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/e4m3_int4_gemm_fg_scaleonly_f16_out_f16.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm 2026-04-14T21:02:58,699 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/bf16_int8_gemm_per_col.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm 2026-04-14T21:02:58,702 creating build/bdist.linux-armv7l/wheel/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/launchers 2026-04-14T21:02:58,703 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/launchers/fpA_intB_launcher_sm90.inl -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/launchers 2026-04-14T21:02:58,706 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/launchers/fpA_intB_launcher_sm90.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/launchers 2026-04-14T21:02:58,707 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/cutlass_heuristic.cpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels 2026-04-14T21:02:58,711 creating build/bdist.linux-armv7l/wheel/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/include 2026-04-14T21:02:58,712 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/include/moe_kernels.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/include 2026-04-14T21:02:58,715 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/include/moe_gemm_kernels.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/include 2026-04-14T21:02:58,717 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/include/common.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/include 2026-04-14T21:02:58,719 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/include/cutlass_kernel_selector.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/include 2026-04-14T21:02:58,721 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/include/moe_util_kernels.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/include 2026-04-14T21:02:58,724 creating build/bdist.linux-armv7l/wheel/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm 2026-04-14T21:02:58,725 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/moe_gemm_template_dispatch_tma_ws.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm 2026-04-14T21:02:58,728 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/moe_gemm_template_dispatch.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm 2026-04-14T21:02:58,731 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/moe_gemm_kernels_fp16_uint4.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm 2026-04-14T21:02:58,733 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/moe_gemm_kernels_bf16_bf16.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm 2026-04-14T21:02:58,734 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/moe_gemm_kernels_bf16_fp8.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm 2026-04-14T21:02:58,736 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/moe_gemm_kernels_fp16_uint8.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm 2026-04-14T21:02:58,738 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/moe_gemm_kernels_fp8_fp8.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm 2026-04-14T21:02:58,740 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/moe_gemm_template_dispatch_tma_ws_mixed_dtype.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm 2026-04-14T21:02:58,743 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/moe_kernels.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm 2026-04-14T21:02:58,745 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/moe_gemm_kernels_fp8_fp4.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm 2026-04-14T21:02:58,747 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/moe_gemm_kernels_fp16_fp4.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm 2026-04-14T21:02:58,748 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/moe_gemm_kernels_fp8_uint4.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm 2026-04-14T21:02:58,750 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/moe_tma_warp_specialized_traits.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm 2026-04-14T21:02:58,753 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/moe_gemm_tma_warp_specialized_input.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm 2026-04-14T21:02:58,755 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/moe_gemm_kernels_fp32_fp32.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm 2026-04-14T21:02:58,757 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/moe_gemm_kernels_fp16_fp16.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm 2026-04-14T21:02:58,759 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/moe_gemm_kernels_bf16_uint8.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm 2026-04-14T21:02:58,760 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/moe_gemm_kernels_bf16_uint4.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm 2026-04-14T21:02:58,762 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/moe_gemm_kernels_bf16_fp4.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm 2026-04-14T21:02:58,764 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/moe_gemm_kernels_fp4_fp4.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm 2026-04-14T21:02:58,766 creating build/bdist.linux-armv7l/wheel/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/launchers 2026-04-14T21:02:58,767 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/launchers/fused_moe_gemm_launcher_sm80.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/launchers 2026-04-14T21:02:58,769 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/launchers/fused_moe_gemm_launcher_sm80.inl -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/launchers 2026-04-14T21:02:58,771 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/launchers/moe_gemm_tma_ws_launcher.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/launchers 2026-04-14T21:02:58,773 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/launchers/moe_gemm_tma_ws_mixed_input_launcher.inl -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/launchers 2026-04-14T21:02:58,776 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/launchers/moe_gemm_tma_ws_launcher.inl -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/launchers 2026-04-14T21:02:58,781 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/launchers/moe_gemm_tma_ws_mixed_input_launcher.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/launchers 2026-04-14T21:02:58,784 creating build/bdist.linux-armv7l/wheel/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/communicationKernels 2026-04-14T21:02:58,785 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/communicationKernels/moeAlltoAllKernels.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/communicationKernels 2026-04-14T21:02:58,787 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/communicationKernels/moeAlltoAllKernels.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/communicationKernels 2026-04-14T21:02:58,790 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/preQuantScaleKernel.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels 2026-04-14T21:02:58,792 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/delayStream.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels 2026-04-14T21:02:58,794 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/quantization_utils.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels 2026-04-14T21:02:58,798 creating build/bdist.linux-armv7l/wheel/flashinfer/data/csrc/nv_internal/tensorrt_llm/deep_gemm 2026-04-14T21:02:58,799 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/deep_gemm/nvrtc_cutlass.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/deep_gemm 2026-04-14T21:02:58,803 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/deep_gemm/runtime.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/deep_gemm 2026-04-14T21:02:58,805 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/deep_gemm/tma_utils.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/deep_gemm 2026-04-14T21:02:58,808 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/deep_gemm/fp8_gemm.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/deep_gemm 2026-04-14T21:02:58,811 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/deep_gemm/utils.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/deep_gemm 2026-04-14T21:02:58,813 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/deep_gemm/nvrtc_std.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/deep_gemm 2026-04-14T21:02:58,815 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/deep_gemm/jit_utils.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/deep_gemm 2026-04-14T21:02:58,818 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/deep_gemm/mma_utils.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/deep_gemm 2026-04-14T21:02:58,821 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/deep_gemm/scheduler.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/deep_gemm 2026-04-14T21:02:58,824 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/deep_gemm/fp8_gemm_impl.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/deep_gemm 2026-04-14T21:02:58,827 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/deep_gemm/compiler.cuh -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/deep_gemm 2026-04-14T21:02:58,831 creating build/bdist.linux-armv7l/wheel/flashinfer/data/csrc/nv_internal/tensorrt_llm/thop 2026-04-14T21:02:58,832 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/thop/utils.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/thop 2026-04-14T21:02:58,835 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/thop/fp8Quantize.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/thop 2026-04-14T21:02:58,837 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/thop/fp8Quantize.cpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/thop 2026-04-14T21:02:58,840 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/thop/fp4Quantize.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/thop 2026-04-14T21:02:58,842 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/thop/fp4Op.cpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/thop 2026-04-14T21:02:58,844 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/thop/fp4Quantize.cpp -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/thop 2026-04-14T21:02:58,847 copying build/lib/flashinfer/data/csrc/nv_internal/tensorrt_llm/thop/moeAlltoAllMeta.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/tensorrt_llm/thop 2026-04-14T21:02:58,850 creating build/bdist.linux-armv7l/wheel/flashinfer/data/csrc/nv_internal/include 2026-04-14T21:02:58,852 creating build/bdist.linux-armv7l/wheel/flashinfer/data/csrc/nv_internal/include/tensorrt_llm 2026-04-14T21:02:58,854 creating build/bdist.linux-armv7l/wheel/flashinfer/data/csrc/nv_internal/include/tensorrt_llm/common 2026-04-14T21:02:58,855 copying build/lib/flashinfer/data/csrc/nv_internal/include/tensorrt_llm/common/cudaUtils.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/include/tensorrt_llm/common 2026-04-14T21:02:58,859 copying build/lib/flashinfer/data/csrc/nv_internal/include/tensorrt_llm/common/config.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/include/tensorrt_llm/common 2026-04-14T21:02:58,861 copying build/lib/flashinfer/data/csrc/nv_internal/include/tensorrt_llm/common/logger.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/include/tensorrt_llm/common 2026-04-14T21:02:58,864 copying build/lib/flashinfer/data/csrc/nv_internal/include/tensorrt_llm/common/quantization.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/include/tensorrt_llm/common 2026-04-14T21:02:58,866 copying build/lib/flashinfer/data/csrc/nv_internal/include/tensorrt_llm/common/cudaBf16Wrapper.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/include/tensorrt_llm/common 2026-04-14T21:02:58,868 copying build/lib/flashinfer/data/csrc/nv_internal/include/tensorrt_llm/common/NvInferRuntime.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/include/tensorrt_llm/common 2026-04-14T21:02:58,871 copying build/lib/flashinfer/data/csrc/nv_internal/include/tensorrt_llm/common/cudaFp8Utils.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/include/tensorrt_llm/common 2026-04-14T21:02:58,873 copying build/lib/flashinfer/data/csrc/nv_internal/include/tensorrt_llm/common/assert.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/include/tensorrt_llm/common 2026-04-14T21:02:58,876 copying build/lib/flashinfer/data/csrc/nv_internal/include/tensorrt_llm/common/stringUtils.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/include/tensorrt_llm/common 2026-04-14T21:02:58,878 copying build/lib/flashinfer/data/csrc/nv_internal/include/tensorrt_llm/common/dataType.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/include/tensorrt_llm/common 2026-04-14T21:02:58,881 copying build/lib/flashinfer/data/csrc/nv_internal/include/tensorrt_llm/common/tllmException.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/nv_internal/include/tensorrt_llm/common 2026-04-14T21:02:58,883 copying build/lib/flashinfer/data/csrc/cascade.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-14T21:02:58,885 copying build/lib/flashinfer/data/csrc/single_prefill_fp8_sm90.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-14T21:02:58,888 copying build/lib/flashinfer/data/csrc/topk.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-14T21:02:58,891 copying build/lib/flashinfer/data/csrc/single_decode_kernel_inst.jinja -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-14T21:02:58,893 copying build/lib/flashinfer/data/csrc/single_prefill_jit_binding.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-14T21:02:58,895 copying build/lib/flashinfer/data/csrc/group_gemm_fp8_groupwise_sm120.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-14T21:02:58,898 copying build/lib/flashinfer/data/csrc/fp8_gemm_cutlass.jinja -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-14T21:02:58,900 copying build/lib/flashinfer/data/csrc/batch_mla_binding.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-14T21:02:58,902 copying build/lib/flashinfer/data/csrc/bf16_gemm_cutlass.jinja -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-14T21:02:58,905 copying build/lib/flashinfer/data/csrc/gdn_prefill_sm90_kernel_inst.jinja -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-14T21:02:58,907 copying build/lib/flashinfer/data/csrc/trtllm_moe_allreduce_fusion.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-14T21:02:58,909 copying build/lib/flashinfer/data/csrc/batch_prefill_sm90.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-14T21:02:58,912 copying build/lib/flashinfer/data/csrc/seq_chunk_cumsum_jit_binding.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-14T21:02:58,914 copying build/lib/flashinfer/data/csrc/dsv3_router_gemm.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-14T21:02:58,917 copying build/lib/flashinfer/data/csrc/flat_prefill_kernel_delta_rule_sm90_extern.inc -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-14T21:02:58,919 copying build/lib/flashinfer/data/csrc/fp4_gemm_cutlass_sm103.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-14T21:02:58,921 copying build/lib/flashinfer/data/csrc/mxfp8_gemm_cutlass.jinja -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-14T21:02:58,923 copying build/lib/flashinfer/data/csrc/single_decode.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-14T21:02:58,926 copying build/lib/flashinfer/data/csrc/flashinfer_sampling_binding.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-14T21:02:58,928 copying build/lib/flashinfer/data/csrc/group_gemm_fp8_groupwise_sm100_kernel_inst.jinja -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-14T21:02:58,930 copying build/lib/flashinfer/data/csrc/sampling.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-14T21:02:58,933 copying build/lib/flashinfer/data/csrc/group_gemm_mxfp4_groupwise_sm100_kernel_inst.jinja -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-14T21:02:58,935 copying build/lib/flashinfer/data/csrc/batch_prefill.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-14T21:02:58,938 copying build/lib/flashinfer/data/csrc/fp4_gemm_cutlass_sm103.jinja -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-14T21:02:58,941 copying build/lib/flashinfer/data/csrc/group_gemm_nvfp4_groupwise_sm120.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-14T21:02:58,943 copying build/lib/flashinfer/data/csrc/flashinfer_gemm_sm90_binding.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-14T21:02:58,946 copying build/lib/flashinfer/data/csrc/batch_decode.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-14T21:02:58,948 copying build/lib/flashinfer/data/csrc/batch_prefill_ragged_kernel_inst.jinja -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-14T21:02:58,950 copying build/lib/flashinfer/data/csrc/trtllm_allreduce.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-14T21:02:58,952 copying build/lib/flashinfer/data/csrc/batch_attention_customize_config.jinja -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-14T21:02:58,955 copying build/lib/flashinfer/data/csrc/gemm_groupwise_sm120.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-14T21:02:58,957 copying build/lib/flashinfer/data/csrc/flashinfer_mla_binding.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-14T21:02:58,959 copying build/lib/flashinfer/data/csrc/fp4_gemm_cutlass_sm120.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-14T21:02:58,961 copying build/lib/flashinfer/data/csrc/trtllm_gemm_runner.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-14T21:02:58,964 copying build/lib/flashinfer/data/csrc/single_prefill_sm90_jit_binding.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-14T21:02:58,966 copying build/lib/flashinfer/data/csrc/flashinfer_norm_binding.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-14T21:02:58,968 copying build/lib/flashinfer/data/csrc/fp4_kv_dequantization.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-14T21:02:58,970 copying build/lib/flashinfer/data/csrc/tinygemm2.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-14T21:02:58,973 copying build/lib/flashinfer/data/csrc/gemm_groupwise_sm120_kernel_inst.jinja -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-14T21:02:58,975 copying build/lib/flashinfer/data/csrc/fp8_gemm_cutlass.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-14T21:02:58,977 copying build/lib/flashinfer/data/csrc/group_gemm_mxfp4_groupwise_sm120_kernel_inst.jinja -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-14T21:02:58,979 copying build/lib/flashinfer/data/csrc/logging.cc -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-14T21:02:58,981 copying build/lib/flashinfer/data/csrc/group_gemm.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-14T21:02:58,983 copying build/lib/flashinfer/data/csrc/trtllm_fmha_v2_binding.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-14T21:02:58,986 copying build/lib/flashinfer/data/csrc/batch_attention.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-14T21:02:58,988 copying build/lib/flashinfer/data/csrc/trtllm_alltoall.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-14T21:02:58,991 copying build/lib/flashinfer/data/csrc/gemm_groupwise_sm100_kernel_inst.jinja -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-14T21:02:58,993 copying build/lib/flashinfer/data/csrc/fmha_v2_jit_binding.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-14T21:02:58,996 creating build/bdist.linux-armv7l/wheel/flashinfer/data/csrc/fmha_v2 2026-04-14T21:02:58,997 copying build/lib/flashinfer/data/csrc/fmha_v2/fused_multihead_flash_attention_kernel.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/fmha_v2 2026-04-14T21:02:59,000 copying build/lib/flashinfer/data/csrc/fmha_v2/fused_multihead_attention_kernel_4xN_hopper_noloop.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/fmha_v2 2026-04-14T21:02:59,002 copying build/lib/flashinfer/data/csrc/fmha_v2/fused_multihead_attention_kernel_4x1_hopper.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/fmha_v2 2026-04-14T21:02:59,005 copying build/lib/flashinfer/data/csrc/fmha_v2/fused_multihead_attention_kernel_4xN_hopper.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/fmha_v2 2026-04-14T21:02:59,008 copying build/lib/flashinfer/data/csrc/fmha_v2/fused_multihead_attention_utils.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/fmha_v2 2026-04-14T21:02:59,011 copying build/lib/flashinfer/data/csrc/fmha_v2/fused_multihead_cross_attention.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/fmha_v2 2026-04-14T21:02:59,013 copying build/lib/flashinfer/data/csrc/fmha_v2/fused_multihead_flash_attention_kernel_noloop_tiled.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/fmha_v2 2026-04-14T21:02:59,016 copying build/lib/flashinfer/data/csrc/fmha_v2/fused_multihead_attention_kernel_1xN_multi_cta.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/fmha_v2 2026-04-14T21:02:59,018 copying build/lib/flashinfer/data/csrc/fmha_v2/fused_multihead_flash_attention_kernel_noloop.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/fmha_v2 2026-04-14T21:02:59,021 copying build/lib/flashinfer/data/csrc/fmha_v2/fused_multihead_cross_attention_kernel_1xN_noloop.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/fmha_v2 2026-04-14T21:02:59,024 copying build/lib/flashinfer/data/csrc/fmha_v2/fused_multihead_attention_kernel.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/fmha_v2 2026-04-14T21:02:59,026 copying build/lib/flashinfer/data/csrc/fmha_v2/fused_multihead_attention_kernel_1xN_noloop.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/fmha_v2 2026-04-14T21:02:59,028 copying build/lib/flashinfer/data/csrc/fmha_v2/fused_multihead_attention_kernel_1xN.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/fmha_v2 2026-04-14T21:02:59,031 copying build/lib/flashinfer/data/csrc/fmha_v2/fused_multihead_attention_kernel_4x1_hopper_noloop.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/fmha_v2 2026-04-14T21:02:59,034 creating build/bdist.linux-armv7l/wheel/flashinfer/data/csrc/fmha_v2/templates 2026-04-14T21:02:59,035 copying build/lib/flashinfer/data/csrc/fmha_v2/templates/kernel.jinja -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/fmha_v2/templates 2026-04-14T21:02:59,038 copying build/lib/flashinfer/data/csrc/fmha_v2/templates/kernel_hopper.jinja -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/fmha_v2/templates 2026-04-14T21:02:59,040 copying build/lib/flashinfer/data/csrc/fmha_v2/templates/fa_kernel.jinja -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/fmha_v2/templates 2026-04-14T21:02:59,043 copying build/lib/flashinfer/data/csrc/fmha_v2/templates/kernel_hopper_ws.jinja -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/fmha_v2/templates 2026-04-14T21:02:59,045 copying build/lib/flashinfer/data/csrc/fmha_v2/fused_multihead_attention_demo_bert_params.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/fmha_v2 2026-04-14T21:02:59,048 copying build/lib/flashinfer/data/csrc/fmha_v2/fused_multihead_attention_kernel_2x2.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/fmha_v2 2026-04-14T21:02:59,050 creating build/bdist.linux-armv7l/wheel/flashinfer/data/csrc/fmha_v2/fmha 2026-04-14T21:02:59,052 copying build/lib/flashinfer/data/csrc/fmha_v2/fmha/paged_kv_cache.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/fmha_v2/fmha 2026-04-14T21:02:59,054 copying build/lib/flashinfer/data/csrc/fmha_v2/fmha/softmax.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/fmha_v2/fmha 2026-04-14T21:02:59,059 copying build/lib/flashinfer/data/csrc/fmha_v2/fmha/gmem_tile_o_packed.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/fmha_v2/fmha 2026-04-14T21:02:59,063 copying build/lib/flashinfer/data/csrc/fmha_v2/fmha/gemm.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/fmha_v2/fmha 2026-04-14T21:02:59,065 copying build/lib/flashinfer/data/csrc/fmha_v2/fmha/gmem_tile_ps.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/fmha_v2/fmha 2026-04-14T21:02:59,068 copying build/lib/flashinfer/data/csrc/fmha_v2/fmha/smem_tile.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/fmha_v2/fmha 2026-04-14T21:02:59,072 copying build/lib/flashinfer/data/csrc/fmha_v2/fmha/gmem_tile_o.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/fmha_v2/fmha 2026-04-14T21:02:59,075 copying build/lib/flashinfer/data/csrc/fmha_v2/fmha/fragment.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/fmha_v2/fmha 2026-04-14T21:02:59,079 copying build/lib/flashinfer/data/csrc/fmha_v2/fmha/utils.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/fmha_v2/fmha 2026-04-14T21:02:59,083 copying build/lib/flashinfer/data/csrc/fmha_v2/fmha/kernel_traits.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/fmha_v2/fmha 2026-04-14T21:02:59,086 copying build/lib/flashinfer/data/csrc/fmha_v2/fmha/mask.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/fmha_v2/fmha 2026-04-14T21:02:59,089 creating build/bdist.linux-armv7l/wheel/flashinfer/data/csrc/fmha_v2/fmha/hopper 2026-04-14T21:02:59,090 copying build/lib/flashinfer/data/csrc/fmha_v2/fmha/hopper/gmem_tile_o_packed.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/fmha_v2/fmha/hopper 2026-04-14T21:02:59,094 copying build/lib/flashinfer/data/csrc/fmha_v2/fmha/hopper/smem_tile.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/fmha_v2/fmha/hopper 2026-04-14T21:02:59,098 copying build/lib/flashinfer/data/csrc/fmha_v2/fmha/hopper/utils_hgmma.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/fmha_v2/fmha/hopper 2026-04-14T21:02:59,101 copying build/lib/flashinfer/data/csrc/fmha_v2/fmha/hopper/fragment.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/fmha_v2/fmha/hopper 2026-04-14T21:02:59,104 copying build/lib/flashinfer/data/csrc/fmha_v2/fmha/hopper/gmma_descriptor.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/fmha_v2/fmha/hopper 2026-04-14T21:02:59,107 copying build/lib/flashinfer/data/csrc/fmha_v2/fmha/hopper/compute_tile.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/fmha_v2/fmha/hopper 2026-04-14T21:02:59,109 copying build/lib/flashinfer/data/csrc/fmha_v2/fmha/hopper/kernel_traits.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/fmha_v2/fmha/hopper 2026-04-14T21:02:59,112 copying build/lib/flashinfer/data/csrc/fmha_v2/fmha/hopper/utils_gmma.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/fmha_v2/fmha/hopper 2026-04-14T21:02:59,113 copying build/lib/flashinfer/data/csrc/fmha_v2/fmha/hopper/tma_descriptor.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/fmha_v2/fmha/hopper 2026-04-14T21:02:59,116 copying build/lib/flashinfer/data/csrc/fmha_v2/fmha/hopper/gmem_tile_qkv_packed.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/fmha_v2/fmha/hopper 2026-04-14T21:02:59,118 copying build/lib/flashinfer/data/csrc/fmha_v2/fmha/hopper/utils_hgmma_bf16.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/fmha_v2/fmha/hopper 2026-04-14T21:02:59,121 copying build/lib/flashinfer/data/csrc/fmha_v2/fmha/hopper/utils_tma.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/fmha_v2/fmha/hopper 2026-04-14T21:02:59,123 copying build/lib/flashinfer/data/csrc/fmha_v2/fmha/hopper/utils_igmma.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/fmha_v2/fmha/hopper 2026-04-14T21:02:59,126 copying build/lib/flashinfer/data/csrc/fmha_v2/fmha/hopper/arrive_wait.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/fmha_v2/fmha/hopper 2026-04-14T21:02:59,128 copying build/lib/flashinfer/data/csrc/fmha_v2/fmha/hopper/utils_qgmma.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/fmha_v2/fmha/hopper 2026-04-14T21:02:59,132 copying build/lib/flashinfer/data/csrc/fmha_v2/fmha/hopper/tma_types.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/fmha_v2/fmha/hopper 2026-04-14T21:02:59,135 copying build/lib/flashinfer/data/csrc/fmha_v2/fmha/hopper/smem_tile_o.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/fmha_v2/fmha/hopper 2026-04-14T21:02:59,137 copying build/lib/flashinfer/data/csrc/fmha_v2/fmha/hopper/utils_warpgroup.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/fmha_v2/fmha/hopper 2026-04-14T21:02:59,139 copying build/lib/flashinfer/data/csrc/fmha_v2/fmha/traits.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/fmha_v2/fmha 2026-04-14T21:02:59,142 copying build/lib/flashinfer/data/csrc/fmha_v2/fmha/gmem_tile_qkv_packed.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/fmha_v2/fmha 2026-04-14T21:02:59,145 copying build/lib/flashinfer/data/csrc/fmha_v2/fmha/numeric_types.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/fmha_v2/fmha 2026-04-14T21:02:59,147 copying build/lib/flashinfer/data/csrc/fmha_v2/fmha/alibi_params.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/fmha_v2/fmha 2026-04-14T21:02:59,149 creating build/bdist.linux-armv7l/wheel/flashinfer/data/csrc/fmha_v2/fmha/warpspec 2026-04-14T21:02:59,150 copying build/lib/flashinfer/data/csrc/fmha_v2/fmha/warpspec/circular_buffer.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/fmha_v2/fmha/warpspec 2026-04-14T21:02:59,153 copying build/lib/flashinfer/data/csrc/fmha_v2/fmha/warpspec/kernel_traits.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/fmha_v2/fmha/warpspec 2026-04-14T21:02:59,156 copying build/lib/flashinfer/data/csrc/fmha_v2/fmha/warpspec/compute.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/fmha_v2/fmha/warpspec 2026-04-14T21:02:59,159 copying build/lib/flashinfer/data/csrc/fmha_v2/fmha/warpspec/epilogue.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/fmha_v2/fmha/warpspec 2026-04-14T21:02:59,162 copying build/lib/flashinfer/data/csrc/fmha_v2/fmha/warpspec/dma.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/fmha_v2/fmha/warpspec 2026-04-14T21:02:59,165 copying build/lib/flashinfer/data/csrc/fmha_v2/fmha/gmem_tile_qkv.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/fmha_v2/fmha 2026-04-14T21:02:59,167 copying build/lib/flashinfer/data/csrc/fmha_v2/fmha/smem_tile_v.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/fmha_v2/fmha 2026-04-14T21:02:59,170 copying build/lib/flashinfer/data/csrc/fmha_v2/fmha/smem_tile_o.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/fmha_v2/fmha 2026-04-14T21:02:59,173 copying build/lib/flashinfer/data/csrc/fmha_v2/fmha/smem_tile_qkv.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/fmha_v2/fmha 2026-04-14T21:02:59,175 copying build/lib/flashinfer/data/csrc/fmha_v2/fused_multihead_attention.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/fmha_v2 2026-04-14T21:02:59,178 copying build/lib/flashinfer/data/csrc/fmha_v2/fused_multihead_cross_attention_kernel_1xN.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc/fmha_v2 2026-04-14T21:02:59,180 copying build/lib/flashinfer/data/csrc/gemm_groupwise_sm100.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-14T21:02:59,183 copying build/lib/flashinfer/data/csrc/pod_jit_binding.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-14T21:02:59,185 copying build/lib/flashinfer/data/csrc/single_prefill_sm90.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-14T21:02:59,187 copying build/lib/flashinfer/data/csrc/mxfp8_gemm_cutlass_sm120.jinja -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-14T21:02:59,189 copying build/lib/flashinfer/data/csrc/selective_state_update_customize_config.jinja -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-14T21:02:59,191 copying build/lib/flashinfer/data/csrc/batch_mla_plan.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-14T21:02:59,193 copying build/lib/flashinfer/data/csrc/tgv_gemm.jinja -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-14T21:02:59,195 copying build/lib/flashinfer/data/csrc/batch_prefill_sm90_customize_config.jinja -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-14T21:02:59,197 copying build/lib/flashinfer/data/csrc/trtllm_fmha_kernel_launcher.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-14T21:02:59,200 copying build/lib/flashinfer/data/csrc/trtllm_allreduce_fusion.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-14T21:02:59,203 copying build/lib/flashinfer/data/csrc/batch_prefill_paged_kernel_inst.jinja -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-14T21:02:59,205 copying build/lib/flashinfer/data/csrc/flashinfer_mamba_binding.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-14T21:02:59,207 copying build/lib/flashinfer/data/csrc/fmha_cutlass_sm100.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-14T21:02:59,209 copying build/lib/flashinfer/data/csrc/group_gemm_fp8_groupwise_sm100.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-14T21:02:59,211 copying build/lib/flashinfer/data/csrc/fmha_cutlass_sm100_binding.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-14T21:02:59,213 copying build/lib/flashinfer/data/csrc/batch_decode_mla_run.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-14T21:02:59,216 copying build/lib/flashinfer/data/csrc/single_prefill_kernel_inst.jinja -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-14T21:02:59,218 copying build/lib/flashinfer/data/csrc/batch_prefill_fp8_sm90.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-14T21:02:59,220 copying build/lib/flashinfer/data/csrc/fp4_gemm_cutlass.jinja -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-14T21:02:59,222 copying build/lib/flashinfer/data/csrc/quantization.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-14T21:02:59,225 copying build/lib/flashinfer/data/csrc/group_gemm_fp8_groupwise_sm120_kernel_inst.jinja -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-14T21:02:59,227 copying build/lib/flashinfer/data/csrc/fp4_gemm_cutlass.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-14T21:02:59,230 copying build/lib/flashinfer/data/csrc/batch_decode_mla_binding.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-14T21:02:59,232 copying build/lib/flashinfer/data/csrc/pod_customize_config.jinja -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-14T21:02:59,234 copying build/lib/flashinfer/data/csrc/cudnn_sdpa_kernel_launcher.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-14T21:02:59,238 copying build/lib/flashinfer/data/csrc/trtllm_mnnvl_allreduce.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-14T21:02:59,241 copying build/lib/flashinfer/data/csrc/cudnn_sdpa_utils.h -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-14T21:02:59,243 copying build/lib/flashinfer/data/csrc/batch_decode_jit_binding.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-14T21:02:59,245 copying build/lib/flashinfer/data/csrc/fp4_kv_quantization.cu -> build/bdist.linux-armv7l/wheel/./flashinfer/data/csrc 2026-04-14T21:02:59,249 copying build/lib/flashinfer/trtllm_low_latency_gemm.py -> build/bdist.linux-armv7l/wheel/./flashinfer 2026-04-14T21:02:59,252 creating build/bdist.linux-armv7l/wheel/flashinfer/gemm 2026-04-14T21:02:59,253 copying build/lib/flashinfer/gemm/__init__.py -> build/bdist.linux-armv7l/wheel/./flashinfer/gemm 2026-04-14T21:02:59,256 copying build/lib/flashinfer/gemm/gemm_base.py -> build/bdist.linux-armv7l/wheel/./flashinfer/gemm 2026-04-14T21:02:59,263 copying build/lib/flashinfer/gemm/routergemm.py -> build/bdist.linux-armv7l/wheel/./flashinfer/gemm 2026-04-14T21:02:59,266 creating build/bdist.linux-armv7l/wheel/flashinfer/gemm/kernels 2026-04-14T21:02:59,267 copying build/lib/flashinfer/gemm/kernels/grouped_gemm_masked_blackwell.py -> build/bdist.linux-armv7l/wheel/./flashinfer/gemm/kernels 2026-04-14T21:02:59,273 copying build/lib/flashinfer/gemm/kernels/__init__.py -> build/bdist.linux-armv7l/wheel/./flashinfer/gemm/kernels 2026-04-14T21:02:59,275 copying build/lib/flashinfer/gemm/kernels/dense_blockscaled_gemm_sm100.py -> build/bdist.linux-armv7l/wheel/./flashinfer/gemm/kernels 2026-04-14T21:02:59,280 copying build/lib/flashinfer/gemm/kernels/dense_blockscaled_gemm_sm103.py -> build/bdist.linux-armv7l/wheel/./flashinfer/gemm/kernels 2026-04-14T21:02:59,285 creating build/bdist.linux-armv7l/wheel/flashinfer/profiler 2026-04-14T21:02:59,286 copying build/lib/flashinfer/profiler/__init__.py -> build/bdist.linux-armv7l/wheel/./flashinfer/profiler 2026-04-14T21:02:59,289 copying build/lib/flashinfer/autotuner.py -> build/bdist.linux-armv7l/wheel/./flashinfer 2026-04-14T21:02:59,293 copying build/lib/flashinfer/cuda_utils.py -> build/bdist.linux-armv7l/wheel/./flashinfer 2026-04-14T21:02:59,295 copying build/lib/flashinfer/fp4_quantization.py -> build/bdist.linux-armv7l/wheel/./flashinfer 2026-04-14T21:02:59,297 copying build/lib/flashinfer/prefill.py -> build/bdist.linux-armv7l/wheel/./flashinfer 2026-04-14T21:02:59,303 creating build/bdist.linux-armv7l/wheel/flashinfer/comm 2026-04-14T21:02:59,304 copying build/lib/flashinfer/comm/allreduce.py -> build/bdist.linux-armv7l/wheel/./flashinfer/comm 2026-04-14T21:02:59,307 copying build/lib/flashinfer/comm/dlpack_utils.py -> build/bdist.linux-armv7l/wheel/./flashinfer/comm 2026-04-14T21:02:59,310 copying build/lib/flashinfer/comm/mapping.py -> build/bdist.linux-armv7l/wheel/./flashinfer/comm 2026-04-14T21:02:59,313 copying build/lib/flashinfer/comm/trtllm_moe_alltoall.py -> build/bdist.linux-armv7l/wheel/./flashinfer/comm 2026-04-14T21:02:59,316 copying build/lib/flashinfer/comm/__init__.py -> build/bdist.linux-armv7l/wheel/./flashinfer/comm 2026-04-14T21:02:59,319 copying build/lib/flashinfer/comm/vllm_ar.py -> build/bdist.linux-armv7l/wheel/./flashinfer/comm 2026-04-14T21:02:59,321 copying build/lib/flashinfer/comm/workspace_base.py -> build/bdist.linux-armv7l/wheel/./flashinfer/comm 2026-04-14T21:02:59,324 copying build/lib/flashinfer/comm/trtllm_ar.py -> build/bdist.linux-armv7l/wheel/./flashinfer/comm 2026-04-14T21:02:59,327 copying build/lib/flashinfer/comm/nvshmem.py -> build/bdist.linux-armv7l/wheel/./flashinfer/comm 2026-04-14T21:02:59,329 copying build/lib/flashinfer/comm/mnnvl.py -> build/bdist.linux-armv7l/wheel/./flashinfer/comm 2026-04-14T21:02:59,333 copying build/lib/flashinfer/comm/nvshmem_allreduce.py -> build/bdist.linux-armv7l/wheel/./flashinfer/comm 2026-04-14T21:02:59,335 copying build/lib/flashinfer/comm/trtllm_mnnvl_ar.py -> build/bdist.linux-armv7l/wheel/./flashinfer/comm 2026-04-14T21:02:59,339 copying build/lib/flashinfer/comm/cuda_ipc.py -> build/bdist.linux-armv7l/wheel/./flashinfer/comm 2026-04-14T21:02:59,342 copying build/lib/flashinfer/comm/trtllm_alltoall.py -> build/bdist.linux-armv7l/wheel/./flashinfer/comm 2026-04-14T21:02:59,345 copying build/lib/flashinfer/concat_ops.py -> build/bdist.linux-armv7l/wheel/./flashinfer 2026-04-14T21:02:59,348 creating build/bdist.linux-armv7l/wheel/flashinfer/triton 2026-04-14T21:02:59,349 copying build/lib/flashinfer/triton/norm.py -> build/bdist.linux-armv7l/wheel/./flashinfer/triton 2026-04-14T21:02:59,351 copying build/lib/flashinfer/triton/__init__.py -> build/bdist.linux-armv7l/wheel/./flashinfer/triton 2026-04-14T21:02:59,353 copying build/lib/flashinfer/triton/gemm.py -> build/bdist.linux-armv7l/wheel/./flashinfer/triton 2026-04-14T21:02:59,355 copying build/lib/flashinfer/triton/sm_constraint_gemm.py -> build/bdist.linux-armv7l/wheel/./flashinfer/triton 2026-04-14T21:02:59,357 creating build/bdist.linux-armv7l/wheel/flashinfer/triton/kernels 2026-04-14T21:02:59,358 copying build/lib/flashinfer/triton/kernels/norm.py -> build/bdist.linux-armv7l/wheel/./flashinfer/triton/kernels 2026-04-14T21:02:59,361 copying build/lib/flashinfer/triton/kernels/ssd_chunk_state.py -> build/bdist.linux-armv7l/wheel/./flashinfer/triton/kernels 2026-04-14T21:02:59,363 copying build/lib/flashinfer/triton/kernels/__init__.py -> build/bdist.linux-armv7l/wheel/./flashinfer/triton/kernels 2026-04-14T21:02:59,364 copying build/lib/flashinfer/triton/kernels/sm_constraint_gemm.py -> build/bdist.linux-armv7l/wheel/./flashinfer/triton/kernels 2026-04-14T21:02:59,367 copying build/lib/flashinfer/triton/kernels/activation.py -> build/bdist.linux-armv7l/wheel/./flashinfer/triton/kernels 2026-04-14T21:02:59,369 copying build/lib/flashinfer/triton/kernels/quant.py -> build/bdist.linux-armv7l/wheel/./flashinfer/triton/kernels 2026-04-14T21:02:59,370 copying build/lib/flashinfer/triton/kernels/cascade.py -> build/bdist.linux-armv7l/wheel/./flashinfer/triton/kernels 2026-04-14T21:02:59,373 copying build/lib/flashinfer/triton/page.py -> build/bdist.linux-armv7l/wheel/./flashinfer/triton 2026-04-14T21:02:59,374 copying build/lib/flashinfer/triton/activation.py -> build/bdist.linux-armv7l/wheel/./flashinfer/triton 2026-04-14T21:02:59,376 copying build/lib/flashinfer/triton/cascade.py -> build/bdist.linux-armv7l/wheel/./flashinfer/triton 2026-04-14T21:02:59,378 copying build/lib/flashinfer/triton/utils.py -> build/bdist.linux-armv7l/wheel/./flashinfer/triton 2026-04-14T21:02:59,380 copying build/lib/flashinfer/page.py -> build/bdist.linux-armv7l/wheel/./flashinfer 2026-04-14T21:02:59,383 copying build/lib/flashinfer/rope.py -> build/bdist.linux-armv7l/wheel/./flashinfer 2026-04-14T21:02:59,386 copying build/lib/flashinfer/activation.py -> build/bdist.linux-armv7l/wheel/./flashinfer 2026-04-14T21:02:59,388 copying build/lib/flashinfer/artifacts.py -> build/bdist.linux-armv7l/wheel/./flashinfer 2026-04-14T21:02:59,391 creating build/bdist.linux-armv7l/wheel/flashinfer/mla 2026-04-14T21:02:59,392 copying build/lib/flashinfer/mla/__init__.py -> build/bdist.linux-armv7l/wheel/./flashinfer/mla 2026-04-14T21:02:59,394 copying build/lib/flashinfer/mla/_core.py -> build/bdist.linux-armv7l/wheel/./flashinfer/mla 2026-04-14T21:02:59,398 copying build/lib/flashinfer/topk.py -> build/bdist.linux-armv7l/wheel/./flashinfer 2026-04-14T21:02:59,401 copying build/lib/flashinfer/green_ctx.py -> build/bdist.linux-armv7l/wheel/./flashinfer 2026-04-14T21:02:59,404 creating build/bdist.linux-armv7l/wheel/flashinfer/dsv3_ops 2026-04-14T21:02:59,405 copying build/lib/flashinfer/dsv3_ops/__init__.py -> build/bdist.linux-armv7l/wheel/./flashinfer/dsv3_ops 2026-04-14T21:02:59,408 copying build/lib/flashinfer/cascade.py -> build/bdist.linux-armv7l/wheel/./flashinfer 2026-04-14T21:02:59,411 copying build/lib/flashinfer/aot.py -> build/bdist.linux-armv7l/wheel/./flashinfer 2026-04-14T21:02:59,415 creating build/bdist.linux-armv7l/wheel/flashinfer/mamba 2026-04-14T21:02:59,416 copying build/lib/flashinfer/mamba/__init__.py -> build/bdist.linux-armv7l/wheel/./flashinfer/mamba 2026-04-14T21:02:59,419 copying build/lib/flashinfer/mamba/ssd_tile_scheduler.py -> build/bdist.linux-armv7l/wheel/./flashinfer/mamba 2026-04-14T21:02:59,422 copying build/lib/flashinfer/mamba/ssd_kernel.py -> build/bdist.linux-armv7l/wheel/./flashinfer/mamba 2026-04-14T21:02:59,428 copying build/lib/flashinfer/mamba/selective_state_update.py -> build/bdist.linux-armv7l/wheel/./flashinfer/mamba 2026-04-14T21:02:59,431 copying build/lib/flashinfer/mamba/ssd_combined.py -> build/bdist.linux-armv7l/wheel/./flashinfer/mamba 2026-04-14T21:02:59,436 creating build/bdist.linux-armv7l/wheel/flashinfer/jit 2026-04-14T21:02:59,438 copying build/lib/flashinfer/jit/sampling.py -> build/bdist.linux-armv7l/wheel/./flashinfer/jit 2026-04-14T21:02:59,440 copying build/lib/flashinfer/jit/norm.py -> build/bdist.linux-armv7l/wheel/./flashinfer/jit 2026-04-14T21:02:59,442 copying build/lib/flashinfer/jit/comm.py -> build/bdist.linux-armv7l/wheel/./flashinfer/jit 2026-04-14T21:02:59,444 copying build/lib/flashinfer/jit/fp4_kv_quantization.py -> build/bdist.linux-armv7l/wheel/./flashinfer/jit 2026-04-14T21:02:59,447 copying build/lib/flashinfer/jit/__init__.py -> build/bdist.linux-armv7l/wheel/./flashinfer/jit 2026-04-14T21:02:59,449 copying build/lib/flashinfer/jit/fused_moe.py -> build/bdist.linux-armv7l/wheel/./flashinfer/jit 2026-04-14T21:02:59,452 copying build/lib/flashinfer/jit/fp8_quantization.py -> build/bdist.linux-armv7l/wheel/./flashinfer/jit 2026-04-14T21:02:59,455 copying build/lib/flashinfer/jit/mla.py -> build/bdist.linux-armv7l/wheel/./flashinfer/jit 2026-04-14T21:02:59,457 copying build/lib/flashinfer/jit/quantization.py -> build/bdist.linux-armv7l/wheel/./flashinfer/jit 2026-04-14T21:02:59,459 copying build/lib/flashinfer/jit/xqa.py -> build/bdist.linux-armv7l/wheel/./flashinfer/jit 2026-04-14T21:02:59,462 copying build/lib/flashinfer/jit/gdn.py -> build/bdist.linux-armv7l/wheel/./flashinfer/jit 2026-04-14T21:02:59,464 copying build/lib/flashinfer/jit/spdlog.py -> build/bdist.linux-armv7l/wheel/./flashinfer/jit 2026-04-14T21:02:59,467 creating build/bdist.linux-armv7l/wheel/flashinfer/jit/gemm 2026-04-14T21:02:59,469 copying build/lib/flashinfer/jit/gemm/fp8_blockscale.py -> build/bdist.linux-armv7l/wheel/./flashinfer/jit/gemm 2026-04-14T21:02:59,472 copying build/lib/flashinfer/jit/gemm/__init__.py -> build/bdist.linux-armv7l/wheel/./flashinfer/jit/gemm 2026-04-14T21:02:59,474 copying build/lib/flashinfer/jit/gemm/deepgemm.py -> build/bdist.linux-armv7l/wheel/./flashinfer/jit/gemm 2026-04-14T21:02:59,477 creating build/bdist.linux-armv7l/wheel/flashinfer/jit/gemm/cutlass 2026-04-14T21:02:59,478 copying build/lib/flashinfer/jit/gemm/cutlass/cutlass_library.py -> build/bdist.linux-armv7l/wheel/./flashinfer/jit/gemm/cutlass 2026-04-14T21:02:59,482 copying build/lib/flashinfer/jit/gemm/cutlass/__init__.py -> build/bdist.linux-armv7l/wheel/./flashinfer/jit/gemm/cutlass 2026-04-14T21:02:59,484 copying build/lib/flashinfer/jit/gemm/cutlass/generate_kernels.py -> build/bdist.linux-armv7l/wheel/./flashinfer/jit/gemm/cutlass 2026-04-14T21:02:59,488 copying build/lib/flashinfer/jit/gemm/core.py -> build/bdist.linux-armv7l/wheel/./flashinfer/jit/gemm 2026-04-14T21:02:59,491 copying build/lib/flashinfer/jit/moe_utils.py -> build/bdist.linux-armv7l/wheel/./flashinfer/jit 2026-04-14T21:02:59,494 copying build/lib/flashinfer/jit/fp4_quantization.py -> build/bdist.linux-armv7l/wheel/./flashinfer/jit 2026-04-14T21:02:59,496 copying build/lib/flashinfer/jit/cpp_ext.py -> build/bdist.linux-armv7l/wheel/./flashinfer/jit 2026-04-14T21:02:59,498 copying build/lib/flashinfer/jit/page.py -> build/bdist.linux-armv7l/wheel/./flashinfer/jit 2026-04-14T21:02:59,500 copying build/lib/flashinfer/jit/fp4_kv_dequantization.py -> build/bdist.linux-armv7l/wheel/./flashinfer/jit 2026-04-14T21:02:59,501 copying build/lib/flashinfer/jit/rope.py -> build/bdist.linux-armv7l/wheel/./flashinfer/jit 2026-04-14T21:02:59,503 copying build/lib/flashinfer/jit/activation.py -> build/bdist.linux-armv7l/wheel/./flashinfer/jit 2026-04-14T21:02:59,505 copying build/lib/flashinfer/jit/env.py -> build/bdist.linux-armv7l/wheel/./flashinfer/jit 2026-04-14T21:02:59,507 copying build/lib/flashinfer/jit/topk.py -> build/bdist.linux-armv7l/wheel/./flashinfer/jit 2026-04-14T21:02:59,509 copying build/lib/flashinfer/jit/cascade.py -> build/bdist.linux-armv7l/wheel/./flashinfer/jit 2026-04-14T21:02:59,511 copying build/lib/flashinfer/jit/tinygemm2.py -> build/bdist.linux-armv7l/wheel/./flashinfer/jit 2026-04-14T21:02:59,513 creating build/bdist.linux-armv7l/wheel/flashinfer/jit/mamba 2026-04-14T21:02:59,514 copying build/lib/flashinfer/jit/mamba/__init__.py -> build/bdist.linux-armv7l/wheel/./flashinfer/jit/mamba 2026-04-14T21:02:59,516 copying build/lib/flashinfer/jit/mamba/seq_chunk_cumsum.py -> build/bdist.linux-armv7l/wheel/./flashinfer/jit/mamba 2026-04-14T21:02:59,518 copying build/lib/flashinfer/jit/mamba/selective_state_update.py -> build/bdist.linux-armv7l/wheel/./flashinfer/jit/mamba 2026-04-14T21:02:59,520 copying build/lib/flashinfer/jit/core.py -> build/bdist.linux-armv7l/wheel/./flashinfer/jit 2026-04-14T21:02:59,523 creating build/bdist.linux-armv7l/wheel/flashinfer/jit/attention 2026-04-14T21:02:59,524 copying build/lib/flashinfer/jit/attention/__init__.py -> build/bdist.linux-armv7l/wheel/./flashinfer/jit/attention 2026-04-14T21:02:59,526 copying build/lib/flashinfer/jit/attention/modules.py -> build/bdist.linux-armv7l/wheel/./flashinfer/jit/attention 2026-04-14T21:02:59,530 copying build/lib/flashinfer/jit/attention/variants.py -> build/bdist.linux-armv7l/wheel/./flashinfer/jit/attention 2026-04-14T21:02:59,532 creating build/bdist.linux-armv7l/wheel/flashinfer/jit/attention/fmha_v2 2026-04-14T21:02:59,533 copying build/lib/flashinfer/jit/attention/fmha_v2/generate_kernels.py -> build/bdist.linux-armv7l/wheel/./flashinfer/jit/attention/fmha_v2 2026-04-14T21:02:59,536 copying build/lib/flashinfer/jit/attention/fmha_v2/fmha_library.py -> build/bdist.linux-armv7l/wheel/./flashinfer/jit/attention/fmha_v2 2026-04-14T21:02:59,539 copying build/lib/flashinfer/jit/attention/fmha_v2/generator_utils.py -> build/bdist.linux-armv7l/wheel/./flashinfer/jit/attention/fmha_v2 2026-04-14T21:02:59,544 copying build/lib/flashinfer/jit/attention/fmha_v2/utils.py -> build/bdist.linux-armv7l/wheel/./flashinfer/jit/attention/fmha_v2 2026-04-14T21:02:59,547 copying build/lib/flashinfer/jit/attention/utils.py -> build/bdist.linux-armv7l/wheel/./flashinfer/jit/attention 2026-04-14T21:02:59,549 copying build/lib/flashinfer/jit/cubin_loader.py -> build/bdist.linux-armv7l/wheel/./flashinfer/jit 2026-04-14T21:02:59,551 copying build/lib/flashinfer/jit/tllm_utils.py -> build/bdist.linux-armv7l/wheel/./flashinfer/jit 2026-04-14T21:02:59,553 copying build/lib/flashinfer/jit/dsv3_optimizations.py -> build/bdist.linux-armv7l/wheel/./flashinfer/jit 2026-04-14T21:02:59,555 copying build/lib/flashinfer/jit/rmsnorm_silu.py -> build/bdist.linux-armv7l/wheel/./flashinfer/jit 2026-04-14T21:02:59,558 copying build/lib/flashinfer/jit/utils.py -> build/bdist.linux-armv7l/wheel/./flashinfer/jit 2026-04-14T21:02:59,560 copying build/lib/flashinfer/__main__.py -> build/bdist.linux-armv7l/wheel/./flashinfer 2026-04-14T21:02:59,562 copying build/lib/flashinfer/api_logging.py -> build/bdist.linux-armv7l/wheel/./flashinfer 2026-04-14T21:02:59,565 copying build/lib/flashinfer/attention.py -> build/bdist.linux-armv7l/wheel/./flashinfer 2026-04-14T21:02:59,568 copying build/lib/flashinfer/py.typed -> build/bdist.linux-armv7l/wheel/./flashinfer 2026-04-14T21:02:59,569 copying build/lib/flashinfer/tllm_utils.py -> build/bdist.linux-armv7l/wheel/./flashinfer 2026-04-14T21:02:59,571 copying build/lib/flashinfer/utils.py -> build/bdist.linux-armv7l/wheel/./flashinfer 2026-04-14T21:02:59,575 creating build/bdist.linux-armv7l/wheel/flashinfer/quantization 2026-04-14T21:02:59,576 copying build/lib/flashinfer/quantization/__init__.py -> build/bdist.linux-armv7l/wheel/./flashinfer/quantization 2026-04-14T21:02:59,578 copying build/lib/flashinfer/quantization/fp8_quantization.py -> build/bdist.linux-armv7l/wheel/./flashinfer/quantization 2026-04-14T21:02:59,580 creating build/bdist.linux-armv7l/wheel/flashinfer/quantization/kernels 2026-04-14T21:02:59,581 copying build/lib/flashinfer/quantization/kernels/mxfp4_quantize.py -> build/bdist.linux-armv7l/wheel/./flashinfer/quantization/kernels 2026-04-14T21:02:59,584 copying build/lib/flashinfer/quantization/kernels/__init__.py -> build/bdist.linux-armv7l/wheel/./flashinfer/quantization/kernels 2026-04-14T21:02:59,586 copying build/lib/flashinfer/quantization/kernels/nvfp4_quantize.py -> build/bdist.linux-armv7l/wheel/./flashinfer/quantization/kernels 2026-04-14T21:02:59,589 copying build/lib/flashinfer/quantization/kernels/mxfp8_quantize.py -> build/bdist.linux-armv7l/wheel/./flashinfer/quantization/kernels 2026-04-14T21:02:59,592 copying build/lib/flashinfer/quantization/fp4_quantization.py -> build/bdist.linux-armv7l/wheel/./flashinfer/quantization 2026-04-14T21:02:59,595 copying build/lib/flashinfer/quantization/packbits.py -> build/bdist.linux-armv7l/wheel/./flashinfer/quantization 2026-04-14T21:02:59,597 copying build/lib/flashinfer/quantization/quantization_cute_dsl_utils.py -> build/bdist.linux-armv7l/wheel/./flashinfer/quantization 2026-04-14T21:02:59,600 copying build/lib/build_utils.py -> build/bdist.linux-armv7l/wheel/. 2026-04-14T21:02:59,602 running install_egg_info 2026-04-14T21:02:59,614 running egg_info 2026-04-14T21:02:59,620 writing flashinfer_python.egg-info/PKG-INFO 2026-04-14T21:02:59,625 writing dependency_links to flashinfer_python.egg-info/dependency_links.txt 2026-04-14T21:02:59,627 writing entry points to flashinfer_python.egg-info/entry_points.txt 2026-04-14T21:02:59,629 writing requirements to flashinfer_python.egg-info/requires.txt 2026-04-14T21:02:59,631 writing top-level names to flashinfer_python.egg-info/top_level.txt 2026-04-14T21:03:00,425 reading manifest file 'flashinfer_python.egg-info/SOURCES.txt' 2026-04-14T21:03:00,544 adding license file 'LICENSE' 2026-04-14T21:03:00,667 writing manifest file 'flashinfer_python.egg-info/SOURCES.txt' 2026-04-14T21:03:00,673 Copying flashinfer_python.egg-info to build/bdist.linux-armv7l/wheel/./flashinfer_python-0.6.8rc1-py3.11.egg-info 2026-04-14T21:03:00,686 running install_scripts 2026-04-14T21:03:00,699 creating build/bdist.linux-armv7l/wheel/flashinfer_python-0.6.8rc1.dist-info/WHEEL 2026-04-14T21:03:00,702 creating '/tmp/pip-wheel-wd01vk05/.tmp-gk50xq_2/flashinfer_python-0.6.8rc1-py3-none-any.whl' and adding 'build/bdist.linux-armv7l/wheel' to it 2026-04-14T21:03:00,705 adding 'build_backend.py' 2026-04-14T21:03:00,706 adding 'build_utils.py' 2026-04-14T21:03:00,709 adding 'flashinfer/__init__.py' 2026-04-14T21:03:00,712 adding 'flashinfer/__main__.py' 2026-04-14T21:03:00,713 adding 'flashinfer/_build_meta.py' 2026-04-14T21:03:00,715 adding 'flashinfer/activation.py' 2026-04-14T21:03:00,719 adding 'flashinfer/aot.py' 2026-04-14T21:03:00,726 adding 'flashinfer/api_logging.py' 2026-04-14T21:03:00,728 adding 'flashinfer/artifacts.py' 2026-04-14T21:03:00,730 adding 'flashinfer/attention.py' 2026-04-14T21:03:00,737 adding 'flashinfer/autotuner.py' 2026-04-14T21:03:00,741 adding 'flashinfer/cascade.py' 2026-04-14T21:03:00,742 adding 'flashinfer/compilation_context.py' 2026-04-14T21:03:00,744 adding 'flashinfer/concat_ops.py' 2026-04-14T21:03:00,745 adding 'flashinfer/cuda_utils.py' 2026-04-14T21:03:00,756 adding 'flashinfer/decode.py' 2026-04-14T21:03:00,761 adding 'flashinfer/deep_gemm.py' 2026-04-14T21:03:00,763 adding 'flashinfer/fp4_quantization.py' 2026-04-14T21:03:00,764 adding 'flashinfer/fp8_quantization.py' 2026-04-14T21:03:00,767 adding 'flashinfer/gdn_decode.py' 2026-04-14T21:03:00,769 adding 'flashinfer/gdn_prefill.py' 2026-04-14T21:03:00,771 adding 'flashinfer/green_ctx.py' 2026-04-14T21:03:00,773 adding 'flashinfer/page.py' 2026-04-14T21:03:00,777 adding 'flashinfer/pod.py' 2026-04-14T21:03:00,794 adding 'flashinfer/prefill.py' 2026-04-14T21:03:00,796 adding 'flashinfer/py.typed' 2026-04-14T21:03:00,801 adding 'flashinfer/rope.py' 2026-04-14T21:03:00,807 adding 'flashinfer/sampling.py' 2026-04-14T21:03:00,812 adding 'flashinfer/sparse.py' 2026-04-14T21:03:00,814 adding 'flashinfer/tllm_enums.py' 2026-04-14T21:03:00,815 adding 'flashinfer/tllm_utils.py' 2026-04-14T21:03:00,817 adding 'flashinfer/topk.py' 2026-04-14T21:03:00,819 adding 'flashinfer/trtllm_low_latency_gemm.py' 2026-04-14T21:03:00,824 adding 'flashinfer/utils.py' 2026-04-14T21:03:00,825 adding 'flashinfer/version.py' 2026-04-14T21:03:00,828 adding 'flashinfer/xqa.py' 2026-04-14T21:03:00,830 adding 'flashinfer/comm/__init__.py' 2026-04-14T21:03:00,833 adding 'flashinfer/comm/allreduce.py' 2026-04-14T21:03:00,835 adding 'flashinfer/comm/cuda_ipc.py' 2026-04-14T21:03:00,837 adding 'flashinfer/comm/dlpack_utils.py' 2026-04-14T21:03:00,839 adding 'flashinfer/comm/mapping.py' 2026-04-14T21:03:00,845 adding 'flashinfer/comm/mnnvl.py' 2026-04-14T21:03:00,846 adding 'flashinfer/comm/nvshmem.py' 2026-04-14T21:03:00,848 adding 'flashinfer/comm/nvshmem_allreduce.py' 2026-04-14T21:03:00,850 adding 'flashinfer/comm/trtllm_alltoall.py' 2026-04-14T21:03:00,854 adding 'flashinfer/comm/trtllm_ar.py' 2026-04-14T21:03:00,858 adding 'flashinfer/comm/trtllm_mnnvl_ar.py' 2026-04-14T21:03:00,861 adding 'flashinfer/comm/trtllm_moe_alltoall.py' 2026-04-14T21:03:00,862 adding 'flashinfer/comm/vllm_ar.py' 2026-04-14T21:03:00,864 adding 'flashinfer/comm/workspace_base.py' 2026-04-14T21:03:00,866 adding 'flashinfer/cudnn/__init__.py' 2026-04-14T21:03:00,868 adding 'flashinfer/cudnn/decode.py' 2026-04-14T21:03:00,870 adding 'flashinfer/cudnn/prefill.py' 2026-04-14T21:03:00,872 adding 'flashinfer/cudnn/utils.py' 2026-04-14T21:03:00,874 adding 'flashinfer/cute_dsl/__init__.py' 2026-04-14T21:03:00,878 adding 'flashinfer/cute_dsl/add_rmsnorm_fp4quant.py' 2026-04-14T21:03:00,880 adding 'flashinfer/cute_dsl/blockscaled_gemm.py' 2026-04-14T21:03:00,884 adding 'flashinfer/cute_dsl/fp4_common.py' 2026-04-14T21:03:00,891 adding 'flashinfer/cute_dsl/gemm_allreduce_two_shot.py' 2026-04-14T21:03:00,896 adding 'flashinfer/cute_dsl/rmsnorm_fp4quant.py' 2026-04-14T21:03:00,898 adding 'flashinfer/cute_dsl/utils.py' 2026-04-14T21:03:00,900 adding 'flashinfer/cute_dsl/attention/__init__.py' 2026-04-14T21:03:00,903 adding 'flashinfer/cute_dsl/attention/collective_builder.py' 2026-04-14T21:03:00,905 adding 'flashinfer/cute_dsl/attention/compat.py' 2026-04-14T21:03:00,906 adding 'flashinfer/cute_dsl/attention/config.py' 2026-04-14T21:03:00,908 adding 'flashinfer/cute_dsl/attention/mainloop_spec.py' 2026-04-14T21:03:00,909 adding 'flashinfer/cute_dsl/attention/mla_config.py' 2026-04-14T21:03:00,912 adding 'flashinfer/cute_dsl/attention/mla_decode.py' 2026-04-14T21:03:00,916 adding 'flashinfer/cute_dsl/attention/mla_decode_fp8.py' 2026-04-14T21:03:00,918 adding 'flashinfer/cute_dsl/attention/mla_warp_schedule.py' 2026-04-14T21:03:00,920 adding 'flashinfer/cute_dsl/attention/pipeline_topology.py' 2026-04-14T21:03:00,922 adding 'flashinfer/cute_dsl/attention/prefill.py' 2026-04-14T21:03:00,924 adding 'flashinfer/cute_dsl/attention/tmem_layout.py' 2026-04-14T21:03:00,925 adding 'flashinfer/cute_dsl/attention/warp_schedule.py' 2026-04-14T21:03:00,926 adding 'flashinfer/cute_dsl/attention/fusion/__init__.py' 2026-04-14T21:03:00,928 adding 'flashinfer/cute_dsl/attention/fusion/mask.py' 2026-04-14T21:03:00,931 adding 'flashinfer/cute_dsl/attention/fusion/variant.py' 2026-04-14T21:03:00,933 adding 'flashinfer/cute_dsl/attention/roles/__init__.py' 2026-04-14T21:03:00,935 adding 'flashinfer/cute_dsl/attention/roles/correction.py' 2026-04-14T21:03:00,936 adding 'flashinfer/cute_dsl/attention/roles/epilogue.py' 2026-04-14T21:03:00,938 adding 'flashinfer/cute_dsl/attention/roles/loader_tma.py' 2026-04-14T21:03:00,941 adding 'flashinfer/cute_dsl/attention/roles/mla_compute.py' 2026-04-14T21:03:00,944 adding 'flashinfer/cute_dsl/attention/roles/mla_correction.py' 2026-04-14T21:03:00,946 adding 'flashinfer/cute_dsl/attention/roles/mla_loader.py' 2026-04-14T21:03:00,948 adding 'flashinfer/cute_dsl/attention/roles/mla_loader_fp8.py' 2026-04-14T21:03:00,950 adding 'flashinfer/cute_dsl/attention/roles/mla_mma.py' 2026-04-14T21:03:00,952 adding 'flashinfer/cute_dsl/attention/roles/mla_mma_fp8.py' 2026-04-14T21:03:00,954 adding 'flashinfer/cute_dsl/attention/roles/mla_pt_loader.py' 2026-04-14T21:03:00,955 adding 'flashinfer/cute_dsl/attention/roles/mma.py' 2026-04-14T21:03:00,958 adding 'flashinfer/cute_dsl/attention/roles/softmax.py' 2026-04-14T21:03:00,960 adding 'flashinfer/cute_dsl/attention/roles/softmax_math.py' 2026-04-14T21:03:00,962 adding 'flashinfer/cute_dsl/attention/scheduler/__init__.py' 2026-04-14T21:03:00,963 adding 'flashinfer/cute_dsl/attention/scheduler/mla_persistent.py' 2026-04-14T21:03:00,965 adding 'flashinfer/cute_dsl/attention/scheduler/persistent.py' 2026-04-14T21:03:00,967 adding 'flashinfer/cute_dsl/attention/wrappers/__init__.py' 2026-04-14T21:03:00,970 adding 'flashinfer/cute_dsl/attention/wrappers/batch_mla.py' 2026-04-14T21:03:00,972 adding 'flashinfer/cute_dsl/attention/wrappers/batch_prefill.py' 2026-04-14T21:03:00,974 adding 'flashinfer/data/build_backend.py' 2026-04-14T21:03:00,976 adding 'flashinfer/data/build_utils.py' 2026-04-14T21:03:00,981 adding 'flashinfer/data/csrc/batch_attention.cu' 2026-04-14T21:03:00,982 adding 'flashinfer/data/csrc/batch_attention_customize_config.jinja' 2026-04-14T21:03:00,983 adding 'flashinfer/data/csrc/batch_attention_jit_binding.cu' 2026-04-14T21:03:00,985 adding 'flashinfer/data/csrc/batch_attention_paged_kernel_inst.jinja' 2026-04-14T21:03:00,986 adding 'flashinfer/data/csrc/batch_decode.cu' 2026-04-14T21:03:00,988 adding 'flashinfer/data/csrc/batch_decode_customize_config.jinja' 2026-04-14T21:03:00,989 adding 'flashinfer/data/csrc/batch_decode_jit_binding.cu' 2026-04-14T21:03:00,990 adding 'flashinfer/data/csrc/batch_decode_kernel_inst.jinja' 2026-04-14T21:03:00,991 adding 'flashinfer/data/csrc/batch_decode_mla_binding.cu' 2026-04-14T21:03:00,992 adding 'flashinfer/data/csrc/batch_decode_mla_config.jinja' 2026-04-14T21:03:00,994 adding 'flashinfer/data/csrc/batch_decode_mla_cute_sm80.cu' 2026-04-14T21:03:00,995 adding 'flashinfer/data/csrc/batch_decode_mla_plan.cu' 2026-04-14T21:03:00,997 adding 'flashinfer/data/csrc/batch_decode_mla_run.cu' 2026-04-14T21:03:00,998 adding 'flashinfer/data/csrc/batch_mla_binding.cu' 2026-04-14T21:03:00,999 adding 'flashinfer/data/csrc/batch_mla_config.jinja' 2026-04-14T21:03:01,001 adding 'flashinfer/data/csrc/batch_mla_plan.cu' 2026-04-14T21:03:01,002 adding 'flashinfer/data/csrc/batch_mla_run.cu' 2026-04-14T21:03:01,003 adding 'flashinfer/data/csrc/batch_mla_sm90_binding.cu' 2026-04-14T21:03:01,005 adding 'flashinfer/data/csrc/batch_mla_sm90_plan.cu' 2026-04-14T21:03:01,006 adding 'flashinfer/data/csrc/batch_mla_sm90_run.cu' 2026-04-14T21:03:01,008 adding 'flashinfer/data/csrc/batch_pod.cu' 2026-04-14T21:03:01,010 adding 'flashinfer/data/csrc/batch_pod_customize_config.jinja' 2026-04-14T21:03:01,011 adding 'flashinfer/data/csrc/batch_pod_jit_binding.cu' 2026-04-14T21:03:01,012 adding 'flashinfer/data/csrc/batch_pod_kernel_inst.jinja' 2026-04-14T21:03:01,014 adding 'flashinfer/data/csrc/batch_prefill.cu' 2026-04-14T21:03:01,015 adding 'flashinfer/data/csrc/batch_prefill_customize_config.jinja' 2026-04-14T21:03:01,016 adding 'flashinfer/data/csrc/batch_prefill_fp8_paged_sm90_kernel_inst.jinja' 2026-04-14T21:03:01,018 adding 'flashinfer/data/csrc/batch_prefill_fp8_ragged_sm90_kernel_inst.jinja' 2026-04-14T21:03:01,019 adding 'flashinfer/data/csrc/batch_prefill_fp8_sm90.cu' 2026-04-14T21:03:01,020 adding 'flashinfer/data/csrc/batch_prefill_jit_binding.cu' 2026-04-14T21:03:01,022 adding 'flashinfer/data/csrc/batch_prefill_paged_kernel_inst.jinja' 2026-04-14T21:03:01,023 adding 'flashinfer/data/csrc/batch_prefill_paged_sm90_kernel_inst.jinja' 2026-04-14T21:03:01,024 adding 'flashinfer/data/csrc/batch_prefill_ragged_kernel_inst.jinja' 2026-04-14T21:03:01,025 adding 'flashinfer/data/csrc/batch_prefill_ragged_sm90_kernel_inst.jinja' 2026-04-14T21:03:01,027 adding 'flashinfer/data/csrc/batch_prefill_sm90.cu' 2026-04-14T21:03:01,028 adding 'flashinfer/data/csrc/batch_prefill_sm90_customize_config.jinja' 2026-04-14T21:03:01,030 adding 'flashinfer/data/csrc/batch_prefill_sm90_jit_binding.cu' 2026-04-14T21:03:01,031 adding 'flashinfer/data/csrc/bf16_gemm_cutlass.cu' 2026-04-14T21:03:01,033 adding 'flashinfer/data/csrc/bf16_gemm_cutlass.jinja' 2026-04-14T21:03:01,034 adding 'flashinfer/data/csrc/blackwell_fmha_plan.cu' 2026-04-14T21:03:01,035 adding 'flashinfer/data/csrc/bmm_fp8.cu' 2026-04-14T21:03:01,037 adding 'flashinfer/data/csrc/cascade.cu' 2026-04-14T21:03:01,038 adding 'flashinfer/data/csrc/concat_mla.cu' 2026-04-14T21:03:01,043 adding 'flashinfer/data/csrc/cudnn_sdpa_kernel_launcher.cu' 2026-04-14T21:03:01,045 adding 'flashinfer/data/csrc/cudnn_sdpa_utils.h' 2026-04-14T21:03:01,046 adding 'flashinfer/data/csrc/cutlass_mla.cu' 2026-04-14T21:03:01,048 adding 'flashinfer/data/csrc/dsv3_router_gemm.cu' 2026-04-14T21:03:01,049 adding 'flashinfer/data/csrc/flashinfer_cascade_binding.cu' 2026-04-14T21:03:01,050 adding 'flashinfer/data/csrc/flashinfer_gemm_binding.cu' 2026-04-14T21:03:01,051 adding 'flashinfer/data/csrc/flashinfer_gemm_sm90_binding.cu' 2026-04-14T21:03:01,053 adding 'flashinfer/data/csrc/flashinfer_mamba_binding.cu' 2026-04-14T21:03:01,054 adding 'flashinfer/data/csrc/flashinfer_mla_binding.cu' 2026-04-14T21:03:01,055 adding 'flashinfer/data/csrc/flashinfer_norm_binding.cu' 2026-04-14T21:03:01,057 adding 'flashinfer/data/csrc/flashinfer_page_binding.cu' 2026-04-14T21:03:01,058 adding 'flashinfer/data/csrc/flashinfer_quantization_binding.cu' 2026-04-14T21:03:01,059 adding 'flashinfer/data/csrc/flashinfer_rmsnorm_silu_binding.cu' 2026-04-14T21:03:01,061 adding 'flashinfer/data/csrc/flashinfer_rope_binding.cu' 2026-04-14T21:03:01,062 adding 'flashinfer/data/csrc/flashinfer_sampling_binding.cu' 2026-04-14T21:03:01,063 adding 'flashinfer/data/csrc/flashinfer_topk_binding.cu' 2026-04-14T21:03:01,065 adding 'flashinfer/data/csrc/flashinfer_xqa_binding.cu' 2026-04-14T21:03:01,066 adding 'flashinfer/data/csrc/flat_prefill_kernel_delta_rule_sm90_extern.inc' 2026-04-14T21:03:01,069 adding 'flashinfer/data/csrc/fmhaReduction.cu' 2026-04-14T21:03:01,070 adding 'flashinfer/data/csrc/fmha_cutlass_sm100.cu' 2026-04-14T21:03:01,072 adding 'flashinfer/data/csrc/fmha_cutlass_sm100_binding.cu' 2026-04-14T21:03:01,073 adding 'flashinfer/data/csrc/fmha_v2_jit_binding.cu' 2026-04-14T21:03:01,077 adding 'flashinfer/data/csrc/fmha_v2_run.cu' 2026-04-14T21:03:01,079 adding 'flashinfer/data/csrc/fp4_gemm_cutlass.cu' 2026-04-14T21:03:01,080 adding 'flashinfer/data/csrc/fp4_gemm_cutlass.jinja' 2026-04-14T21:03:01,082 adding 'flashinfer/data/csrc/fp4_gemm_cutlass_sm103.cu' 2026-04-14T21:03:01,083 adding 'flashinfer/data/csrc/fp4_gemm_cutlass_sm103.jinja' 2026-04-14T21:03:01,085 adding 'flashinfer/data/csrc/fp4_gemm_cutlass_sm120.cu' 2026-04-14T21:03:01,086 adding 'flashinfer/data/csrc/fp4_gemm_cutlass_sm120.jinja' 2026-04-14T21:03:01,087 adding 'flashinfer/data/csrc/fp4_kv_dequantization.cu' 2026-04-14T21:03:01,089 adding 'flashinfer/data/csrc/fp4_kv_quantization.cu' 2026-04-14T21:03:01,091 adding 'flashinfer/data/csrc/fp8_blockscale_gemm_sm90_binding.cu' 2026-04-14T21:03:01,092 adding 'flashinfer/data/csrc/fp8_gemm_cutlass.cu' 2026-04-14T21:03:01,093 adding 'flashinfer/data/csrc/fp8_gemm_cutlass.jinja' 2026-04-14T21:03:01,095 adding 'flashinfer/data/csrc/gdn_prefill_launcher.cu' 2026-04-14T21:03:01,096 adding 'flashinfer/data/csrc/gdn_prefill_sm90_kernel_inst.jinja' 2026-04-14T21:03:01,098 adding 'flashinfer/data/csrc/gemm_groupwise_sm100.cu' 2026-04-14T21:03:01,099 adding 'flashinfer/data/csrc/gemm_groupwise_sm100_kernel_inst.jinja' 2026-04-14T21:03:01,101 adding 'flashinfer/data/csrc/gemm_groupwise_sm120.cu' 2026-04-14T21:03:01,102 adding 'flashinfer/data/csrc/gemm_groupwise_sm120_kernel_inst.jinja' 2026-04-14T21:03:01,103 adding 'flashinfer/data/csrc/gemm_sm100_binding.cu' 2026-04-14T21:03:01,104 adding 'flashinfer/data/csrc/gemm_sm120_binding.cu' 2026-04-14T21:03:01,106 adding 'flashinfer/data/csrc/group_gemm.cu' 2026-04-14T21:03:01,107 adding 'flashinfer/data/csrc/group_gemm_fp8_groupwise_sm100.cu' 2026-04-14T21:03:01,108 adding 'flashinfer/data/csrc/group_gemm_fp8_groupwise_sm100_kernel_inst.jinja' 2026-04-14T21:03:01,110 adding 'flashinfer/data/csrc/group_gemm_fp8_groupwise_sm120.cu' 2026-04-14T21:03:01,111 adding 'flashinfer/data/csrc/group_gemm_fp8_groupwise_sm120_kernel_inst.jinja' 2026-04-14T21:03:01,113 adding 'flashinfer/data/csrc/group_gemm_mxfp4_groupwise_sm100.cu' 2026-04-14T21:03:01,114 adding 'flashinfer/data/csrc/group_gemm_mxfp4_groupwise_sm100_kernel_inst.jinja' 2026-04-14T21:03:01,116 adding 'flashinfer/data/csrc/group_gemm_mxfp4_groupwise_sm120.cu' 2026-04-14T21:03:01,117 adding 'flashinfer/data/csrc/group_gemm_mxfp4_groupwise_sm120_kernel_inst.jinja' 2026-04-14T21:03:01,118 adding 'flashinfer/data/csrc/group_gemm_nvfp4_groupwise_sm120.cu' 2026-04-14T21:03:01,120 adding 'flashinfer/data/csrc/group_gemm_nvfp4_groupwise_sm120_kernel_inst.jinja' 2026-04-14T21:03:01,121 adding 'flashinfer/data/csrc/group_gemm_sm100_binding.cu' 2026-04-14T21:03:01,122 adding 'flashinfer/data/csrc/group_gemm_sm120_binding.cu' 2026-04-14T21:03:01,124 adding 'flashinfer/data/csrc/group_gemm_sm90.cu' 2026-04-14T21:03:01,125 adding 'flashinfer/data/csrc/group_gemm_sm90_kernel_inst.jinja' 2026-04-14T21:03:01,126 adding 'flashinfer/data/csrc/logging.cc' 2026-04-14T21:03:01,128 adding 'flashinfer/data/csrc/moe_utils_binding.cu' 2026-04-14T21:03:01,130 adding 'flashinfer/data/csrc/mxfp8_gemm_cutlass.cu' 2026-04-14T21:03:01,132 adding 'flashinfer/data/csrc/mxfp8_gemm_cutlass.jinja' 2026-04-14T21:03:01,133 adding 'flashinfer/data/csrc/mxfp8_gemm_cutlass_sm120.cu' 2026-04-14T21:03:01,135 adding 'flashinfer/data/csrc/mxfp8_gemm_cutlass_sm120.jinja' 2026-04-14T21:03:01,136 adding 'flashinfer/data/csrc/norm.cu' 2026-04-14T21:03:01,138 adding 'flashinfer/data/csrc/page.cu' 2026-04-14T21:03:01,140 adding 'flashinfer/data/csrc/pod.cu' 2026-04-14T21:03:01,141 adding 'flashinfer/data/csrc/pod_customize_config.jinja' 2026-04-14T21:03:01,142 adding 'flashinfer/data/csrc/pod_jit_binding.cu' 2026-04-14T21:03:01,144 adding 'flashinfer/data/csrc/pod_kernel_inst.jinja' 2026-04-14T21:03:01,145 adding 'flashinfer/data/csrc/prefill_kernel_delta_rule_sm90.cu' 2026-04-14T21:03:01,146 adding 'flashinfer/data/csrc/quantization.cu' 2026-04-14T21:03:01,148 adding 'flashinfer/data/csrc/renorm.cu' 2026-04-14T21:03:01,149 adding 'flashinfer/data/csrc/rmsnorm_silu.cu' 2026-04-14T21:03:01,152 adding 'flashinfer/data/csrc/rope.cu' 2026-04-14T21:03:01,153 adding 'flashinfer/data/csrc/runtime_utils.h' 2026-04-14T21:03:01,155 adding 'flashinfer/data/csrc/sampling.cu' 2026-04-14T21:03:01,157 adding 'flashinfer/data/csrc/sampling_utils.h' 2026-04-14T21:03:01,160 adding 'flashinfer/data/csrc/selective_state_update.cu' 2026-04-14T21:03:01,161 adding 'flashinfer/data/csrc/selective_state_update_customize_config.jinja' 2026-04-14T21:03:01,162 adding 'flashinfer/data/csrc/selective_state_update_dtype_inst.jinja' 2026-04-14T21:03:01,163 adding 'flashinfer/data/csrc/selective_state_update_kernel_inst.cu' 2026-04-14T21:03:01,164 adding 'flashinfer/data/csrc/seq_chunk_cumsum.cu' 2026-04-14T21:03:01,166 adding 'flashinfer/data/csrc/seq_chunk_cumsum_jit_binding.cu' 2026-04-14T21:03:01,167 adding 'flashinfer/data/csrc/single_decode.cu' 2026-04-14T21:03:01,168 adding 'flashinfer/data/csrc/single_decode_customize_config.jinja' 2026-04-14T21:03:01,169 adding 'flashinfer/data/csrc/single_decode_jit_binding.cu' 2026-04-14T21:03:01,171 adding 'flashinfer/data/csrc/single_decode_kernel_inst.jinja' 2026-04-14T21:03:01,172 adding 'flashinfer/data/csrc/single_prefill.cu' 2026-04-14T21:03:01,173 adding 'flashinfer/data/csrc/single_prefill_customize_config.jinja' 2026-04-14T21:03:01,175 adding 'flashinfer/data/csrc/single_prefill_fp8_sm90.cu' 2026-04-14T21:03:01,176 adding 'flashinfer/data/csrc/single_prefill_fp8_sm90_kernel_inst.jinja' 2026-04-14T21:03:01,177 adding 'flashinfer/data/csrc/single_prefill_jit_binding.cu' 2026-04-14T21:03:01,178 adding 'flashinfer/data/csrc/single_prefill_kernel_inst.jinja' 2026-04-14T21:03:01,180 adding 'flashinfer/data/csrc/single_prefill_sm90.cu' 2026-04-14T21:03:01,181 adding 'flashinfer/data/csrc/single_prefill_sm90_customize_config.jinja' 2026-04-14T21:03:01,182 adding 'flashinfer/data/csrc/single_prefill_sm90_jit_binding.cu' 2026-04-14T21:03:01,183 adding 'flashinfer/data/csrc/single_prefill_sm90_kernel_inst.jinja' 2026-04-14T21:03:01,185 adding 'flashinfer/data/csrc/tgv_gemm.cu' 2026-04-14T21:03:01,186 adding 'flashinfer/data/csrc/tgv_gemm.jinja' 2026-04-14T21:03:01,189 adding 'flashinfer/data/csrc/tinygemm2.cu' 2026-04-14T21:03:01,191 adding 'flashinfer/data/csrc/topk.cu' 2026-04-14T21:03:01,193 adding 'flashinfer/data/csrc/trtllm_allreduce.cu' 2026-04-14T21:03:01,194 adding 'flashinfer/data/csrc/trtllm_allreduce_fusion.cu' 2026-04-14T21:03:01,196 adding 'flashinfer/data/csrc/trtllm_alltoall.cu' 2026-04-14T21:03:01,199 adding 'flashinfer/data/csrc/trtllm_alltoall_prepare.cu' 2026-04-14T21:03:01,202 adding 'flashinfer/data/csrc/trtllm_batched_gemm_runner.cu' 2026-04-14T21:03:01,205 adding 'flashinfer/data/csrc/trtllm_fmha_kernel_launcher.cu' 2026-04-14T21:03:01,208 adding 'flashinfer/data/csrc/trtllm_fmha_v2_binding.cu' 2026-04-14T21:03:01,217 adding 'flashinfer/data/csrc/trtllm_fused_moe_kernel_launcher.cu' 2026-04-14T21:03:01,221 adding 'flashinfer/data/csrc/trtllm_fused_moe_runner.cu' 2026-04-14T21:03:01,223 adding 'flashinfer/data/csrc/trtllm_gemm_runner.cu' 2026-04-14T21:03:01,225 adding 'flashinfer/data/csrc/trtllm_low_latency_gemm_runner.cu' 2026-04-14T21:03:01,227 adding 'flashinfer/data/csrc/trtllm_mnnvl_allreduce.cu' 2026-04-14T21:03:01,228 adding 'flashinfer/data/csrc/trtllm_moe_allreduce_fusion.cu' 2026-04-14T21:03:01,231 adding 'flashinfer/data/csrc/trtllm_moe_alltoall.cu' 2026-04-14T21:03:01,233 adding 'flashinfer/data/csrc/tvm_ffi_utils.h' 2026-04-14T21:03:01,234 adding 'flashinfer/data/csrc/vllm_custom_all_reduce.cu' 2026-04-14T21:03:01,237 adding 'flashinfer/data/csrc/fmha_v2/fused_multihead_attention.h' 2026-04-14T21:03:01,239 adding 'flashinfer/data/csrc/fmha_v2/fused_multihead_attention_demo_bert_params.h' 2026-04-14T21:03:01,241 adding 'flashinfer/data/csrc/fmha_v2/fused_multihead_attention_kernel.h' 2026-04-14T21:03:01,243 adding 'flashinfer/data/csrc/fmha_v2/fused_multihead_attention_kernel_1xN.h' 2026-04-14T21:03:01,245 adding 'flashinfer/data/csrc/fmha_v2/fused_multihead_attention_kernel_1xN_multi_cta.h' 2026-04-14T21:03:01,247 adding 'flashinfer/data/csrc/fmha_v2/fused_multihead_attention_kernel_1xN_noloop.h' 2026-04-14T21:03:01,249 adding 'flashinfer/data/csrc/fmha_v2/fused_multihead_attention_kernel_2x2.h' 2026-04-14T21:03:01,252 adding 'flashinfer/data/csrc/fmha_v2/fused_multihead_attention_kernel_4x1_hopper.h' 2026-04-14T21:03:01,254 adding 'flashinfer/data/csrc/fmha_v2/fused_multihead_attention_kernel_4x1_hopper_noloop.h' 2026-04-14T21:03:01,256 adding 'flashinfer/data/csrc/fmha_v2/fused_multihead_attention_kernel_4xN_hopper.h' 2026-04-14T21:03:01,258 adding 'flashinfer/data/csrc/fmha_v2/fused_multihead_attention_kernel_4xN_hopper_noloop.h' 2026-04-14T21:03:01,263 adding 'flashinfer/data/csrc/fmha_v2/fused_multihead_attention_utils.h' 2026-04-14T21:03:01,265 adding 'flashinfer/data/csrc/fmha_v2/fused_multihead_cross_attention.h' 2026-04-14T21:03:01,267 adding 'flashinfer/data/csrc/fmha_v2/fused_multihead_cross_attention_kernel_1xN.h' 2026-04-14T21:03:01,268 adding 'flashinfer/data/csrc/fmha_v2/fused_multihead_cross_attention_kernel_1xN_noloop.h' 2026-04-14T21:03:01,271 adding 'flashinfer/data/csrc/fmha_v2/fused_multihead_flash_attention_kernel.h' 2026-04-14T21:03:01,274 adding 'flashinfer/data/csrc/fmha_v2/fused_multihead_flash_attention_kernel_noloop.h' 2026-04-14T21:03:01,276 adding 'flashinfer/data/csrc/fmha_v2/fused_multihead_flash_attention_kernel_noloop_tiled.h' 2026-04-14T21:03:01,278 adding 'flashinfer/data/csrc/fmha_v2/fmha/alibi_params.h' 2026-04-14T21:03:01,283 adding 'flashinfer/data/csrc/fmha_v2/fmha/fragment.h' 2026-04-14T21:03:01,285 adding 'flashinfer/data/csrc/fmha_v2/fmha/gemm.h' 2026-04-14T21:03:01,287 adding 'flashinfer/data/csrc/fmha_v2/fmha/gmem_tile_o.h' 2026-04-14T21:03:01,291 adding 'flashinfer/data/csrc/fmha_v2/fmha/gmem_tile_o_packed.h' 2026-04-14T21:03:01,294 adding 'flashinfer/data/csrc/fmha_v2/fmha/gmem_tile_ps.h' 2026-04-14T21:03:01,295 adding 'flashinfer/data/csrc/fmha_v2/fmha/gmem_tile_qkv.h' 2026-04-14T21:03:01,299 adding 'flashinfer/data/csrc/fmha_v2/fmha/gmem_tile_qkv_packed.h' 2026-04-14T21:03:01,302 adding 'flashinfer/data/csrc/fmha_v2/fmha/kernel_traits.h' 2026-04-14T21:03:01,305 adding 'flashinfer/data/csrc/fmha_v2/fmha/mask.h' 2026-04-14T21:03:01,306 adding 'flashinfer/data/csrc/fmha_v2/fmha/numeric_types.h' 2026-04-14T21:03:01,307 adding 'flashinfer/data/csrc/fmha_v2/fmha/paged_kv_cache.h' 2026-04-14T21:03:01,312 adding 'flashinfer/data/csrc/fmha_v2/fmha/smem_tile.h' 2026-04-14T21:03:01,317 adding 'flashinfer/data/csrc/fmha_v2/fmha/smem_tile_o.h' 2026-04-14T21:03:01,320 adding 'flashinfer/data/csrc/fmha_v2/fmha/smem_tile_qkv.h' 2026-04-14T21:03:01,322 adding 'flashinfer/data/csrc/fmha_v2/fmha/smem_tile_v.h' 2026-04-14T21:03:01,333 adding 'flashinfer/data/csrc/fmha_v2/fmha/softmax.h' 2026-04-14T21:03:01,337 adding 'flashinfer/data/csrc/fmha_v2/fmha/traits.h' 2026-04-14T21:03:01,343 adding 'flashinfer/data/csrc/fmha_v2/fmha/utils.h' 2026-04-14T21:03:01,346 adding 'flashinfer/data/csrc/fmha_v2/fmha/hopper/arrive_wait.h' 2026-04-14T21:03:01,348 adding 'flashinfer/data/csrc/fmha_v2/fmha/hopper/compute_tile.h' 2026-04-14T21:03:01,350 adding 'flashinfer/data/csrc/fmha_v2/fmha/hopper/fragment.h' 2026-04-14T21:03:01,354 adding 'flashinfer/data/csrc/fmha_v2/fmha/hopper/gmem_tile_o_packed.h' 2026-04-14T21:03:01,356 adding 'flashinfer/data/csrc/fmha_v2/fmha/hopper/gmem_tile_qkv_packed.h' 2026-04-14T21:03:01,358 adding 'flashinfer/data/csrc/fmha_v2/fmha/hopper/gmma_descriptor.h' 2026-04-14T21:03:01,360 adding 'flashinfer/data/csrc/fmha_v2/fmha/hopper/kernel_traits.h' 2026-04-14T21:03:01,367 adding 'flashinfer/data/csrc/fmha_v2/fmha/hopper/smem_tile.h' 2026-04-14T21:03:01,369 adding 'flashinfer/data/csrc/fmha_v2/fmha/hopper/smem_tile_o.h' 2026-04-14T21:03:01,371 adding 'flashinfer/data/csrc/fmha_v2/fmha/hopper/tma_descriptor.h' 2026-04-14T21:03:01,373 adding 'flashinfer/data/csrc/fmha_v2/fmha/hopper/tma_types.h' 2026-04-14T21:03:01,374 adding 'flashinfer/data/csrc/fmha_v2/fmha/hopper/utils_gmma.h' 2026-04-14T21:03:01,377 adding 'flashinfer/data/csrc/fmha_v2/fmha/hopper/utils_hgmma.h' 2026-04-14T21:03:01,379 adding 'flashinfer/data/csrc/fmha_v2/fmha/hopper/utils_hgmma_bf16.h' 2026-04-14T21:03:01,381 adding 'flashinfer/data/csrc/fmha_v2/fmha/hopper/utils_igmma.h' 2026-04-14T21:03:01,385 adding 'flashinfer/data/csrc/fmha_v2/fmha/hopper/utils_qgmma.h' 2026-04-14T21:03:01,387 adding 'flashinfer/data/csrc/fmha_v2/fmha/hopper/utils_tma.h' 2026-04-14T21:03:01,388 adding 'flashinfer/data/csrc/fmha_v2/fmha/hopper/utils_warpgroup.h' 2026-04-14T21:03:01,390 adding 'flashinfer/data/csrc/fmha_v2/fmha/warpspec/circular_buffer.h' 2026-04-14T21:03:01,394 adding 'flashinfer/data/csrc/fmha_v2/fmha/warpspec/compute.h' 2026-04-14T21:03:01,398 adding 'flashinfer/data/csrc/fmha_v2/fmha/warpspec/dma.h' 2026-04-14T21:03:01,401 adding 'flashinfer/data/csrc/fmha_v2/fmha/warpspec/epilogue.h' 2026-04-14T21:03:01,404 adding 'flashinfer/data/csrc/fmha_v2/fmha/warpspec/kernel_traits.h' 2026-04-14T21:03:01,407 adding 'flashinfer/data/csrc/fmha_v2/templates/fa_kernel.jinja' 2026-04-14T21:03:01,409 adding 'flashinfer/data/csrc/fmha_v2/templates/kernel.jinja' 2026-04-14T21:03:01,410 adding 'flashinfer/data/csrc/fmha_v2/templates/kernel_hopper.jinja' 2026-04-14T21:03:01,412 adding 'flashinfer/data/csrc/fmha_v2/templates/kernel_hopper_ws.jinja' 2026-04-14T21:03:01,415 adding 'flashinfer/data/csrc/fused_moe/moeTopKFuncs.cuh' 2026-04-14T21:03:01,417 adding 'flashinfer/data/csrc/fused_moe/noAuxTcKernels.cu' 2026-04-14T21:03:01,419 adding 'flashinfer/data/csrc/fused_moe/cutlass_backend/cutlass_fused_moe_instantiation.cu' 2026-04-14T21:03:01,442 adding 'flashinfer/data/csrc/fused_moe/cutlass_backend/cutlass_fused_moe_kernels.cuh' 2026-04-14T21:03:01,445 adding 'flashinfer/data/csrc/fused_moe/cutlass_backend/deepgemm_jit_setup.cu' 2026-04-14T21:03:01,450 adding 'flashinfer/data/csrc/fused_moe/cutlass_backend/flashinfer_cutlass_fused_moe_binding.cu' 2026-04-14T21:03:01,455 adding 'flashinfer/data/csrc/fused_moe/trtllm_backend/trtllm_fused_moe_dev_kernel.cu' 2026-04-14T21:03:01,458 adding 'flashinfer/data/csrc/fused_moe/trtllm_backend/trtllm_fused_moe_routing_common.cu' 2026-04-14T21:03:01,462 adding 'flashinfer/data/csrc/fused_moe/trtllm_backend/trtllm_fused_moe_routing_custom.cu' 2026-04-14T21:03:01,466 adding 'flashinfer/data/csrc/fused_moe/trtllm_backend/trtllm_fused_moe_routing_deepseek.cu' 2026-04-14T21:03:01,469 adding 'flashinfer/data/csrc/fused_moe/trtllm_backend/trtllm_fused_moe_routing_llama4.cu' 2026-04-14T21:03:01,473 adding 'flashinfer/data/csrc/nv_internal/cpp/common/envUtils.cpp' 2026-04-14T21:03:01,474 adding 'flashinfer/data/csrc/nv_internal/cpp/common/logger.cpp' 2026-04-14T21:03:01,478 adding 'flashinfer/data/csrc/nv_internal/cpp/common/memoryUtils.cu' 2026-04-14T21:03:01,479 adding 'flashinfer/data/csrc/nv_internal/cpp/common/stringUtils.cpp' 2026-04-14T21:03:01,481 adding 'flashinfer/data/csrc/nv_internal/cpp/common/tllmException.cpp' 2026-04-14T21:03:01,484 adding 'flashinfer/data/csrc/nv_internal/cpp/kernels/quantization.cu' 2026-04-14T21:03:01,487 adding 'flashinfer/data/csrc/nv_internal/include/tensorrt_llm/common/NvInferRuntime.h' 2026-04-14T21:03:01,488 adding 'flashinfer/data/csrc/nv_internal/include/tensorrt_llm/common/assert.h' 2026-04-14T21:03:01,490 adding 'flashinfer/data/csrc/nv_internal/include/tensorrt_llm/common/config.h' 2026-04-14T21:03:01,491 adding 'flashinfer/data/csrc/nv_internal/include/tensorrt_llm/common/cudaBf16Wrapper.h' 2026-04-14T21:03:01,493 adding 'flashinfer/data/csrc/nv_internal/include/tensorrt_llm/common/cudaFp8Utils.h' 2026-04-14T21:03:01,497 adding 'flashinfer/data/csrc/nv_internal/include/tensorrt_llm/common/cudaUtils.h' 2026-04-14T21:03:01,498 adding 'flashinfer/data/csrc/nv_internal/include/tensorrt_llm/common/dataType.h' 2026-04-14T21:03:01,500 adding 'flashinfer/data/csrc/nv_internal/include/tensorrt_llm/common/logger.h' 2026-04-14T21:03:01,501 adding 'flashinfer/data/csrc/nv_internal/include/tensorrt_llm/common/quantization.h' 2026-04-14T21:03:01,503 adding 'flashinfer/data/csrc/nv_internal/include/tensorrt_llm/common/stringUtils.h' 2026-04-14T21:03:01,504 adding 'flashinfer/data/csrc/nv_internal/include/tensorrt_llm/common/tllmException.h' 2026-04-14T21:03:01,507 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/common/cublasMMWrapper.h' 2026-04-14T21:03:01,508 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/common/cudaBf16Fallbacks.cuh' 2026-04-14T21:03:01,510 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/common/cudaDriverWrapper.h' 2026-04-14T21:03:01,512 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/common/cudaTypeUtils.cuh' 2026-04-14T21:03:01,513 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/common/envUtils.h' 2026-04-14T21:03:01,515 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/common/memoryUtils.h' 2026-04-14T21:03:01,516 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/common/quantTypeUtils.cuh' 2026-04-14T21:03:01,518 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/common/reduceKernelUtils.cuh' 2026-04-14T21:03:01,519 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/common/workspace.h' 2026-04-14T21:03:01,522 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/compute_occupancy.h' 2026-04-14T21:03:01,524 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/epilogue_helpers.h' 2026-04-14T21:03:01,526 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm_configs.h' 2026-04-14T21:03:01,528 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/interleaved_numeric_conversion.h' 2026-04-14T21:03:01,530 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/system_barrier.h' 2026-04-14T21:03:01,531 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/tile_interleaved_layout.h' 2026-04-14T21:03:01,532 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/weight_only_quant_op.h' 2026-04-14T21:03:01,534 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/arch/copy_red_global.hpp' 2026-04-14T21:03:01,536 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/arch/copy_sm90_multimem.hpp' 2026-04-14T21:03:01,537 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/arch/copy_traits_sm90_multimem.hpp' 2026-04-14T21:03:01,539 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/arch/grid_dependency_control.h' 2026-04-14T21:03:01,540 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/arch/mma.h' 2026-04-14T21:03:01,543 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/communication/collective/sm90_allreduce_nvls_warpspecialized.hpp' 2026-04-14T21:03:01,547 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/detail/collective/mixed_input_utils.hpp' 2026-04-14T21:03:01,551 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/epilogue/collective/epilogue_moe_finalize.hpp' 2026-04-14T21:03:01,554 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/epilogue/fusion/sm90_visitor_allreduce_tma_warpspecialized.hpp' 2026-04-14T21:03:01,557 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/epilogue/fusion/sm90_visitor_scatter.hpp' 2026-04-14T21:03:01,559 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/epilogue/thread/fused_activations.h' 2026-04-14T21:03:01,561 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective/collective_builder_gated.hpp' 2026-04-14T21:03:01,563 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective/collective_builder_interleaved.hpp' 2026-04-14T21:03:01,564 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective/collective_builder_mixed_input.hpp' 2026-04-14T21:03:01,565 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective/collective_mma_array_mixed_input.hpp' 2026-04-14T21:03:01,566 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective/collective_mma_gated.hpp' 2026-04-14T21:03:01,568 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective/collective_mma_interleaved.hpp' 2026-04-14T21:03:01,575 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective/sm90_mma_array_tma_gmma_rs_warpspecialized_mixed_input_.hpp' 2026-04-14T21:03:01,578 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective/sm90_mma_gated_tma_gmma_ss_warpspecialized.hpp' 2026-04-14T21:03:01,581 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective/sm90_mma_gated_tma_gmma_ss_warpspecialized_fp8.hpp' 2026-04-14T21:03:01,589 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective/sm90_mma_interleaved_tma_gmma_rs_warpspecialized_mixed_input.hpp' 2026-04-14T21:03:01,591 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective/builders/sm90_gmma_builder_gated.inl' 2026-04-14T21:03:01,593 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective/builders/sm90_gmma_builder_interleaved.inl' 2026-04-14T21:03:01,595 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/collective/builders/sm90_gmma_builder_mixed_input.inl' 2026-04-14T21:03:01,597 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/kernel/default_fpA_intB_traits.h' 2026-04-14T21:03:01,599 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/kernel/fused_moe_kernel.cuh' 2026-04-14T21:03:01,602 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/kernel/fused_moe_kernel_routine.cuh' 2026-04-14T21:03:01,604 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/kernel/fused_moe_kernel_traits.cuh' 2026-04-14T21:03:01,605 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/kernel/gemm_moe_problem_visitor.h' 2026-04-14T21:03:01,606 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/kernel/gemm_universal_allreduce.hpp' 2026-04-14T21:03:01,608 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/kernel/mixed_gemm_B_layout.h' 2026-04-14T21:03:01,609 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/kernel/moe_cute_util.cuh' 2026-04-14T21:03:01,612 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/kernel/moe_cutlass_kernel.h' 2026-04-14T21:03:01,615 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/kernel/moe_problem_visitor.h' 2026-04-14T21:03:01,617 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/kernel/sm90_gemm_allreduce_tma_warpspecialized.hpp' 2026-04-14T21:03:01,621 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/kernel/sm90_gemm_allreduce_tma_warpspecialized_pingpong.hpp' 2026-04-14T21:03:01,623 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/threadblock/default_dq_mma.h' 2026-04-14T21:03:01,625 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/threadblock/default_dq_mma_multistage.h' 2026-04-14T21:03:01,627 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/threadblock/default_dq_mma_pipelined.h' 2026-04-14T21:03:01,628 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/threadblock/default_mma.h' 2026-04-14T21:03:01,630 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/threadblock/default_mma_bf16.h' 2026-04-14T21:03:01,632 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/threadblock/dq_mma_base.h' 2026-04-14T21:03:01,633 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/threadblock/dq_mma_multistage.h' 2026-04-14T21:03:01,636 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/threadblock/dq_mma_multistage_finegrained.h' 2026-04-14T21:03:01,639 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/threadblock/dq_mma_multistage_percol.h' 2026-04-14T21:03:01,641 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/threadblock/dq_mma_pipelined.h' 2026-04-14T21:03:01,643 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/threadblock/dq_mma_pipelined_finegrained.h' 2026-04-14T21:03:01,646 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/threadblock/dq_mma_pipelined_percol.h' 2026-04-14T21:03:01,648 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/warp/default_mma_tensor_op.h' 2026-04-14T21:03:01,649 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/warp/mma_tensorop_compute_B_with_f16.h' 2026-04-14T21:03:01,651 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/gemm/warp/mma_tensorop_dequantizer.h' 2026-04-14T21:03:01,654 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/transform/threadblock/fine_grained_scale_zero_iterator.h' 2026-04-14T21:03:01,657 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/util/gather_tensor.hpp' 2026-04-14T21:03:01,660 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/deep_gemm/compiler.cuh' 2026-04-14T21:03:01,662 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/deep_gemm/fp8_gemm.cuh' 2026-04-14T21:03:01,665 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/deep_gemm/fp8_gemm_impl.cuh' 2026-04-14T21:03:01,667 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/deep_gemm/jit_utils.cuh' 2026-04-14T21:03:01,670 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/deep_gemm/mma_utils.cuh' 2026-04-14T21:03:01,677 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/deep_gemm/nvrtc_cutlass.cuh' 2026-04-14T21:03:01,679 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/deep_gemm/nvrtc_std.cuh' 2026-04-14T21:03:01,681 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/deep_gemm/runtime.cuh' 2026-04-14T21:03:01,684 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/deep_gemm/scheduler.cuh' 2026-04-14T21:03:01,685 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/deep_gemm/tma_utils.cuh' 2026-04-14T21:03:01,687 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/deep_gemm/utils.cuh' 2026-04-14T21:03:01,689 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/delayStream.cu' 2026-04-14T21:03:01,690 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/delayStream.h' 2026-04-14T21:03:01,691 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/preQuantScaleKernel.cu' 2026-04-14T21:03:01,693 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/preQuantScaleKernel.h' 2026-04-14T21:03:01,696 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/quantization.cuh' 2026-04-14T21:03:01,697 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/quantization.h' 2026-04-14T21:03:01,701 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/quantization_utils.cuh' 2026-04-14T21:03:01,706 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/communicationKernels/moeAlltoAllKernels.cu' 2026-04-14T21:03:01,708 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/communicationKernels/moeAlltoAllKernels.h' 2026-04-14T21:03:01,711 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cuteDslKernels/moeUtils.cu' 2026-04-14T21:03:01,712 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cuteDslKernels/moeUtils.h' 2026-04-14T21:03:01,716 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/cutlass_heuristic.cpp' 2026-04-14T21:03:01,718 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/cutlass_heuristic.h' 2026-04-14T21:03:01,719 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/cutlass_type_conversion.h' 2026-04-14T21:03:01,722 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fp8_blockscale_gemm/fp8_blockscale_gemm.cu' 2026-04-14T21:03:01,724 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fp8_blockscale_gemm/fp8_blockscale_gemm.h' 2026-04-14T21:03:01,730 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fp8_blockscale_gemm/fp8_blockscale_gemm_kernel.cuh' 2026-04-14T21:03:01,733 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fp8_blockscale_gemm/fp8_blockscale_mma_utils.cuh' 2026-04-14T21:03:01,735 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fp8_blockscale_gemm/fp8_blockscale_tma_utils.cuh' 2026-04-14T21:03:01,738 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fp8_blockscale_gemm/ada_blockwise_gemm/sm89_fp8_gemm_1d1d.cuh' 2026-04-14T21:03:01,740 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fp8_blockscale_gemm/ada_blockwise_gemm/sm89_utils.cuh' 2026-04-14T21:03:01,742 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/bf16_int4_gemm_fg_scalebias.cu' 2026-04-14T21:03:01,743 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/bf16_int4_gemm_fg_scaleonly.cu' 2026-04-14T21:03:01,744 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/bf16_int4_gemm_per_col.cu' 2026-04-14T21:03:01,746 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/bf16_int8_gemm_fg_scalebias.cu' 2026-04-14T21:03:01,747 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/bf16_int8_gemm_fg_scaleonly.cu' 2026-04-14T21:03:01,748 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/bf16_int8_gemm_per_col.cu' 2026-04-14T21:03:01,749 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/e4m3_int4_gemm_fg_scalebias_bf16_out_bf16.cu' 2026-04-14T21:03:01,751 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/e4m3_int4_gemm_fg_scalebias_f16_out_f16.cu' 2026-04-14T21:03:01,752 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/e4m3_int4_gemm_fg_scaleonly_bf16_out_bf16.cu' 2026-04-14T21:03:01,753 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/e4m3_int4_gemm_fg_scaleonly_f16_out_f16.cu' 2026-04-14T21:03:01,754 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/e4m3_int4_gemm_per_col_f16_out_f16.cu' 2026-04-14T21:03:01,756 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/fp16_int4_gemm_fg_scalebias.cu' 2026-04-14T21:03:01,757 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/fp16_int4_gemm_fg_scaleonly.cu' 2026-04-14T21:03:01,758 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/fp16_int4_gemm_per_col.cu' 2026-04-14T21:03:01,759 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/fp16_int8_gemm_fg_scalebias.cu' 2026-04-14T21:03:01,761 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/fp16_int8_gemm_fg_scaleonly.cu' 2026-04-14T21:03:01,762 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/fp16_int8_gemm_per_col.cu' 2026-04-14T21:03:01,763 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/fpA_intB_gemm.h' 2026-04-14T21:03:01,767 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/fpA_intB_gemm_template.h' 2026-04-14T21:03:01,769 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/fpA_intB_gemm_template_sm90.h' 2026-04-14T21:03:01,770 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/launchers/fpA_intB_launcher_sm90.h' 2026-04-14T21:03:01,773 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/fpA_intB_gemm/launchers/fpA_intB_launcher_sm90.inl' 2026-04-14T21:03:01,775 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/include/common.h' 2026-04-14T21:03:01,776 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/include/cutlass_kernel_selector.h' 2026-04-14T21:03:01,778 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/include/moe_gemm_kernels.h' 2026-04-14T21:03:01,783 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/include/moe_kernels.h' 2026-04-14T21:03:01,785 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/include/moe_util_kernels.h' 2026-04-14T21:03:01,787 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/moe_gemm_kernels_bf16_bf16.cu' 2026-04-14T21:03:01,788 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/moe_gemm_kernels_bf16_fp4.cu' 2026-04-14T21:03:01,789 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/moe_gemm_kernels_bf16_fp8.cu' 2026-04-14T21:03:01,791 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/moe_gemm_kernels_bf16_uint4.cu' 2026-04-14T21:03:01,792 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/moe_gemm_kernels_bf16_uint8.cu' 2026-04-14T21:03:01,793 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/moe_gemm_kernels_fp16_fp16.cu' 2026-04-14T21:03:01,795 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/moe_gemm_kernels_fp16_fp4.cu' 2026-04-14T21:03:01,796 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/moe_gemm_kernels_fp16_uint4.cu' 2026-04-14T21:03:01,797 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/moe_gemm_kernels_fp16_uint8.cu' 2026-04-14T21:03:01,798 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/moe_gemm_kernels_fp32_fp32.cu' 2026-04-14T21:03:01,800 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/moe_gemm_kernels_fp4_fp4.cu' 2026-04-14T21:03:01,801 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/moe_gemm_kernels_fp8_fp4.cu' 2026-04-14T21:03:01,802 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/moe_gemm_kernels_fp8_fp8.cu' 2026-04-14T21:03:01,803 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/moe_gemm_kernels_fp8_uint4.cu' 2026-04-14T21:03:01,808 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/moe_gemm_template_dispatch.h' 2026-04-14T21:03:01,811 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/moe_gemm_template_dispatch_tma_ws.h' 2026-04-14T21:03:01,813 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/moe_gemm_template_dispatch_tma_ws_mixed_dtype.h' 2026-04-14T21:03:01,815 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/moe_gemm_tma_warp_specialized_input.cu' 2026-04-14T21:03:01,816 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/moe_kernels.cuh' 2026-04-14T21:03:01,817 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/moe_tma_warp_specialized_traits.h' 2026-04-14T21:03:01,819 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/launchers/fused_moe_gemm_launcher_sm80.h' 2026-04-14T21:03:01,821 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/launchers/fused_moe_gemm_launcher_sm80.inl' 2026-04-14T21:03:01,822 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/launchers/moe_gemm_tma_ws_launcher.h' 2026-04-14T21:03:01,829 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/launchers/moe_gemm_tma_ws_launcher.inl' 2026-04-14T21:03:01,832 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/launchers/moe_gemm_tma_ws_mixed_input_launcher.h' 2026-04-14T21:03:01,834 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/cutlass_kernels/moe_gemm/launchers/moe_gemm_tma_ws_mixed_input_launcher.inl' 2026-04-14T21:03:01,835 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/lora/lora.cpp' 2026-04-14T21:03:01,837 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/kernels/lora/lora.h' 2026-04-14T21:03:01,840 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/thop/fp4Op.cpp' 2026-04-14T21:03:01,841 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/thop/fp4Quantize.cpp' 2026-04-14T21:03:01,843 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/thop/fp4Quantize.h' 2026-04-14T21:03:01,845 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/thop/fp8Quantize.cpp' 2026-04-14T21:03:01,846 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/thop/fp8Quantize.h' 2026-04-14T21:03:01,847 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/thop/moeAlltoAllMeta.h' 2026-04-14T21:03:01,849 adding 'flashinfer/data/csrc/nv_internal/tensorrt_llm/thop/utils.h' 2026-04-14T21:03:01,851 adding 'flashinfer/data/csrc/xqa/barriers.cuh' 2026-04-14T21:03:01,852 adding 'flashinfer/data/csrc/xqa/cuda_hint.cuh' 2026-04-14T21:03:01,854 adding 'flashinfer/data/csrc/xqa/defines.h' 2026-04-14T21:03:01,855 adding 'flashinfer/data/csrc/xqa/gmma.cuh' 2026-04-14T21:03:01,865 adding 'flashinfer/data/csrc/xqa/gmma_impl.cuh' 2026-04-14T21:03:01,868 adding 'flashinfer/data/csrc/xqa/hostUtils.h' 2026-04-14T21:03:01,870 adding 'flashinfer/data/csrc/xqa/ldgsts.cuh' 2026-04-14T21:03:01,884 adding 'flashinfer/data/csrc/xqa/mha.cu' 2026-04-14T21:03:01,886 adding 'flashinfer/data/csrc/xqa/mha.h' 2026-04-14T21:03:01,889 adding 'flashinfer/data/csrc/xqa/mhaUtils.cuh' 2026-04-14T21:03:01,890 adding 'flashinfer/data/csrc/xqa/mha_components.cuh' 2026-04-14T21:03:01,904 adding 'flashinfer/data/csrc/xqa/mha_sm90.cu' 2026-04-14T21:03:01,907 adding 'flashinfer/data/csrc/xqa/mha_stdheaders.cuh' 2026-04-14T21:03:01,915 adding 'flashinfer/data/csrc/xqa/mla_sm120.cu' 2026-04-14T21:03:01,917 adding 'flashinfer/data/csrc/xqa/mla_sm120.cuh' 2026-04-14T21:03:01,918 adding 'flashinfer/data/csrc/xqa/mma.cuh' 2026-04-14T21:03:01,919 adding 'flashinfer/data/csrc/xqa/platform.h' 2026-04-14T21:03:01,921 adding 'flashinfer/data/csrc/xqa/specDec.h' 2026-04-14T21:03:01,922 adding 'flashinfer/data/csrc/xqa/tensorMap.cpp' 2026-04-14T21:03:01,923 adding 'flashinfer/data/csrc/xqa/tensorMap.h' 2026-04-14T21:03:01,925 adding 'flashinfer/data/csrc/xqa/tma.h' 2026-04-14T21:03:01,929 adding 'flashinfer/data/csrc/xqa/utils.cuh' 2026-04-14T21:03:01,930 adding 'flashinfer/data/csrc/xqa/utils.h' 2026-04-14T21:03:01,932 adding 'flashinfer/data/csrc/xqa/xqa_wrapper.cu' 2026-04-14T21:03:01,935 adding 'flashinfer/data/cutlass/examples/40_cutlass_py/conv2d.py' 2026-04-14T21:03:01,937 adding 'flashinfer/data/cutlass/examples/40_cutlass_py/gemm.py' 2026-04-14T21:03:01,939 adding 'flashinfer/data/cutlass/examples/40_cutlass_py/gemm_grouped.py' 2026-04-14T21:03:01,941 adding 'flashinfer/data/cutlass/examples/40_cutlass_py/customizable/conv2d.py' 2026-04-14T21:03:01,943 adding 'flashinfer/data/cutlass/examples/40_cutlass_py/customizable/gemm.py' 2026-04-14T21:03:01,945 adding 'flashinfer/data/cutlass/examples/40_cutlass_py/customizable/gemm_grouped.py' 2026-04-14T21:03:01,948 adding 'flashinfer/data/cutlass/examples/41_fused_multi_head_attention/fmha_backward_test.py' 2026-04-14T21:03:01,949 adding 'flashinfer/data/cutlass/examples/41_fused_multi_head_attention/piped_subprocess.py' 2026-04-14T21:03:01,952 adding 'flashinfer/data/cutlass/examples/44_multi_gemm_ir_and_codegen/ir_gen/gen_all_code.py' 2026-04-14T21:03:01,953 adding 'flashinfer/data/cutlass/examples/44_multi_gemm_ir_and_codegen/ir_gen/gen_cmake.py' 2026-04-14T21:03:01,955 adding 'flashinfer/data/cutlass/examples/44_multi_gemm_ir_and_codegen/ir_gen/gen_customized_epilogue.py' 2026-04-14T21:03:01,957 adding 'flashinfer/data/cutlass/examples/44_multi_gemm_ir_and_codegen/ir_gen/gen_device.py' 2026-04-14T21:03:01,959 adding 'flashinfer/data/cutlass/examples/44_multi_gemm_ir_and_codegen/ir_gen/gen_ir.py' 2026-04-14T21:03:01,961 adding 'flashinfer/data/cutlass/examples/44_multi_gemm_ir_and_codegen/ir_gen/gen_kernel.py' 2026-04-14T21:03:01,963 adding 'flashinfer/data/cutlass/examples/44_multi_gemm_ir_and_codegen/ir_gen/gen_sample.py' 2026-04-14T21:03:01,968 adding 'flashinfer/data/cutlass/examples/44_multi_gemm_ir_and_codegen/ir_gen/gen_threadblock.py' 2026-04-14T21:03:01,970 adding 'flashinfer/data/cutlass/examples/44_multi_gemm_ir_and_codegen/ir_gen/gen_turing_and_volta.py' 2026-04-14T21:03:01,972 adding 'flashinfer/data/cutlass/examples/44_multi_gemm_ir_and_codegen/ir_gen/gen_verify.py' 2026-04-14T21:03:01,974 adding 'flashinfer/data/cutlass/examples/44_multi_gemm_ir_and_codegen/ir_gen/helper.py' 2026-04-14T21:03:01,975 adding 'flashinfer/data/cutlass/examples/44_multi_gemm_ir_and_codegen/ir_gen/replace_fix_impl_header.py' 2026-04-14T21:03:01,978 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/ampere/call_bypass_dlpack.py' 2026-04-14T21:03:01,980 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/ampere/call_from_jit.py' 2026-04-14T21:03:01,983 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/ampere/cooperative_launch.py' 2026-04-14T21:03:01,985 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/ampere/dynamic_smem_size.py' 2026-04-14T21:03:01,987 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/ampere/elementwise_add.py' 2026-04-14T21:03:01,990 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/ampere/elementwise_add_autotune.py' 2026-04-14T21:03:01,992 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/ampere/elementwise_apply.py' 2026-04-14T21:03:01,997 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/ampere/flash_attention_v2.py' 2026-04-14T21:03:02,002 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/ampere/hstu_attention.py' 2026-04-14T21:03:02,004 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/ampere/inline_ptx.py' 2026-04-14T21:03:02,008 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/ampere/sgemm.py' 2026-04-14T21:03:02,010 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/ampere/smem_allocator.py' 2026-04-14T21:03:02,014 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/ampere/tensorop_gemm.py' 2026-04-14T21:03:02,026 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/dense_blockscaled_gemm_persistent.py' 2026-04-14T21:03:02,036 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/dense_blockscaled_gemm_persistent_amax.py' 2026-04-14T21:03:02,046 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/dense_blockscaled_gemm_persistent_prefetch.py' 2026-04-14T21:03:02,054 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/dense_gemm.py' 2026-04-14T21:03:02,062 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/dense_gemm_alpha_beta_persistent.py' 2026-04-14T21:03:02,070 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/dense_gemm_persistent.py' 2026-04-14T21:03:02,078 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/dense_gemm_persistent_dynamic.py' 2026-04-14T21:03:02,086 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/dense_gemm_persistent_prefetch.py' 2026-04-14T21:03:02,094 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/dense_gemm_software_pipeline.py' 2026-04-14T21:03:02,105 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/fmha.py' 2026-04-14T21:03:02,117 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/fmha_bwd.py' 2026-04-14T21:03:02,129 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/grouped_blockscaled_gemm.py' 2026-04-14T21:03:02,139 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/grouped_gemm.py' 2026-04-14T21:03:02,156 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/mla.py' 2026-04-14T21:03:02,160 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/programmatic_dependent_launch.py' 2026-04-14T21:03:02,163 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/reduce.py' 2026-04-14T21:03:02,166 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/rmsnorm.py' 2026-04-14T21:03:02,177 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/sm103_dense_blockscaled_gemm_persistent.py' 2026-04-14T21:03:02,188 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/blockwise_gemm/blockwise_gemm.py' 2026-04-14T21:03:02,199 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/blockwise_gemm/contiguous_grouped_gemm.py' 2026-04-14T21:03:02,210 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/blockwise_gemm/masked_grouped_gemm.py' 2026-04-14T21:03:02,214 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/epilogue/activation_custom_epilogue_dense_gemm.py' 2026-04-14T21:03:02,222 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/epilogue/common_dense_gemm_efc.py' 2026-04-14T21:03:02,229 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/epilogue/common_efc.py' 2026-04-14T21:03:02,232 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/epilogue/custom_epilogue_dense_gemm.py' 2026-04-14T21:03:02,234 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/epilogue/synthetic_custom_epilogue_dense_gemm.py' 2026-04-14T21:03:02,247 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/mamba2_ssd/mamba2_ssd.py' 2026-04-14T21:03:02,250 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/mamba2_ssd/mamba2_ssd_reference.py' 2026-04-14T21:03:02,252 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/mamba2_ssd/mamba2_ssd_tile_scheduler.py' 2026-04-14T21:03:02,260 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/mixed_input_fmha/mixed_input_fmha_decode.py' 2026-04-14T21:03:02,267 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/mixed_input_fmha/mixed_input_fmha_prefill_d256.py' 2026-04-14T21:03:02,275 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/mixed_input_fmha/mixed_input_fmha_prefill_d512.py' 2026-04-14T21:03:02,277 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/mixed_input_fmha/prefill_helpers.py' 2026-04-14T21:03:02,288 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/mixed_input_gemm/grouped_mixed_input_gemm.py' 2026-04-14T21:03:02,298 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/mixed_input_gemm/grouped_mixed_input_gemm_acc_scale.py' 2026-04-14T21:03:02,307 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/mixed_input_gemm/mixed_input_gemm.py' 2026-04-14T21:03:02,310 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/mixed_input_gemm/mixed_input_host_utils.py' 2026-04-14T21:03:02,325 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/mla/mla_decode_fp16.py' 2026-04-14T21:03:02,340 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/mla/mla_decode_fp8.py' 2026-04-14T21:03:02,343 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/mla/mla_helpers.py' 2026-04-14T21:03:02,346 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/tutorial_gemm/fp16_gemm_0.py' 2026-04-14T21:03:02,348 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/tutorial_gemm/fp16_gemm_1.py' 2026-04-14T21:03:02,352 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/tutorial_gemm/nvfp4_gemm_0.py' 2026-04-14T21:03:02,355 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/tutorial_gemm/nvfp4_gemm_1.py' 2026-04-14T21:03:02,358 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell/tutorial_gemm/utils.py' 2026-04-14T21:03:02,363 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/blackwell_geforce/dense_gemm.py' 2026-04-14T21:03:02,366 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/cute/print_latex.py' 2026-04-14T21:03:02,367 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/cute/torch_fake_tensor.py' 2026-04-14T21:03:02,369 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/cute/export/export_to_c.py' 2026-04-14T21:03:02,370 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/cute/export/load_in_python.py' 2026-04-14T21:03:02,373 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/cute/ffi/jit_argument.py' 2026-04-14T21:03:02,375 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/cute/tvm_ffi/ampere_gemm_with_fake_tensor.py' 2026-04-14T21:03:02,376 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/cute/tvm_ffi/aot_export.py' 2026-04-14T21:03:02,378 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/cute/tvm_ffi/aot_use_in_jax.py' 2026-04-14T21:03:02,379 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/cute/tvm_ffi/aot_use_in_torch.py' 2026-04-14T21:03:02,380 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/cute/tvm_ffi/compile_with_fake_tensor.py' 2026-04-14T21:03:02,382 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/cute/tvm_ffi/error_reporting.py' 2026-04-14T21:03:02,383 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/cute/tvm_ffi/jit_and_use_in_jax.py' 2026-04-14T21:03:02,385 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/cute/tvm_ffi/jit_and_use_in_torch.py' 2026-04-14T21:03:02,388 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/distributed/all_reduce_one_shot_lamport.py' 2026-04-14T21:03:02,390 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/distributed/all_reduce_simple.py' 2026-04-14T21:03:02,393 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/distributed/all_reduce_tma.py' 2026-04-14T21:03:02,395 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/distributed/all_reduce_two_shot_multimem.py' 2026-04-14T21:03:02,405 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/distributed/distributed_all_gather_gemm_blackwell.py' 2026-04-14T21:03:02,415 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/distributed/distributed_gemm_all_reduce_blackwell.py' 2026-04-14T21:03:02,425 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/distributed/distributed_gemm_reduce_scatter_blackwell.py' 2026-04-14T21:03:02,428 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/experimental/ampere/memcpy_simt_universal_copy.py' 2026-04-14T21:03:02,433 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/experimental/blackwell/dense_block_scaled_gemm.py' 2026-04-14T21:03:02,441 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/experimental/blackwell/dense_gemm.py' 2026-04-14T21:03:02,443 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/experimental/blackwell/dense_gemm_2sm.py' 2026-04-14T21:03:02,451 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/experimental/blackwell/dense_gemm_cute_pipeline.py' 2026-04-14T21:03:02,454 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/experimental/blackwell/dense_gemm_ptr_array.py' 2026-04-14T21:03:02,456 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/helpers/__init__.py' 2026-04-14T21:03:02,459 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/helpers/fmha_helpers.py' 2026-04-14T21:03:02,462 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/hopper/cta_norm.py' 2026-04-14T21:03:02,468 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/hopper/dense_gemm.py' 2026-04-14T21:03:02,474 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/hopper/dense_gemm_persistent.py' 2026-04-14T21:03:02,483 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/hopper/fmha.py' 2026-04-14T21:03:02,486 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/jax/cutlass_call_basic.py' 2026-04-14T21:03:02,488 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/jax/cutlass_call_export.py' 2026-04-14T21:03:02,490 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/jax/cutlass_call_sharding.py' 2026-04-14T21:03:02,492 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/jax/elementwise_apply_example.py' 2026-04-14T21:03:02,493 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/utils/__init__.py' 2026-04-14T21:03:02,497 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/utils/fmha_helpers.py' 2026-04-14T21:03:02,499 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/utils/sparse_utils.py' 2026-04-14T21:03:02,500 adding 'flashinfer/data/cutlass/examples/python/CuTeDSL/utils/test_sparse_utils.py' 2026-04-14T21:03:02,503 adding 'flashinfer/data/cutlass/include/cute/config.hpp' 2026-04-14T21:03:02,506 adding 'flashinfer/data/cutlass/include/cute/int_tuple.hpp' 2026-04-14T21:03:02,513 adding 'flashinfer/data/cutlass/include/cute/layout.hpp' 2026-04-14T21:03:02,515 adding 'flashinfer/data/cutlass/include/cute/layout_composed.hpp' 2026-04-14T21:03:02,518 adding 'flashinfer/data/cutlass/include/cute/pointer.hpp' 2026-04-14T21:03:02,519 adding 'flashinfer/data/cutlass/include/cute/pointer_base.hpp' 2026-04-14T21:03:02,521 adding 'flashinfer/data/cutlass/include/cute/pointer_flagged.hpp' 2026-04-14T21:03:02,523 adding 'flashinfer/data/cutlass/include/cute/pointer_sparse.hpp' 2026-04-14T21:03:02,524 adding 'flashinfer/data/cutlass/include/cute/pointer_swizzle.hpp' 2026-04-14T21:03:02,527 adding 'flashinfer/data/cutlass/include/cute/stride.hpp' 2026-04-14T21:03:02,529 adding 'flashinfer/data/cutlass/include/cute/swizzle.hpp' 2026-04-14T21:03:02,532 adding 'flashinfer/data/cutlass/include/cute/swizzle_layout.hpp' 2026-04-14T21:03:02,534 adding 'flashinfer/data/cutlass/include/cute/tensor.hpp' 2026-04-14T21:03:02,538 adding 'flashinfer/data/cutlass/include/cute/tensor_impl.hpp' 2026-04-14T21:03:02,540 adding 'flashinfer/data/cutlass/include/cute/tensor_zip.hpp' 2026-04-14T21:03:02,541 adding 'flashinfer/data/cutlass/include/cute/underscore.hpp' 2026-04-14T21:03:02,543 adding 'flashinfer/data/cutlass/include/cute/algorithm/axpby.hpp' 2026-04-14T21:03:02,545 adding 'flashinfer/data/cutlass/include/cute/algorithm/clear.hpp' 2026-04-14T21:03:02,547 adding 'flashinfer/data/cutlass/include/cute/algorithm/cooperative_copy.hpp' 2026-04-14T21:03:02,549 adding 'flashinfer/data/cutlass/include/cute/algorithm/cooperative_gemm.hpp' 2026-04-14T21:03:02,552 adding 'flashinfer/data/cutlass/include/cute/algorithm/copy.hpp' 2026-04-14T21:03:02,553 adding 'flashinfer/data/cutlass/include/cute/algorithm/fill.hpp' 2026-04-14T21:03:02,555 adding 'flashinfer/data/cutlass/include/cute/algorithm/functional.hpp' 2026-04-14T21:03:02,557 adding 'flashinfer/data/cutlass/include/cute/algorithm/gemm.hpp' 2026-04-14T21:03:02,558 adding 'flashinfer/data/cutlass/include/cute/algorithm/prefer.hpp' 2026-04-14T21:03:02,559 adding 'flashinfer/data/cutlass/include/cute/algorithm/prefetch.hpp' 2026-04-14T21:03:02,561 adding 'flashinfer/data/cutlass/include/cute/algorithm/tensor_algorithms.hpp' 2026-04-14T21:03:02,562 adding 'flashinfer/data/cutlass/include/cute/algorithm/tensor_reduce.hpp' 2026-04-14T21:03:02,565 adding 'flashinfer/data/cutlass/include/cute/algorithm/tuple_algorithms.hpp' 2026-04-14T21:03:02,568 adding 'flashinfer/data/cutlass/include/cute/arch/cluster_sm100.hpp' 2026-04-14T21:03:02,569 adding 'flashinfer/data/cutlass/include/cute/arch/cluster_sm90.hpp' 2026-04-14T21:03:02,571 adding 'flashinfer/data/cutlass/include/cute/arch/config.hpp' 2026-04-14T21:03:02,572 adding 'flashinfer/data/cutlass/include/cute/arch/copy.hpp' 2026-04-14T21:03:02,584 adding 'flashinfer/data/cutlass/include/cute/arch/copy_sm100.hpp' 2026-04-14T21:03:02,588 adding 'flashinfer/data/cutlass/include/cute/arch/copy_sm100_tma.hpp' 2026-04-14T21:03:02,590 adding 'flashinfer/data/cutlass/include/cute/arch/copy_sm50.hpp' 2026-04-14T21:03:02,591 adding 'flashinfer/data/cutlass/include/cute/arch/copy_sm75.hpp' 2026-04-14T21:03:02,592 adding 'flashinfer/data/cutlass/include/cute/arch/copy_sm80.hpp' 2026-04-14T21:03:02,594 adding 'flashinfer/data/cutlass/include/cute/arch/copy_sm90.hpp' 2026-04-14T21:03:02,596 adding 'flashinfer/data/cutlass/include/cute/arch/copy_sm90_desc.hpp' 2026-04-14T21:03:02,599 adding 'flashinfer/data/cutlass/include/cute/arch/copy_sm90_tma.hpp' 2026-04-14T21:03:02,601 adding 'flashinfer/data/cutlass/include/cute/arch/mma.hpp' 2026-04-14T21:03:02,603 adding 'flashinfer/data/cutlass/include/cute/arch/mma_sm100.hpp' 2026-04-14T21:03:02,605 adding 'flashinfer/data/cutlass/include/cute/arch/mma_sm100_desc.hpp' 2026-04-14T21:03:02,609 adding 'flashinfer/data/cutlass/include/cute/arch/mma_sm100_umma.hpp' 2026-04-14T21:03:02,614 adding 'flashinfer/data/cutlass/include/cute/arch/mma_sm120.hpp' 2026-04-14T21:03:02,619 adding 'flashinfer/data/cutlass/include/cute/arch/mma_sm120_sparse.hpp' 2026-04-14T21:03:02,621 adding 'flashinfer/data/cutlass/include/cute/arch/mma_sm61.hpp' 2026-04-14T21:03:02,623 adding 'flashinfer/data/cutlass/include/cute/arch/mma_sm70.hpp' 2026-04-14T21:03:02,624 adding 'flashinfer/data/cutlass/include/cute/arch/mma_sm75.hpp' 2026-04-14T21:03:02,628 adding 'flashinfer/data/cutlass/include/cute/arch/mma_sm80.hpp' 2026-04-14T21:03:02,629 adding 'flashinfer/data/cutlass/include/cute/arch/mma_sm89.hpp' 2026-04-14T21:03:02,642 adding 'flashinfer/data/cutlass/include/cute/arch/mma_sm90.hpp' 2026-04-14T21:03:02,646 adding 'flashinfer/data/cutlass/include/cute/arch/mma_sm90_desc.hpp' 2026-04-14T21:03:02,681 adding 'flashinfer/data/cutlass/include/cute/arch/mma_sm90_gmma.hpp' 2026-04-14T21:03:02,772 adding 'flashinfer/data/cutlass/include/cute/arch/mma_sm90_gmma_ext.hpp' 2026-04-14T21:03:02,826 adding 'flashinfer/data/cutlass/include/cute/arch/mma_sm90_gmma_sparse.hpp' 2026-04-14T21:03:02,923 adding 'flashinfer/data/cutlass/include/cute/arch/mma_sm90_gmma_sparse_ext.hpp' 2026-04-14T21:03:02,942 adding 'flashinfer/data/cutlass/include/cute/arch/simd_sm100.hpp' 2026-04-14T21:03:02,944 adding 'flashinfer/data/cutlass/include/cute/arch/tmem_allocator_sm100.hpp' 2026-04-14T21:03:02,945 adding 'flashinfer/data/cutlass/include/cute/arch/util.hpp' 2026-04-14T21:03:02,949 adding 'flashinfer/data/cutlass/include/cute/atom/copy_atom.hpp' 2026-04-14T21:03:02,951 adding 'flashinfer/data/cutlass/include/cute/atom/copy_traits.hpp' 2026-04-14T21:03:02,958 adding 'flashinfer/data/cutlass/include/cute/atom/copy_traits_sm100.hpp' 2026-04-14T21:03:02,961 adding 'flashinfer/data/cutlass/include/cute/atom/copy_traits_sm100_im2col.hpp' 2026-04-14T21:03:02,964 adding 'flashinfer/data/cutlass/include/cute/atom/copy_traits_sm100_tma.hpp' 2026-04-14T21:03:02,965 adding 'flashinfer/data/cutlass/include/cute/atom/copy_traits_sm50.hpp' 2026-04-14T21:03:02,966 adding 'flashinfer/data/cutlass/include/cute/atom/copy_traits_sm75.hpp' 2026-04-14T21:03:02,968 adding 'flashinfer/data/cutlass/include/cute/atom/copy_traits_sm80.hpp' 2026-04-14T21:03:02,969 adding 'flashinfer/data/cutlass/include/cute/atom/copy_traits_sm90.hpp' 2026-04-14T21:03:02,973 adding 'flashinfer/data/cutlass/include/cute/atom/copy_traits_sm90_im2col.hpp' 2026-04-14T21:03:02,981 adding 'flashinfer/data/cutlass/include/cute/atom/copy_traits_sm90_tma.hpp' 2026-04-14T21:03:02,982 adding 'flashinfer/data/cutlass/include/cute/atom/copy_traits_sm90_tma_swizzle.hpp' 2026-04-14T21:03:02,985 adding 'flashinfer/data/cutlass/include/cute/atom/mma_atom.hpp' 2026-04-14T21:03:02,987 adding 'flashinfer/data/cutlass/include/cute/atom/mma_traits.hpp' 2026-04-14T21:03:02,997 adding 'flashinfer/data/cutlass/include/cute/atom/mma_traits_sm100.hpp' 2026-04-14T21:03:03,000 adding 'flashinfer/data/cutlass/include/cute/atom/mma_traits_sm120.hpp' 2026-04-14T21:03:03,002 adding 'flashinfer/data/cutlass/include/cute/atom/mma_traits_sm120_sparse.hpp' 2026-04-14T21:03:03,004 adding 'flashinfer/data/cutlass/include/cute/atom/mma_traits_sm61.hpp' 2026-04-14T21:03:03,005 adding 'flashinfer/data/cutlass/include/cute/atom/mma_traits_sm70.hpp' 2026-04-14T21:03:03,007 adding 'flashinfer/data/cutlass/include/cute/atom/mma_traits_sm75.hpp' 2026-04-14T21:03:03,009 adding 'flashinfer/data/cutlass/include/cute/atom/mma_traits_sm80.hpp' 2026-04-14T21:03:03,010 adding 'flashinfer/data/cutlass/include/cute/atom/mma_traits_sm89.hpp' 2026-04-14T21:03:03,012 adding 'flashinfer/data/cutlass/include/cute/atom/mma_traits_sm90.hpp' 2026-04-14T21:03:03,022 adding 'flashinfer/data/cutlass/include/cute/atom/mma_traits_sm90_gmma.hpp' 2026-04-14T21:03:03,045 adding 'flashinfer/data/cutlass/include/cute/atom/mma_traits_sm90_gmma_ext.hpp' 2026-04-14T21:03:03,058 adding 'flashinfer/data/cutlass/include/cute/atom/mma_traits_sm90_gmma_sparse.hpp' 2026-04-14T21:03:03,077 adding 'flashinfer/data/cutlass/include/cute/atom/mma_traits_sm90_gmma_sparse_ext.hpp' 2026-04-14T21:03:03,082 adding 'flashinfer/data/cutlass/include/cute/atom/partitioner.hpp' 2026-04-14T21:03:03,084 adding 'flashinfer/data/cutlass/include/cute/container/alignment.hpp' 2026-04-14T21:03:03,086 adding 'flashinfer/data/cutlass/include/cute/container/array.hpp' 2026-04-14T21:03:03,087 adding 'flashinfer/data/cutlass/include/cute/container/array_aligned.hpp' 2026-04-14T21:03:03,090 adding 'flashinfer/data/cutlass/include/cute/container/array_subbyte.hpp' 2026-04-14T21:03:03,091 adding 'flashinfer/data/cutlass/include/cute/container/bit_field.hpp' 2026-04-14T21:03:03,093 adding 'flashinfer/data/cutlass/include/cute/container/cuda_types.hpp' 2026-04-14T21:03:03,095 adding 'flashinfer/data/cutlass/include/cute/container/tuple.hpp' 2026-04-14T21:03:03,097 adding 'flashinfer/data/cutlass/include/cute/container/type_list.hpp' 2026-04-14T21:03:03,099 adding 'flashinfer/data/cutlass/include/cute/numeric/arithmetic_tuple.hpp' 2026-04-14T21:03:03,101 adding 'flashinfer/data/cutlass/include/cute/numeric/complex.hpp' 2026-04-14T21:03:03,102 adding 'flashinfer/data/cutlass/include/cute/numeric/int.hpp' 2026-04-14T21:03:03,104 adding 'flashinfer/data/cutlass/include/cute/numeric/integer_sequence.hpp' 2026-04-14T21:03:03,106 adding 'flashinfer/data/cutlass/include/cute/numeric/integral_constant.hpp' 2026-04-14T21:03:03,108 adding 'flashinfer/data/cutlass/include/cute/numeric/integral_ratio.hpp' 2026-04-14T21:03:03,109 adding 'flashinfer/data/cutlass/include/cute/numeric/math.hpp' 2026-04-14T21:03:03,111 adding 'flashinfer/data/cutlass/include/cute/numeric/numeric_types.hpp' 2026-04-14T21:03:03,112 adding 'flashinfer/data/cutlass/include/cute/numeric/real.hpp' 2026-04-14T21:03:03,114 adding 'flashinfer/data/cutlass/include/cute/util/debug.hpp' 2026-04-14T21:03:03,115 adding 'flashinfer/data/cutlass/include/cute/util/print.hpp' 2026-04-14T21:03:03,117 adding 'flashinfer/data/cutlass/include/cute/util/print_latex.hpp' 2026-04-14T21:03:03,119 adding 'flashinfer/data/cutlass/include/cute/util/print_svg.hpp' 2026-04-14T21:03:03,120 adding 'flashinfer/data/cutlass/include/cute/util/print_tensor.hpp' 2026-04-14T21:03:03,122 adding 'flashinfer/data/cutlass/include/cute/util/type_traits.hpp' 2026-04-14T21:03:03,125 adding 'flashinfer/data/cutlass/include/cutlass/aligned_buffer.h' 2026-04-14T21:03:03,129 adding 'flashinfer/data/cutlass/include/cutlass/array.h' 2026-04-14T21:03:03,131 adding 'flashinfer/data/cutlass/include/cutlass/array_planar_complex.h' 2026-04-14T21:03:03,133 adding 'flashinfer/data/cutlass/include/cutlass/array_subbyte.h' 2026-04-14T21:03:03,135 adding 'flashinfer/data/cutlass/include/cutlass/barrier.h' 2026-04-14T21:03:03,137 adding 'flashinfer/data/cutlass/include/cutlass/bfloat16.h' 2026-04-14T21:03:03,138 adding 'flashinfer/data/cutlass/include/cutlass/blas3.h' 2026-04-14T21:03:03,140 adding 'flashinfer/data/cutlass/include/cutlass/blas3_types.h' 2026-04-14T21:03:03,142 adding 'flashinfer/data/cutlass/include/cutlass/block_striped.h' 2026-04-14T21:03:03,144 adding 'flashinfer/data/cutlass/include/cutlass/cluster_launch.hpp' 2026-04-14T21:03:03,147 adding 'flashinfer/data/cutlass/include/cutlass/complex.h' 2026-04-14T21:03:03,150 adding 'flashinfer/data/cutlass/include/cutlass/constants.h' 2026-04-14T21:03:03,152 adding 'flashinfer/data/cutlass/include/cutlass/coord.h' 2026-04-14T21:03:03,154 adding 'flashinfer/data/cutlass/include/cutlass/core_io.h' 2026-04-14T21:03:03,156 adding 'flashinfer/data/cutlass/include/cutlass/cuda_host_adapter.hpp' 2026-04-14T21:03:03,158 adding 'flashinfer/data/cutlass/include/cutlass/cutlass.h' 2026-04-14T21:03:03,159 adding 'flashinfer/data/cutlass/include/cutlass/device_kernel.h' 2026-04-14T21:03:03,164 adding 'flashinfer/data/cutlass/include/cutlass/exmy_base.h' 2026-04-14T21:03:03,167 adding 'flashinfer/data/cutlass/include/cutlass/fast_math.h' 2026-04-14T21:03:03,171 adding 'flashinfer/data/cutlass/include/cutlass/float8.h' 2026-04-14T21:03:03,174 adding 'flashinfer/data/cutlass/include/cutlass/float_subbyte.h' 2026-04-14T21:03:03,176 adding 'flashinfer/data/cutlass/include/cutlass/floating_point_nvrtc.h' 2026-04-14T21:03:03,179 adding 'flashinfer/data/cutlass/include/cutlass/functional.h' 2026-04-14T21:03:03,181 adding 'flashinfer/data/cutlass/include/cutlass/gemm_coord.h' 2026-04-14T21:03:03,182 adding 'flashinfer/data/cutlass/include/cutlass/gemm_coord.hpp' 2026-04-14T21:03:03,185 adding 'flashinfer/data/cutlass/include/cutlass/half.h' 2026-04-14T21:03:03,186 adding 'flashinfer/data/cutlass/include/cutlass/integer_subbyte.h' 2026-04-14T21:03:03,188 adding 'flashinfer/data/cutlass/include/cutlass/kernel_hardware_info.h' 2026-04-14T21:03:03,189 adding 'flashinfer/data/cutlass/include/cutlass/kernel_hardware_info.hpp' 2026-04-14T21:03:03,191 adding 'flashinfer/data/cutlass/include/cutlass/kernel_launch.h' 2026-04-14T21:03:03,212 adding 'flashinfer/data/cutlass/include/cutlass/matrix.h' 2026-04-14T21:03:03,216 adding 'flashinfer/data/cutlass/include/cutlass/matrix_coord.h' 2026-04-14T21:03:03,217 adding 'flashinfer/data/cutlass/include/cutlass/matrix_shape.h' 2026-04-14T21:03:03,233 adding 'flashinfer/data/cutlass/include/cutlass/numeric_conversion.h' 2026-04-14T21:03:03,237 adding 'flashinfer/data/cutlass/include/cutlass/numeric_size.h' 2026-04-14T21:03:03,238 adding 'flashinfer/data/cutlass/include/cutlass/numeric_types.h' 2026-04-14T21:03:03,239 adding 'flashinfer/data/cutlass/include/cutlass/pitch_linear_coord.h' 2026-04-14T21:03:03,241 adding 'flashinfer/data/cutlass/include/cutlass/predicate_vector.h' 2026-04-14T21:03:03,244 adding 'flashinfer/data/cutlass/include/cutlass/quaternion.h' 2026-04-14T21:03:03,245 adding 'flashinfer/data/cutlass/include/cutlass/real.h' 2026-04-14T21:03:03,247 adding 'flashinfer/data/cutlass/include/cutlass/relatively_equal.h' 2026-04-14T21:03:03,248 adding 'flashinfer/data/cutlass/include/cutlass/semaphore.h' 2026-04-14T21:03:03,251 adding 'flashinfer/data/cutlass/include/cutlass/subbyte_reference.h' 2026-04-14T21:03:03,253 adding 'flashinfer/data/cutlass/include/cutlass/tensor_coord.h' 2026-04-14T21:03:03,255 adding 'flashinfer/data/cutlass/include/cutlass/tensor_ref.h' 2026-04-14T21:03:03,257 adding 'flashinfer/data/cutlass/include/cutlass/tensor_ref_planar_complex.h' 2026-04-14T21:03:03,259 adding 'flashinfer/data/cutlass/include/cutlass/tensor_view.h' 2026-04-14T21:03:03,260 adding 'flashinfer/data/cutlass/include/cutlass/tensor_view_planar_complex.h' 2026-04-14T21:03:03,262 adding 'flashinfer/data/cutlass/include/cutlass/tfloat32.h' 2026-04-14T21:03:03,264 adding 'flashinfer/data/cutlass/include/cutlass/trace.h' 2026-04-14T21:03:03,266 adding 'flashinfer/data/cutlass/include/cutlass/uint128.h' 2026-04-14T21:03:03,267 adding 'flashinfer/data/cutlass/include/cutlass/uint256.h' 2026-04-14T21:03:03,268 adding 'flashinfer/data/cutlass/include/cutlass/version.h' 2026-04-14T21:03:03,270 adding 'flashinfer/data/cutlass/include/cutlass/wmma_array.h' 2026-04-14T21:03:03,271 adding 'flashinfer/data/cutlass/include/cutlass/workspace.h' 2026-04-14T21:03:03,274 adding 'flashinfer/data/cutlass/include/cutlass/arch/arch.h' 2026-04-14T21:03:03,277 adding 'flashinfer/data/cutlass/include/cutlass/arch/barrier.h' 2026-04-14T21:03:03,279 adding 'flashinfer/data/cutlass/include/cutlass/arch/cache_operation.h' 2026-04-14T21:03:03,280 adding 'flashinfer/data/cutlass/include/cutlass/arch/config.h' 2026-04-14T21:03:03,282 adding 'flashinfer/data/cutlass/include/cutlass/arch/grid_dependency_control.h' 2026-04-14T21:03:03,284 adding 'flashinfer/data/cutlass/include/cutlass/arch/memory.h' 2026-04-14T21:03:03,285 adding 'flashinfer/data/cutlass/include/cutlass/arch/memory_sm75.h' 2026-04-14T21:03:03,287 adding 'flashinfer/data/cutlass/include/cutlass/arch/memory_sm80.h' 2026-04-14T21:03:03,289 adding 'flashinfer/data/cutlass/include/cutlass/arch/mma.h' 2026-04-14T21:03:03,290 adding 'flashinfer/data/cutlass/include/cutlass/arch/mma_sm100.h' 2026-04-14T21:03:03,292 adding 'flashinfer/data/cutlass/include/cutlass/arch/mma_sm50.h' 2026-04-14T21:03:03,294 adding 'flashinfer/data/cutlass/include/cutlass/arch/mma_sm60.h' 2026-04-14T21:03:03,295 adding 'flashinfer/data/cutlass/include/cutlass/arch/mma_sm61.h' 2026-04-14T21:03:03,297 adding 'flashinfer/data/cutlass/include/cutlass/arch/mma_sm70.h' 2026-04-14T21:03:03,299 adding 'flashinfer/data/cutlass/include/cutlass/arch/mma_sm75.h' 2026-04-14T21:03:03,301 adding 'flashinfer/data/cutlass/include/cutlass/arch/mma_sm80.h' 2026-04-14T21:03:03,303 adding 'flashinfer/data/cutlass/include/cutlass/arch/mma_sm89.h' 2026-04-14T21:03:03,305 adding 'flashinfer/data/cutlass/include/cutlass/arch/mma_sm90.h' 2026-04-14T21:03:03,307 adding 'flashinfer/data/cutlass/include/cutlass/arch/mma_sparse_sm80.h' 2026-04-14T21:03:03,309 adding 'flashinfer/data/cutlass/include/cutlass/arch/mma_sparse_sm89.h' 2026-04-14T21:03:03,311 adding 'flashinfer/data/cutlass/include/cutlass/arch/reg_reconfig.h' 2026-04-14T21:03:03,312 adding 'flashinfer/data/cutlass/include/cutlass/arch/simd.h' 2026-04-14T21:03:03,314 adding 'flashinfer/data/cutlass/include/cutlass/arch/simd_sm60.h' 2026-04-14T21:03:03,315 adding 'flashinfer/data/cutlass/include/cutlass/arch/simd_sm61.h' 2026-04-14T21:03:03,318 adding 'flashinfer/data/cutlass/include/cutlass/arch/synclog.hpp' 2026-04-14T21:03:03,320 adding 'flashinfer/data/cutlass/include/cutlass/arch/wmma.h' 2026-04-14T21:03:03,322 adding 'flashinfer/data/cutlass/include/cutlass/arch/wmma_sm70.h' 2026-04-14T21:03:03,323 adding 'flashinfer/data/cutlass/include/cutlass/arch/wmma_sm72.h' 2026-04-14T21:03:03,324 adding 'flashinfer/data/cutlass/include/cutlass/arch/wmma_sm75.h' 2026-04-14T21:03:03,328 adding 'flashinfer/data/cutlass/include/cutlass/conv/conv2d_problem_size.h' 2026-04-14T21:03:03,330 adding 'flashinfer/data/cutlass/include/cutlass/conv/conv3d_problem_size.h' 2026-04-14T21:03:03,332 adding 'flashinfer/data/cutlass/include/cutlass/conv/convnd_problem_shape.hpp' 2026-04-14T21:03:03,334 adding 'flashinfer/data/cutlass/include/cutlass/conv/convolution.h' 2026-04-14T21:03:03,335 adding 'flashinfer/data/cutlass/include/cutlass/conv/detail.hpp' 2026-04-14T21:03:03,337 adding 'flashinfer/data/cutlass/include/cutlass/conv/dispatch_policy.hpp' 2026-04-14T21:03:03,339 adding 'flashinfer/data/cutlass/include/cutlass/conv/collective/collective_builder.hpp' 2026-04-14T21:03:03,340 adding 'flashinfer/data/cutlass/include/cutlass/conv/collective/collective_conv.hpp' 2026-04-14T21:03:03,342 adding 'flashinfer/data/cutlass/include/cutlass/conv/collective/detail.hpp' 2026-04-14T21:03:03,346 adding 'flashinfer/data/cutlass/include/cutlass/conv/collective/sm100_implicit_gemm_umma_warpspecialized.hpp' 2026-04-14T21:03:03,350 adding 'flashinfer/data/cutlass/include/cutlass/conv/collective/sm90_implicit_gemm_gmma_ss_warpspecialized.hpp' 2026-04-14T21:03:03,353 adding 'flashinfer/data/cutlass/include/cutlass/conv/collective/builders/sm100_common.inl' 2026-04-14T21:03:03,355 adding 'flashinfer/data/cutlass/include/cutlass/conv/collective/builders/sm100_umma_builder.inl' 2026-04-14T21:03:03,356 adding 'flashinfer/data/cutlass/include/cutlass/conv/collective/builders/sm90_common.inl' 2026-04-14T21:03:03,358 adding 'flashinfer/data/cutlass/include/cutlass/conv/collective/builders/sm90_gmma_builder.inl' 2026-04-14T21:03:03,361 adding 'flashinfer/data/cutlass/include/cutlass/conv/device/conv_universal_adapter.hpp' 2026-04-14T21:03:03,363 adding 'flashinfer/data/cutlass/include/cutlass/conv/device/direct_convolution.h' 2026-04-14T21:03:03,365 adding 'flashinfer/data/cutlass/include/cutlass/conv/device/implicit_gemm_convolution.h' 2026-04-14T21:03:03,367 adding 'flashinfer/data/cutlass/include/cutlass/conv/device/implicit_gemm_convolution_fusion.h' 2026-04-14T21:03:03,369 adding 'flashinfer/data/cutlass/include/cutlass/conv/kernel/conv_universal.hpp' 2026-04-14T21:03:03,370 adding 'flashinfer/data/cutlass/include/cutlass/conv/kernel/default_conv2d.h' 2026-04-14T21:03:03,373 adding 'flashinfer/data/cutlass/include/cutlass/conv/kernel/default_conv2d_dgrad.h' 2026-04-14T21:03:03,376 adding 'flashinfer/data/cutlass/include/cutlass/conv/kernel/default_conv2d_fprop.h' 2026-04-14T21:03:03,378 adding 'flashinfer/data/cutlass/include/cutlass/conv/kernel/default_conv2d_fprop_fusion.h' 2026-04-14T21:03:03,380 adding 'flashinfer/data/cutlass/include/cutlass/conv/kernel/default_conv2d_fprop_with_absmax.h' 2026-04-14T21:03:03,381 adding 'flashinfer/data/cutlass/include/cutlass/conv/kernel/default_conv2d_fprop_with_broadcast.h' 2026-04-14T21:03:03,383 adding 'flashinfer/data/cutlass/include/cutlass/conv/kernel/default_conv2d_fprop_with_reduction.h' 2026-04-14T21:03:03,385 adding 'flashinfer/data/cutlass/include/cutlass/conv/kernel/default_conv2d_group_fprop.h' 2026-04-14T21:03:03,387 adding 'flashinfer/data/cutlass/include/cutlass/conv/kernel/default_conv2d_wgrad.h' 2026-04-14T21:03:03,389 adding 'flashinfer/data/cutlass/include/cutlass/conv/kernel/default_conv2d_wgrad_fusion.h' 2026-04-14T21:03:03,392 adding 'flashinfer/data/cutlass/include/cutlass/conv/kernel/default_conv3d_dgrad.h' 2026-04-14T21:03:03,394 adding 'flashinfer/data/cutlass/include/cutlass/conv/kernel/default_conv3d_fprop.h' 2026-04-14T21:03:03,396 adding 'flashinfer/data/cutlass/include/cutlass/conv/kernel/default_conv3d_fprop_fusion.h' 2026-04-14T21:03:03,397 adding 'flashinfer/data/cutlass/include/cutlass/conv/kernel/default_conv3d_fprop_with_broadcast.h' 2026-04-14T21:03:03,399 adding 'flashinfer/data/cutlass/include/cutlass/conv/kernel/default_conv3d_wgrad.h' 2026-04-14T21:03:03,401 adding 'flashinfer/data/cutlass/include/cutlass/conv/kernel/default_deconv2d.h' 2026-04-14T21:03:03,403 adding 'flashinfer/data/cutlass/include/cutlass/conv/kernel/default_deconv2d_with_broadcast.h' 2026-04-14T21:03:03,405 adding 'flashinfer/data/cutlass/include/cutlass/conv/kernel/default_deconv3d.h' 2026-04-14T21:03:03,407 adding 'flashinfer/data/cutlass/include/cutlass/conv/kernel/default_deconv3d_with_broadcast.h' 2026-04-14T21:03:03,409 adding 'flashinfer/data/cutlass/include/cutlass/conv/kernel/default_depthwise_fprop.h' 2026-04-14T21:03:03,411 adding 'flashinfer/data/cutlass/include/cutlass/conv/kernel/direct_convolution.h' 2026-04-14T21:03:03,414 adding 'flashinfer/data/cutlass/include/cutlass/conv/kernel/implicit_gemm_convolution.h' 2026-04-14T21:03:03,416 adding 'flashinfer/data/cutlass/include/cutlass/conv/kernel/implicit_gemm_convolution_fusion.h' 2026-04-14T21:03:03,418 adding 'flashinfer/data/cutlass/include/cutlass/conv/kernel/implicit_gemm_convolution_strided_dgrad.h' 2026-04-14T21:03:03,421 adding 'flashinfer/data/cutlass/include/cutlass/conv/kernel/implicit_gemm_convolution_with_absmax.h' 2026-04-14T21:03:03,423 adding 'flashinfer/data/cutlass/include/cutlass/conv/kernel/implicit_gemm_convolution_with_fused_epilogue.h' 2026-04-14T21:03:03,428 adding 'flashinfer/data/cutlass/include/cutlass/conv/kernel/sm100_implicit_gemm_tma_warpspecialized.hpp' 2026-04-14T21:03:03,429 adding 'flashinfer/data/cutlass/include/cutlass/conv/kernel/sm90_implicit_gemm_tma_warpspecialized.hpp' 2026-04-14T21:03:03,431 adding 'flashinfer/data/cutlass/include/cutlass/conv/thread/depthwise_mma.h' 2026-04-14T21:03:03,435 adding 'flashinfer/data/cutlass/include/cutlass/conv/threadblock/conv2d_dgrad_filter_tile_access_iterator_analytic.h' 2026-04-14T21:03:03,438 adding 'flashinfer/data/cutlass/include/cutlass/conv/threadblock/conv2d_dgrad_filter_tile_access_iterator_optimized.h' 2026-04-14T21:03:03,440 adding 'flashinfer/data/cutlass/include/cutlass/conv/threadblock/conv2d_dgrad_output_gradient_tile_access_iterator_analytic.h' 2026-04-14T21:03:03,443 adding 'flashinfer/data/cutlass/include/cutlass/conv/threadblock/conv2d_dgrad_output_gradient_tile_access_iterator_optimized.h' 2026-04-14T21:03:03,445 adding 'flashinfer/data/cutlass/include/cutlass/conv/threadblock/conv2d_fprop_activation_tile_access_iterator_analytic.h' 2026-04-14T21:03:03,447 adding 'flashinfer/data/cutlass/include/cutlass/conv/threadblock/conv2d_fprop_activation_tile_access_iterator_few_channels.h' 2026-04-14T21:03:03,449 adding 'flashinfer/data/cutlass/include/cutlass/conv/threadblock/conv2d_fprop_activation_tile_access_iterator_fixed_channels.h' 2026-04-14T21:03:03,451 adding 'flashinfer/data/cutlass/include/cutlass/conv/threadblock/conv2d_fprop_activation_tile_access_iterator_optimized.h' 2026-04-14T21:03:03,454 adding 'flashinfer/data/cutlass/include/cutlass/conv/threadblock/conv2d_fprop_filter_tile_access_iterator_analytic.h' 2026-04-14T21:03:03,455 adding 'flashinfer/data/cutlass/include/cutlass/conv/threadblock/conv2d_fprop_filter_tile_access_iterator_few_channels.h' 2026-04-14T21:03:03,457 adding 'flashinfer/data/cutlass/include/cutlass/conv/threadblock/conv2d_fprop_filter_tile_access_iterator_fixed_channels.h' 2026-04-14T21:03:03,459 adding 'flashinfer/data/cutlass/include/cutlass/conv/threadblock/conv2d_fprop_filter_tile_access_iterator_optimized.h' 2026-04-14T21:03:03,462 adding 'flashinfer/data/cutlass/include/cutlass/conv/threadblock/conv2d_params.h' 2026-04-14T21:03:03,464 adding 'flashinfer/data/cutlass/include/cutlass/conv/threadblock/conv2d_tile_iterator.h' 2026-04-14T21:03:03,466 adding 'flashinfer/data/cutlass/include/cutlass/conv/threadblock/conv2d_wgrad_activation_tile_access_iterator_analytic.h' 2026-04-14T21:03:03,468 adding 'flashinfer/data/cutlass/include/cutlass/conv/threadblock/conv2d_wgrad_activation_tile_access_iterator_optimized.h' 2026-04-14T21:03:03,470 adding 'flashinfer/data/cutlass/include/cutlass/conv/threadblock/conv2d_wgrad_output_gradient_tile_access_iterator_analytic.h' 2026-04-14T21:03:03,472 adding 'flashinfer/data/cutlass/include/cutlass/conv/threadblock/conv2d_wgrad_output_gradient_tile_access_iterator_optimized.h' 2026-04-14T21:03:03,474 adding 'flashinfer/data/cutlass/include/cutlass/conv/threadblock/conv3d_dgrad_filter_tile_access_iterator_analytic.h' 2026-04-14T21:03:03,476 adding 'flashinfer/data/cutlass/include/cutlass/conv/threadblock/conv3d_dgrad_filter_tile_access_iterator_optimized.h' 2026-04-14T21:03:03,478 adding 'flashinfer/data/cutlass/include/cutlass/conv/threadblock/conv3d_dgrad_output_gradient_tile_access_iterator_analytic.h' 2026-04-14T21:03:03,480 adding 'flashinfer/data/cutlass/include/cutlass/conv/threadblock/conv3d_dgrad_output_gradient_tile_access_iterator_optimized.h' 2026-04-14T21:03:03,482 adding 'flashinfer/data/cutlass/include/cutlass/conv/threadblock/conv3d_fprop_activation_tile_access_iterator_analytic.h' 2026-04-14T21:03:03,485 adding 'flashinfer/data/cutlass/include/cutlass/conv/threadblock/conv3d_fprop_activation_tile_access_iterator_optimized.h' 2026-04-14T21:03:03,487 adding 'flashinfer/data/cutlass/include/cutlass/conv/threadblock/conv3d_fprop_filter_tile_access_iterator_analytic.h' 2026-04-14T21:03:03,489 adding 'flashinfer/data/cutlass/include/cutlass/conv/threadblock/conv3d_fprop_filter_tile_access_iterator_optimized.h' 2026-04-14T21:03:03,491 adding 'flashinfer/data/cutlass/include/cutlass/conv/threadblock/conv3d_params.h' 2026-04-14T21:03:03,493 adding 'flashinfer/data/cutlass/include/cutlass/conv/threadblock/conv3d_wgrad_activation_tile_access_iterator_analytic.h' 2026-04-14T21:03:03,495 adding 'flashinfer/data/cutlass/include/cutlass/conv/threadblock/conv3d_wgrad_activation_tile_access_iterator_optimized.h' 2026-04-14T21:03:03,496 adding 'flashinfer/data/cutlass/include/cutlass/conv/threadblock/conv3d_wgrad_output_gradient_tile_access_iterator_analytic.h' 2026-04-14T21:03:03,498 adding 'flashinfer/data/cutlass/include/cutlass/conv/threadblock/conv3d_wgrad_output_gradient_tile_access_iterator_optimized.h' 2026-04-14T21:03:03,500 adding 'flashinfer/data/cutlass/include/cutlass/conv/threadblock/depthwise_direct_conv_params.h' 2026-04-14T21:03:03,502 adding 'flashinfer/data/cutlass/include/cutlass/conv/threadblock/depthwise_fprop_activation_tile_access_iterator_direct_conv_fixed_stride_dilation.h' 2026-04-14T21:03:03,504 adding 'flashinfer/data/cutlass/include/cutlass/conv/threadblock/depthwise_fprop_activation_tile_access_iterator_direct_conv_optimized.h' 2026-04-14T21:03:03,507 adding 'flashinfer/data/cutlass/include/cutlass/conv/threadblock/depthwise_fprop_direct_conv_multistage.h' 2026-04-14T21:03:03,509 adding 'flashinfer/data/cutlass/include/cutlass/conv/threadblock/depthwise_fprop_filter_tile_access_iterator_direct_conv_optimized.h' 2026-04-14T21:03:03,511 adding 'flashinfer/data/cutlass/include/cutlass/conv/threadblock/depthwise_fprop_pipelined.h' 2026-04-14T21:03:03,513 adding 'flashinfer/data/cutlass/include/cutlass/conv/threadblock/depthwise_mma_base.h' 2026-04-14T21:03:03,516 adding 'flashinfer/data/cutlass/include/cutlass/conv/threadblock/depthwise_mma_core_with_lane_access_size.h' 2026-04-14T21:03:03,519 adding 'flashinfer/data/cutlass/include/cutlass/conv/threadblock/implicit_gemm_fprop_fusion_multistage.h' 2026-04-14T21:03:03,522 adding 'flashinfer/data/cutlass/include/cutlass/conv/threadblock/implicit_gemm_multistage.h' 2026-04-14T21:03:03,524 adding 'flashinfer/data/cutlass/include/cutlass/conv/threadblock/implicit_gemm_pipelined.h' 2026-04-14T21:03:03,527 adding 'flashinfer/data/cutlass/include/cutlass/conv/threadblock/implicit_gemm_wgrad_fusion_multistage.h' 2026-04-14T21:03:03,529 adding 'flashinfer/data/cutlass/include/cutlass/conv/threadblock/predicated_scale_bias_vector_access_iterator.h' 2026-04-14T21:03:03,531 adding 'flashinfer/data/cutlass/include/cutlass/conv/threadblock/predicated_scale_bias_vector_iterator.h' 2026-04-14T21:03:03,533 adding 'flashinfer/data/cutlass/include/cutlass/conv/threadblock/threadblock_swizzle.h' 2026-04-14T21:03:03,535 adding 'flashinfer/data/cutlass/include/cutlass/conv/warp/mma_depthwise_simt.h' 2026-04-14T21:03:03,538 adding 'flashinfer/data/cutlass/include/cutlass/conv/warp/mma_depthwise_simt_tile_iterator.h' 2026-04-14T21:03:03,540 adding 'flashinfer/data/cutlass/include/cutlass/conv/warp/scale_bias_relu_transform.h' 2026-04-14T21:03:03,542 adding 'flashinfer/data/cutlass/include/cutlass/detail/blockwise_scale_layout.hpp' 2026-04-14T21:03:03,544 adding 'flashinfer/data/cutlass/include/cutlass/detail/cluster.hpp' 2026-04-14T21:03:03,545 adding 'flashinfer/data/cutlass/include/cutlass/detail/collective.hpp' 2026-04-14T21:03:03,547 adding 'flashinfer/data/cutlass/include/cutlass/detail/dependent_false.hpp' 2026-04-14T21:03:03,549 adding 'flashinfer/data/cutlass/include/cutlass/detail/helper_macros.hpp' 2026-04-14T21:03:03,551 adding 'flashinfer/data/cutlass/include/cutlass/detail/layout.hpp' 2026-04-14T21:03:03,552 adding 'flashinfer/data/cutlass/include/cutlass/detail/mainloop_fusion_helper_scale_factor.hpp' 2026-04-14T21:03:03,554 adding 'flashinfer/data/cutlass/include/cutlass/detail/mma.hpp' 2026-04-14T21:03:03,556 adding 'flashinfer/data/cutlass/include/cutlass/detail/sm100_blockscaled_layout.hpp' 2026-04-14T21:03:03,558 adding 'flashinfer/data/cutlass/include/cutlass/detail/sm100_mixed_dtype_blockwise_layout.hpp' 2026-04-14T21:03:03,559 adding 'flashinfer/data/cutlass/include/cutlass/detail/sm100_tmem_helper.hpp' 2026-04-14T21:03:03,561 adding 'flashinfer/data/cutlass/include/cutlass/detail/sm103_blockscaled_layout.hpp' 2026-04-14T21:03:03,566 adding 'flashinfer/data/cutlass/include/cutlass/detail/collective/mixed_input_utils.hpp' 2026-04-14T21:03:03,568 adding 'flashinfer/data/cutlass/include/cutlass/detail/collective/moe_stride_utils.hpp' 2026-04-14T21:03:03,569 adding 'flashinfer/data/cutlass/include/cutlass/detail/collective/sm103_kernel_type.hpp' 2026-04-14T21:03:03,572 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/dispatch_policy.hpp' 2026-04-14T21:03:03,574 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/collective/collective_builder.hpp' 2026-04-14T21:03:03,576 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/collective/collective_epilogue.hpp' 2026-04-14T21:03:03,578 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/collective/default_epilogue.hpp' 2026-04-14T21:03:03,580 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/collective/default_epilogue_array.hpp' 2026-04-14T21:03:03,582 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/collective/detail.hpp' 2026-04-14T21:03:03,584 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/collective/epilogue_tensor_broadcast.hpp' 2026-04-14T21:03:03,588 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/collective/sm100_epilogue_array_nosmem.hpp' 2026-04-14T21:03:03,591 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/collective/sm100_epilogue_array_planar_complex_nosmem.hpp' 2026-04-14T21:03:03,596 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/collective/sm100_epilogue_array_planar_complex_tma_warpspecialized.hpp' 2026-04-14T21:03:03,604 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/collective/sm100_epilogue_array_tma_warpspecialized.hpp' 2026-04-14T21:03:03,608 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/collective/sm100_epilogue_nosmem.hpp' 2026-04-14T21:03:03,613 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/collective/sm100_epilogue_planar_complex_tma_warpspecialized.hpp' 2026-04-14T21:03:03,619 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/collective/sm100_epilogue_tma_warpspecialized.hpp' 2026-04-14T21:03:03,622 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/collective/sm70_epilogue_vectorized.hpp' 2026-04-14T21:03:03,624 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/collective/sm70_epilogue_vectorized_array.hpp' 2026-04-14T21:03:03,630 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/collective/sm90_epilogue_array_tma_warpspecialized.hpp' 2026-04-14T21:03:03,635 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/collective/sm90_epilogue_tma_warpspecialized.hpp' 2026-04-14T21:03:03,637 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/collective/sm90_epilogue_tma_warpspecialized_bias_elementwise.hpp' 2026-04-14T21:03:03,644 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/collective/builders/sm100_builder.inl' 2026-04-14T21:03:03,646 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/collective/builders/sm103_builder.inl' 2026-04-14T21:03:03,649 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/collective/builders/sm120_builder.inl' 2026-04-14T21:03:03,651 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/collective/builders/sm120_common.inl' 2026-04-14T21:03:03,654 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/collective/builders/sm90_builder.inl' 2026-04-14T21:03:03,656 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/collective/builders/sm90_common.inl' 2026-04-14T21:03:03,658 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/fusion/callbacks.hpp' 2026-04-14T21:03:03,660 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/fusion/operations.hpp' 2026-04-14T21:03:03,663 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/fusion/sm100_callbacks_tma_warpspecialized.hpp' 2026-04-14T21:03:03,666 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/fusion/sm100_visitor_compute_tma_warpspecialized.hpp' 2026-04-14T21:03:03,670 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/fusion/sm100_visitor_store_tma_warpspecialized.hpp' 2026-04-14T21:03:03,673 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/fusion/sm120_callbacks_tma_warpspecialized.hpp' 2026-04-14T21:03:03,677 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/fusion/sm120_visitor_store_tma_warpspecialized.hpp' 2026-04-14T21:03:03,683 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/fusion/sm90_callbacks_tma_warpspecialized.hpp' 2026-04-14T21:03:03,687 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/fusion/sm90_visitor_compute_tma_warpspecialized.hpp' 2026-04-14T21:03:03,692 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/fusion/sm90_visitor_load_tma_warpspecialized.hpp' 2026-04-14T21:03:03,699 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/fusion/sm90_visitor_store_tma_warpspecialized.hpp' 2026-04-14T21:03:03,703 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/fusion/sm90_visitor_tma_warpspecialized.hpp' 2026-04-14T21:03:03,707 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/fusion/sm90_visitor_topk_softmax.hpp' 2026-04-14T21:03:03,710 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/thread/activation.h' 2026-04-14T21:03:03,712 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/thread/conversion_op.h' 2026-04-14T21:03:03,713 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/thread/detail.hpp' 2026-04-14T21:03:03,716 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/thread/linear_combination.h' 2026-04-14T21:03:03,718 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/thread/linear_combination_bias_elementwise.h' 2026-04-14T21:03:03,721 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/thread/linear_combination_bias_relu.h' 2026-04-14T21:03:03,723 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/thread/linear_combination_clamp.h' 2026-04-14T21:03:03,725 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/thread/linear_combination_dgelu.h' 2026-04-14T21:03:03,727 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/thread/linear_combination_drelu.h' 2026-04-14T21:03:03,729 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/thread/linear_combination_gelu.h' 2026-04-14T21:03:03,731 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/thread/linear_combination_generic.h' 2026-04-14T21:03:03,733 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/thread/linear_combination_generic_with_scaling.h' 2026-04-14T21:03:03,735 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/thread/linear_combination_hardswish.h' 2026-04-14T21:03:03,736 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/thread/linear_combination_leaky_relu.h' 2026-04-14T21:03:03,738 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/thread/linear_combination_params.h' 2026-04-14T21:03:03,739 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/thread/linear_combination_planar_complex.h' 2026-04-14T21:03:03,741 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/thread/linear_combination_relu.h' 2026-04-14T21:03:03,744 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/thread/linear_combination_relu0.h' 2026-04-14T21:03:03,745 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/thread/linear_combination_residual_block.h' 2026-04-14T21:03:03,747 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/thread/linear_combination_sigmoid.h' 2026-04-14T21:03:03,748 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/thread/linear_combination_silu.h' 2026-04-14T21:03:03,750 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/thread/linear_combination_tensor_broadcast.hpp' 2026-04-14T21:03:03,751 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/thread/linear_combination_with_elementwise.h' 2026-04-14T21:03:03,753 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/thread/reduction_op.h' 2026-04-14T21:03:03,754 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/thread/scale_type.h' 2026-04-14T21:03:03,757 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/default_epilogue_complex_tensor_op.h' 2026-04-14T21:03:03,759 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/default_epilogue_complex_tensor_op_blas3.h' 2026-04-14T21:03:03,760 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/default_epilogue_direct_store.h' 2026-04-14T21:03:03,761 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/default_epilogue_planar_complex.h' 2026-04-14T21:03:03,763 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/default_epilogue_simt.h' 2026-04-14T21:03:03,766 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/default_epilogue_tensor_op.h' 2026-04-14T21:03:03,767 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/default_epilogue_tensor_op_blas3.h' 2026-04-14T21:03:03,769 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/default_epilogue_volta_tensor_op.h' 2026-04-14T21:03:03,771 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/default_epilogue_with_absmax.h' 2026-04-14T21:03:03,772 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/default_epilogue_with_broadcast.h' 2026-04-14T21:03:03,774 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/default_epilogue_with_reduction.h' 2026-04-14T21:03:03,775 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/default_epilogue_wmma_tensor_op.h' 2026-04-14T21:03:03,777 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/default_thread_map_simt.h' 2026-04-14T21:03:03,778 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/default_thread_map_tensor_op.h' 2026-04-14T21:03:03,780 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/default_thread_map_volta_tensor_op.h' 2026-04-14T21:03:03,781 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/default_thread_map_wmma_tensor_op.h' 2026-04-14T21:03:03,783 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/direct_store_epilogue_iterator.h' 2026-04-14T21:03:03,785 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/epilogue.h' 2026-04-14T21:03:03,787 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/epilogue_base.h' 2026-04-14T21:03:03,788 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/epilogue_base_streamk.h' 2026-04-14T21:03:03,790 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/epilogue_depthwise.h' 2026-04-14T21:03:03,792 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/epilogue_direct_store.h' 2026-04-14T21:03:03,794 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/epilogue_gemm_k_reduction.h' 2026-04-14T21:03:03,796 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/epilogue_planar_complex.h' 2026-04-14T21:03:03,797 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/epilogue_smem_accumulator.h' 2026-04-14T21:03:03,799 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/epilogue_streamk_with_broadcast.h' 2026-04-14T21:03:03,802 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/epilogue_visitor_with_softmax.h' 2026-04-14T21:03:03,805 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/epilogue_with_absmax.h' 2026-04-14T21:03:03,810 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/epilogue_with_broadcast.h' 2026-04-14T21:03:03,813 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/epilogue_with_reduction.h' 2026-04-14T21:03:03,815 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/epilogue_with_scaling_factor.h' 2026-04-14T21:03:03,817 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/epilogue_with_visitor.h' 2026-04-14T21:03:03,820 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/epilogue_with_visitor_callbacks.h' 2026-04-14T21:03:03,822 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/epilogue_workspace.h' 2026-04-14T21:03:03,824 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/interleaved_epilogue.h' 2026-04-14T21:03:03,825 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/output_iterator_parameter.h' 2026-04-14T21:03:03,828 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/output_tile_thread_map.h' 2026-04-14T21:03:03,831 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/predicated_tile_iterator.h' 2026-04-14T21:03:03,834 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/predicated_tile_iterator_affine.h' 2026-04-14T21:03:03,836 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/predicated_tile_iterator_affine_layout_params.h' 2026-04-14T21:03:03,838 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/predicated_tile_iterator_blas3.h' 2026-04-14T21:03:03,841 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/predicated_tile_iterator_conv.h' 2026-04-14T21:03:03,843 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/predicated_tile_iterator_direct_conv.h' 2026-04-14T21:03:03,845 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/predicated_tile_iterator_params.h' 2026-04-14T21:03:03,847 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/predicated_tile_iterator_predicates.h' 2026-04-14T21:03:03,849 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/predicated_tile_iterator_strided_dgrad.h' 2026-04-14T21:03:03,851 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/shared_load_iterator.h' 2026-04-14T21:03:03,853 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/shared_load_iterator_mixed.h' 2026-04-14T21:03:03,855 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/shared_load_iterator_pitch_linear.h' 2026-04-14T21:03:03,857 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/fusion/visitor_2x.hpp' 2026-04-14T21:03:03,859 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/fusion/visitor_compute.hpp' 2026-04-14T21:03:03,861 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/fusion/visitor_load.hpp' 2026-04-14T21:03:03,864 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/fusion/visitor_store.hpp' 2026-04-14T21:03:03,865 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/threadblock/fusion/visitors.hpp' 2026-04-14T21:03:03,868 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/warp/fragment_iterator_complex_tensor_op.h' 2026-04-14T21:03:03,869 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/warp/fragment_iterator_gaussian_complex_tensor_op.h' 2026-04-14T21:03:03,871 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/warp/fragment_iterator_simt.h' 2026-04-14T21:03:03,873 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/warp/fragment_iterator_tensor_op.h' 2026-04-14T21:03:03,875 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/warp/fragment_iterator_volta_tensor_op.h' 2026-04-14T21:03:03,876 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/warp/fragment_iterator_wmma_tensor_op.h' 2026-04-14T21:03:03,878 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/warp/simt_policy.h' 2026-04-14T21:03:03,879 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/warp/tensor_op_policy.h' 2026-04-14T21:03:03,882 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/warp/tile_iterator_simt.h' 2026-04-14T21:03:03,884 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/warp/tile_iterator_tensor_op.h' 2026-04-14T21:03:03,887 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/warp/tile_iterator_tensor_op_mixed.h' 2026-04-14T21:03:03,889 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/warp/tile_iterator_volta_tensor_op.h' 2026-04-14T21:03:03,891 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/warp/tile_iterator_wmma_tensor_op.h' 2026-04-14T21:03:03,893 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/warp/volta_tensor_op_policy.h' 2026-04-14T21:03:03,894 adding 'flashinfer/data/cutlass/include/cutlass/epilogue/warp/wmma_tensor_op_policy.h' 2026-04-14T21:03:03,897 adding 'flashinfer/data/cutlass/include/cutlass/experimental/distributed/device/detail.hpp' 2026-04-14T21:03:03,900 adding 'flashinfer/data/cutlass/include/cutlass/experimental/distributed/device/dist_gemm_universal_wrapper.hpp' 2026-04-14T21:03:03,902 adding 'flashinfer/data/cutlass/include/cutlass/experimental/distributed/device/full_barrier.hpp' 2026-04-14T21:03:03,904 adding 'flashinfer/data/cutlass/include/cutlass/experimental/distributed/kernel/detail.hpp' 2026-04-14T21:03:03,906 adding 'flashinfer/data/cutlass/include/cutlass/experimental/distributed/kernel/dist_gemm_kernel_wrapper.hpp' 2026-04-14T21:03:03,907 adding 'flashinfer/data/cutlass/include/cutlass/experimental/distributed/kernel/full_barrier.hpp' 2026-04-14T21:03:03,910 adding 'flashinfer/data/cutlass/include/cutlass/experimental/distributed/schedules/dist_gemm_1d_schedules.hpp' 2026-04-14T21:03:03,913 adding 'flashinfer/data/cutlass/include/cutlass/experimental/distributed/schedules/dist_gemm_base_schedule.hpp' 2026-04-14T21:03:03,919 adding 'flashinfer/data/cutlass/include/cutlass/gemm/dispatch_policy.hpp' 2026-04-14T21:03:03,921 adding 'flashinfer/data/cutlass/include/cutlass/gemm/gemm.h' 2026-04-14T21:03:03,922 adding 'flashinfer/data/cutlass/include/cutlass/gemm/gemm_enumerated_types.h' 2026-04-14T21:03:03,924 adding 'flashinfer/data/cutlass/include/cutlass/gemm/group_array_problem_shape.hpp' 2026-04-14T21:03:03,926 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/collective_builder.hpp' 2026-04-14T21:03:03,928 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/collective_builder_decl.hpp' 2026-04-14T21:03:03,929 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/collective_mma.hpp' 2026-04-14T21:03:03,931 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/collective_mma_decl.hpp' 2026-04-14T21:03:03,932 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/fp8_accumulation.hpp' 2026-04-14T21:03:03,939 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/sm100_blockscaled_mma_array_warpspecialized.hpp' 2026-04-14T21:03:03,946 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/sm100_blockscaled_mma_array_warpspecialized_rcggemm.hpp' 2026-04-14T21:03:03,951 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/sm100_blockscaled_mma_mixed_tma_cpasync_warpspecialized.hpp' 2026-04-14T21:03:03,957 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/sm100_blockscaled_mma_warpspecialized.hpp' 2026-04-14T21:03:03,964 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/sm100_blockscaled_sparse_mma_warpspecialized.hpp' 2026-04-14T21:03:03,969 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/sm100_mma_array_warpspecialized.hpp' 2026-04-14T21:03:03,975 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/sm100_mma_array_warpspecialized_blockwise_scaling.hpp' 2026-04-14T21:03:03,981 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/sm100_mma_array_warpspecialized_emulated.hpp' 2026-04-14T21:03:03,988 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/sm100_mma_array_warpspecialized_interleaved_complex_emulated.hpp' 2026-04-14T21:03:03,993 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/sm100_mma_array_warpspecialized_interleaved_complex_tf32.hpp' 2026-04-14T21:03:03,999 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/sm100_mma_array_warpspecialized_planar_complex.hpp' 2026-04-14T21:03:04,003 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/sm100_mma_array_warpspecialized_rcggemm.hpp' 2026-04-14T21:03:04,007 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/sm100_mma_cpasync_warpspecialized.hpp' 2026-04-14T21:03:04,011 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/sm100_mma_mixed_tma_cpasync_warpspecialized.hpp' 2026-04-14T21:03:04,014 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/sm100_mma_warpspecialized.hpp' 2026-04-14T21:03:04,020 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/sm100_mma_warpspecialized_blockwise_scaling.hpp' 2026-04-14T21:03:04,026 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/sm100_mma_warpspecialized_emulated.hpp' 2026-04-14T21:03:04,032 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/sm100_mma_warpspecialized_interleaved_complex_emulated.hpp' 2026-04-14T21:03:04,037 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/sm100_mma_warpspecialized_interleaved_complex_tf32.hpp' 2026-04-14T21:03:04,043 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/sm100_mma_warpspecialized_mixed_input.hpp' 2026-04-14T21:03:04,047 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/sm100_mma_warpspecialized_planar_complex.hpp' 2026-04-14T21:03:04,052 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/sm100_sparse_mma_warpspecialized.hpp' 2026-04-14T21:03:04,060 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/sm103_blockscaled_mma_array_warpspecialized.hpp' 2026-04-14T21:03:04,067 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/sm103_blockscaled_mma_warpspecialized.hpp' 2026-04-14T21:03:04,072 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/sm120_blockscaled_mma_array_tma.hpp' 2026-04-14T21:03:04,077 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/sm120_blockscaled_mma_tma.hpp' 2026-04-14T21:03:04,083 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/sm120_blockscaled_sparse_mma_tma.hpp' 2026-04-14T21:03:04,088 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/sm120_mma_array_tma_blockwise_scaling.hpp' 2026-04-14T21:03:04,092 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/sm120_mma_tma.hpp' 2026-04-14T21:03:04,096 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/sm120_mma_tma_blockwise_scaling.hpp' 2026-04-14T21:03:04,101 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/sm120_sparse_mma_tma.hpp' 2026-04-14T21:03:04,103 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/sm70_mma_twostage.hpp' 2026-04-14T21:03:04,106 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/sm80_mma_array_multistage.hpp' 2026-04-14T21:03:04,109 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/sm80_mma_multistage.hpp' 2026-04-14T21:03:04,115 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/sm90_mma_array_tma_gmma_rs_warpspecialized_mixed_input.hpp' 2026-04-14T21:03:04,119 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/sm90_mma_array_tma_gmma_ss_warpspecialized.hpp' 2026-04-14T21:03:04,123 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/sm90_mma_array_tma_gmma_ss_warpspecialized_fp8.hpp' 2026-04-14T21:03:04,129 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/sm90_mma_array_tma_gmma_ss_warpspecialized_fp8_blockwise_scaling.hpp' 2026-04-14T21:03:04,133 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/sm90_mma_multistage_gmma_rs_warpspecialized.hpp' 2026-04-14T21:03:04,135 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/sm90_mma_multistage_gmma_ss_warpspecialized.hpp' 2026-04-14T21:03:04,139 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/sm90_mma_tma_gmma_rs_warpspecialized.hpp' 2026-04-14T21:03:04,144 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/sm90_mma_tma_gmma_rs_warpspecialized_mixed_input.hpp' 2026-04-14T21:03:04,148 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/sm90_mma_tma_gmma_ss.hpp' 2026-04-14T21:03:04,151 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/sm90_mma_tma_gmma_ss_warpspecialized.hpp' 2026-04-14T21:03:04,154 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/sm90_mma_tma_gmma_ss_warpspecialized_fp8.hpp' 2026-04-14T21:03:04,159 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/sm90_mma_tma_gmma_ss_warpspecialized_fp8_blockwise_scaling.hpp' 2026-04-14T21:03:04,163 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/sm90_sparse_mma_tma_gmma_ss_warpspecialized.hpp' 2026-04-14T21:03:04,168 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/sm90_sparse_mma_tma_gmma_ss_warpspecialized_fp8.hpp' 2026-04-14T21:03:04,171 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/builders/sm100_9xBF16_interleaved_complex_umma_builder.inl' 2026-04-14T21:03:04,173 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/builders/sm100_9xBF16_umma_builder.inl' 2026-04-14T21:03:04,175 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/builders/sm100_blockscaled_mixed_tma_cpasync_umma_builder.inl' 2026-04-14T21:03:04,178 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/builders/sm100_blockscaled_sparse_umma_builder.inl' 2026-04-14T21:03:04,180 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/builders/sm100_blockscaled_umma_builder.inl' 2026-04-14T21:03:04,183 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/builders/sm100_blockwise_umma_builder.inl' 2026-04-14T21:03:04,186 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/builders/sm100_common.inl' 2026-04-14T21:03:04,188 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/builders/sm100_cpasync_umma_builder.inl' 2026-04-14T21:03:04,190 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/builders/sm100_interleaved_complex_umma_builder.inl' 2026-04-14T21:03:04,193 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/builders/sm100_mixed_input_umma_builder.inl' 2026-04-14T21:03:04,195 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/builders/sm100_mixed_tma_cpasync_umma_builder.inl' 2026-04-14T21:03:04,196 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/builders/sm100_pipeline_carveout.inl' 2026-04-14T21:03:04,198 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/builders/sm100_planar_complex_umma_builder.inl' 2026-04-14T21:03:04,200 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/builders/sm100_simt_builder.inl' 2026-04-14T21:03:04,202 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/builders/sm100_sparse_umma_builder.inl' 2026-04-14T21:03:04,205 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/builders/sm100_umma_builder.inl' 2026-04-14T21:03:04,208 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/builders/sm103_blockscaled_umma_builder.inl' 2026-04-14T21:03:04,210 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/builders/sm120_blockscaled_mma_builder.inl' 2026-04-14T21:03:04,213 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/builders/sm120_blockscaled_sparse_mma_builder.inl' 2026-04-14T21:03:04,215 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/builders/sm120_blockwise_mma_builder.inl' 2026-04-14T21:03:04,216 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/builders/sm120_common.inl' 2026-04-14T21:03:04,218 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/builders/sm120_mma_builder.inl' 2026-04-14T21:03:04,220 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/builders/sm120_sparse_mma_builder.inl' 2026-04-14T21:03:04,224 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/builders/sm1xx_common.inl' 2026-04-14T21:03:04,226 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/builders/sm1xx_sparse_config.inl' 2026-04-14T21:03:04,229 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/builders/sm90_common.inl' 2026-04-14T21:03:04,233 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/builders/sm90_gmma_builder.inl' 2026-04-14T21:03:04,235 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/builders/sm90_sparse_config.inl' 2026-04-14T21:03:04,237 adding 'flashinfer/data/cutlass/include/cutlass/gemm/collective/builders/sm90_sparse_gmma_builder.inl' 2026-04-14T21:03:04,241 adding 'flashinfer/data/cutlass/include/cutlass/gemm/device/base_grouped.h' 2026-04-14T21:03:04,243 adding 'flashinfer/data/cutlass/include/cutlass/gemm/device/default_gemm_configuration.h' 2026-04-14T21:03:04,246 adding 'flashinfer/data/cutlass/include/cutlass/gemm/device/ell_gemm.h' 2026-04-14T21:03:04,249 adding 'flashinfer/data/cutlass/include/cutlass/gemm/device/gemm.h' 2026-04-14T21:03:04,252 adding 'flashinfer/data/cutlass/include/cutlass/gemm/device/gemm_array.h' 2026-04-14T21:03:04,254 adding 'flashinfer/data/cutlass/include/cutlass/gemm/device/gemm_batched.h' 2026-04-14T21:03:04,258 adding 'flashinfer/data/cutlass/include/cutlass/gemm/device/gemm_blockwise.h' 2026-04-14T21:03:04,260 adding 'flashinfer/data/cutlass/include/cutlass/gemm/device/gemm_complex.h' 2026-04-14T21:03:04,262 adding 'flashinfer/data/cutlass/include/cutlass/gemm/device/gemm_grouped.h' 2026-04-14T21:03:04,264 adding 'flashinfer/data/cutlass/include/cutlass/gemm/device/gemm_layernorm_mainloop_fusion.h' 2026-04-14T21:03:04,267 adding 'flashinfer/data/cutlass/include/cutlass/gemm/device/gemm_sparse.h' 2026-04-14T21:03:04,268 adding 'flashinfer/data/cutlass/include/cutlass/gemm/device/gemm_sparse_universal.h' 2026-04-14T21:03:04,270 adding 'flashinfer/data/cutlass/include/cutlass/gemm/device/gemm_sparse_universal_with_absmax.h' 2026-04-14T21:03:04,272 adding 'flashinfer/data/cutlass/include/cutlass/gemm/device/gemm_sparse_with_absmax.h' 2026-04-14T21:03:04,274 adding 'flashinfer/data/cutlass/include/cutlass/gemm/device/gemm_sparse_with_visitor.h' 2026-04-14T21:03:04,277 adding 'flashinfer/data/cutlass/include/cutlass/gemm/device/gemm_splitk_parallel.h' 2026-04-14T21:03:04,279 adding 'flashinfer/data/cutlass/include/cutlass/gemm/device/gemm_universal.h' 2026-04-14T21:03:04,282 adding 'flashinfer/data/cutlass/include/cutlass/gemm/device/gemm_universal_adapter.h' 2026-04-14T21:03:04,285 adding 'flashinfer/data/cutlass/include/cutlass/gemm/device/gemm_universal_base.h' 2026-04-14T21:03:04,286 adding 'flashinfer/data/cutlass/include/cutlass/gemm/device/gemm_universal_streamk_with_broadcast.h' 2026-04-14T21:03:04,288 adding 'flashinfer/data/cutlass/include/cutlass/gemm/device/gemm_universal_with_absmax.h' 2026-04-14T21:03:04,290 adding 'flashinfer/data/cutlass/include/cutlass/gemm/device/gemm_universal_with_broadcast.h' 2026-04-14T21:03:04,292 adding 'flashinfer/data/cutlass/include/cutlass/gemm/device/gemm_with_k_reduction.h' 2026-04-14T21:03:04,294 adding 'flashinfer/data/cutlass/include/cutlass/gemm/device/gemv.h' 2026-04-14T21:03:04,296 adding 'flashinfer/data/cutlass/include/cutlass/gemm/device/gemv_blockscaled.h' 2026-04-14T21:03:04,298 adding 'flashinfer/data/cutlass/include/cutlass/gemm/device/rank_2k.h' 2026-04-14T21:03:04,299 adding 'flashinfer/data/cutlass/include/cutlass/gemm/device/rank_2k_grouped.h' 2026-04-14T21:03:04,302 adding 'flashinfer/data/cutlass/include/cutlass/gemm/device/rank_k.h' 2026-04-14T21:03:04,304 adding 'flashinfer/data/cutlass/include/cutlass/gemm/device/symm.h' 2026-04-14T21:03:04,308 adding 'flashinfer/data/cutlass/include/cutlass/gemm/device/trmm.h' 2026-04-14T21:03:04,313 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_ell_gemm.h' 2026-04-14T21:03:04,316 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_gemm.h' 2026-04-14T21:03:04,318 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_gemm_complex.h' 2026-04-14T21:03:04,320 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_gemm_grouped.h' 2026-04-14T21:03:04,322 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_gemm_grouped_per_group_scale.h' 2026-04-14T21:03:04,323 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_gemm_grouped_softmax_mainloop_fusion.h' 2026-04-14T21:03:04,325 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_gemm_layernorm_mainloop_fusion.h' 2026-04-14T21:03:04,327 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_gemm_planar_complex_universal.h' 2026-04-14T21:03:04,328 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_gemm_sparse.h' 2026-04-14T21:03:04,330 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_gemm_sparse_universal.h' 2026-04-14T21:03:04,332 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_gemm_sparse_universal_with_absmax.h' 2026-04-14T21:03:04,333 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_gemm_sparse_with_absmax.h' 2026-04-14T21:03:04,335 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_gemm_sparse_with_visitor.h' 2026-04-14T21:03:04,336 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_gemm_splitk_parallel.h' 2026-04-14T21:03:04,338 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_gemm_streamk_with_broadcast.h' 2026-04-14T21:03:04,339 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_gemm_universal.h' 2026-04-14T21:03:04,341 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_gemm_universal_with_visitor.h' 2026-04-14T21:03:04,342 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_gemm_with_absmax.h' 2026-04-14T21:03:04,344 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_gemm_with_broadcast.h' 2026-04-14T21:03:04,346 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_gemm_with_k_reduction.h' 2026-04-14T21:03:04,347 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_gemm_with_reduction.h' 2026-04-14T21:03:04,349 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_gemv.h' 2026-04-14T21:03:04,351 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_rank_2k.h' 2026-04-14T21:03:04,352 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_rank_2k_complex.h' 2026-04-14T21:03:04,354 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_rank_2k_grouped.h' 2026-04-14T21:03:04,356 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_rank_2k_universal.h' 2026-04-14T21:03:04,358 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_rank_k.h' 2026-04-14T21:03:04,360 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_rank_k_complex.h' 2026-04-14T21:03:04,361 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_rank_k_universal.h' 2026-04-14T21:03:04,363 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_symm.h' 2026-04-14T21:03:04,365 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_symm_complex.h' 2026-04-14T21:03:04,367 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_symm_universal.h' 2026-04-14T21:03:04,368 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_trmm.h' 2026-04-14T21:03:04,370 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_trmm_complex.h' 2026-04-14T21:03:04,372 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/default_trmm_universal.h' 2026-04-14T21:03:04,374 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/ell_gemm.h' 2026-04-14T21:03:04,376 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/gemm.h' 2026-04-14T21:03:04,378 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/gemm_array.h' 2026-04-14T21:03:04,380 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/gemm_batched.h' 2026-04-14T21:03:04,381 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/gemm_blockwise.h' 2026-04-14T21:03:04,384 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/gemm_grouped.h' 2026-04-14T21:03:04,386 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/gemm_grouped_per_group_scale.h' 2026-04-14T21:03:04,387 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/gemm_grouped_problem_visitor.h' 2026-04-14T21:03:04,389 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/gemm_grouped_softmax_mainloop_fusion.h' 2026-04-14T21:03:04,392 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/gemm_layernorm_mainloop_fusion.h' 2026-04-14T21:03:04,394 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/gemm_params.h' 2026-04-14T21:03:04,395 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/gemm_pipelined.h' 2026-04-14T21:03:04,398 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/gemm_planar_complex.h' 2026-04-14T21:03:04,401 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/gemm_planar_complex_array.h' 2026-04-14T21:03:04,404 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/gemm_sparse_universal.h' 2026-04-14T21:03:04,407 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/gemm_sparse_universal_with_absmax.h' 2026-04-14T21:03:04,408 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/gemm_splitk_parallel.h' 2026-04-14T21:03:04,417 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/gemm_streamk_with_fused_epilogue.h' 2026-04-14T21:03:04,419 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/gemm_transpose_operands.h' 2026-04-14T21:03:04,421 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/gemm_universal.h' 2026-04-14T21:03:04,423 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/gemm_universal.hpp' 2026-04-14T21:03:04,425 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/gemm_universal_blockwise.h' 2026-04-14T21:03:04,426 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/gemm_universal_decl.h' 2026-04-14T21:03:04,431 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/gemm_universal_streamk.h' 2026-04-14T21:03:04,433 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/gemm_universal_with_visitor.h' 2026-04-14T21:03:04,436 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/gemm_universal_with_visitor_streamk.h' 2026-04-14T21:03:04,439 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/gemm_with_absmax.h' 2026-04-14T21:03:04,443 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/gemm_with_fused_epilogue.h' 2026-04-14T21:03:04,446 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/gemm_with_k_reduction.h' 2026-04-14T21:03:04,448 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/gemv.h' 2026-04-14T21:03:04,450 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/gemv_batched_strided.h' 2026-04-14T21:03:04,454 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/gemv_blockscaled.h' 2026-04-14T21:03:04,456 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/grouped_problem_visitor.h' 2026-04-14T21:03:04,458 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/params_sparse_base.h' 2026-04-14T21:03:04,460 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/params_universal_base.h' 2026-04-14T21:03:04,462 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/rank_2k_grouped.h' 2026-04-14T21:03:04,465 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/rank_2k_grouped_problem_visitor.h' 2026-04-14T21:03:04,466 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/rank_2k_transpose_operands.h' 2026-04-14T21:03:04,469 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/rank_2k_universal.h' 2026-04-14T21:03:04,472 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/rank_k_universal.h' 2026-04-14T21:03:04,479 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/sm100_gemm_array_tma_warpspecialized.hpp' 2026-04-14T21:03:04,485 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/sm100_gemm_array_tma_warpspecialized_input_transform.hpp' 2026-04-14T21:03:04,491 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/sm100_gemm_array_tma_warpspecialized_mma_transform.hpp' 2026-04-14T21:03:04,495 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/sm100_gemm_cpasync_warpspecialized.hpp' 2026-04-14T21:03:04,500 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/sm100_gemm_mixed_tma_cpasync_warpspecialized.hpp' 2026-04-14T21:03:04,505 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/sm100_gemm_tma_warpspecialized.hpp' 2026-04-14T21:03:04,510 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/sm100_gemm_tma_warpspecialized_input_transform.hpp' 2026-04-14T21:03:04,516 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/sm100_gemm_tma_warpspecialized_mixed_input_transform.hpp' 2026-04-14T21:03:04,521 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/sm100_gemm_tma_warpspecialized_mma_transform.hpp' 2026-04-14T21:03:04,526 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/sm100_sparse_gemm_tma_warpspecialized.hpp' 2026-04-14T21:03:04,528 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/sm100_static_tile_scheduler.hpp' 2026-04-14T21:03:04,531 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/sm100_tile_scheduler.hpp' 2026-04-14T21:03:04,534 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/sm100_tile_scheduler_group.hpp' 2026-04-14T21:03:04,538 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/sm100_tile_scheduler_stream_k.hpp' 2026-04-14T21:03:04,544 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/sm103_blockscaled_gemm_array_tma_warpspecialized.hpp' 2026-04-14T21:03:04,549 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/sm103_blockscaled_gemm_tma_warpspecialized.hpp' 2026-04-14T21:03:04,554 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/sm120_gemm_tma_warpspecialized_cooperative_asymmetric_dma.hpp' 2026-04-14T21:03:04,556 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/sm70_gemm.hpp' 2026-04-14T21:03:04,558 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/sm70_gemm_array.hpp' 2026-04-14T21:03:04,563 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/sm90_gemm_array_tma_warpspecialized_cooperative.hpp' 2026-04-14T21:03:04,569 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/sm90_gemm_array_tma_warpspecialized_pingpong.hpp' 2026-04-14T21:03:04,571 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/sm90_gemm_tma.hpp' 2026-04-14T21:03:04,574 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/sm90_gemm_tma_warpspecialized.hpp' 2026-04-14T21:03:04,578 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/sm90_gemm_tma_warpspecialized_cooperative.hpp' 2026-04-14T21:03:04,583 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/sm90_gemm_tma_warpspecialized_pingpong.hpp' 2026-04-14T21:03:04,586 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/sm90_gemm_warpspecialized.hpp' 2026-04-14T21:03:04,589 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/sm90_gemm_warpspecialized_cooperative.hpp' 2026-04-14T21:03:04,592 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/sm90_gemm_warpspecialized_pingpong.hpp' 2026-04-14T21:03:04,594 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/sm90_tile_scheduler.hpp' 2026-04-14T21:03:04,597 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/sm90_tile_scheduler_group.hpp' 2026-04-14T21:03:04,602 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/sm90_tile_scheduler_stream_k.hpp' 2026-04-14T21:03:04,605 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/sparse_gemm.h' 2026-04-14T21:03:04,607 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/sparse_gemm_with_absmax.h' 2026-04-14T21:03:04,609 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/sparse_gemm_with_visitor.h' 2026-04-14T21:03:04,612 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/static_tile_scheduler.hpp' 2026-04-14T21:03:04,615 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/symm_universal.h' 2026-04-14T21:03:04,617 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/tile_scheduler.hpp' 2026-04-14T21:03:04,618 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/tile_scheduler_detail.hpp' 2026-04-14T21:03:04,627 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/tile_scheduler_params.h' 2026-04-14T21:03:04,630 adding 'flashinfer/data/cutlass/include/cutlass/gemm/kernel/trmm_universal.h' 2026-04-14T21:03:04,632 adding 'flashinfer/data/cutlass/include/cutlass/gemm/thread/mma.h' 2026-04-14T21:03:04,634 adding 'flashinfer/data/cutlass/include/cutlass/gemm/thread/mma_sm50.h' 2026-04-14T21:03:04,637 adding 'flashinfer/data/cutlass/include/cutlass/gemm/thread/mma_sm60.h' 2026-04-14T21:03:04,639 adding 'flashinfer/data/cutlass/include/cutlass/gemm/thread/mma_sm61.h' 2026-04-14T21:03:04,642 adding 'flashinfer/data/cutlass/include/cutlass/gemm/threadblock/default_ell_mma.h' 2026-04-14T21:03:04,644 adding 'flashinfer/data/cutlass/include/cutlass/gemm/threadblock/default_gemv_core.h' 2026-04-14T21:03:04,647 adding 'flashinfer/data/cutlass/include/cutlass/gemm/threadblock/default_mma.h' 2026-04-14T21:03:04,648 adding 'flashinfer/data/cutlass/include/cutlass/gemm/threadblock/default_mma_core.h' 2026-04-14T21:03:04,651 adding 'flashinfer/data/cutlass/include/cutlass/gemm/threadblock/default_mma_core_simt.h' 2026-04-14T21:03:04,654 adding 'flashinfer/data/cutlass/include/cutlass/gemm/threadblock/default_mma_core_sm70.h' 2026-04-14T21:03:04,657 adding 'flashinfer/data/cutlass/include/cutlass/gemm/threadblock/default_mma_core_sm75.h' 2026-04-14T21:03:04,662 adding 'flashinfer/data/cutlass/include/cutlass/gemm/threadblock/default_mma_core_sm80.h' 2026-04-14T21:03:04,665 adding 'flashinfer/data/cutlass/include/cutlass/gemm/threadblock/default_mma_core_sparse_sm80.h' 2026-04-14T21:03:04,667 adding 'flashinfer/data/cutlass/include/cutlass/gemm/threadblock/default_mma_core_with_access_size.h' 2026-04-14T21:03:04,669 adding 'flashinfer/data/cutlass/include/cutlass/gemm/threadblock/default_mma_core_with_reduction.h' 2026-04-14T21:03:04,672 adding 'flashinfer/data/cutlass/include/cutlass/gemm/threadblock/default_mma_core_wmma.h' 2026-04-14T21:03:04,674 adding 'flashinfer/data/cutlass/include/cutlass/gemm/threadblock/default_mma_layernorm_mainloop_fusion.h' 2026-04-14T21:03:04,675 adding 'flashinfer/data/cutlass/include/cutlass/gemm/threadblock/default_mma_multistage_blockwise.h' 2026-04-14T21:03:04,677 adding 'flashinfer/data/cutlass/include/cutlass/gemm/threadblock/default_mma_planar_complex_multistage.h' 2026-04-14T21:03:04,678 adding 'flashinfer/data/cutlass/include/cutlass/gemm/threadblock/default_mma_planar_complex_pipelined.h' 2026-04-14T21:03:04,680 adding 'flashinfer/data/cutlass/include/cutlass/gemm/threadblock/default_mma_softmax_mainloop_fusion.h' 2026-04-14T21:03:04,682 adding 'flashinfer/data/cutlass/include/cutlass/gemm/threadblock/default_mma_with_reduction.h' 2026-04-14T21:03:04,683 adding 'flashinfer/data/cutlass/include/cutlass/gemm/threadblock/default_multistage_mma_complex.h' 2026-04-14T21:03:04,685 adding 'flashinfer/data/cutlass/include/cutlass/gemm/threadblock/default_multistage_mma_complex_core.h' 2026-04-14T21:03:04,689 adding 'flashinfer/data/cutlass/include/cutlass/gemm/threadblock/default_multistage_mma_complex_core_sm80.h' 2026-04-14T21:03:04,691 adding 'flashinfer/data/cutlass/include/cutlass/gemm/threadblock/default_multistage_trmm_complex.h' 2026-04-14T21:03:04,693 adding 'flashinfer/data/cutlass/include/cutlass/gemm/threadblock/default_sparse_mma.h' 2026-04-14T21:03:04,695 adding 'flashinfer/data/cutlass/include/cutlass/gemm/threadblock/default_trmm.h' 2026-04-14T21:03:04,698 adding 'flashinfer/data/cutlass/include/cutlass/gemm/threadblock/ell_mma_multistage.h' 2026-04-14T21:03:04,700 adding 'flashinfer/data/cutlass/include/cutlass/gemm/threadblock/ell_mma_pipelined.h' 2026-04-14T21:03:04,702 adding 'flashinfer/data/cutlass/include/cutlass/gemm/threadblock/gemv.h' 2026-04-14T21:03:04,703 adding 'flashinfer/data/cutlass/include/cutlass/gemm/threadblock/index_remat.h' 2026-04-14T21:03:04,705 adding 'flashinfer/data/cutlass/include/cutlass/gemm/threadblock/mma_base.h' 2026-04-14T21:03:04,708 adding 'flashinfer/data/cutlass/include/cutlass/gemm/threadblock/mma_blas3_multistage.h' 2026-04-14T21:03:04,712 adding 'flashinfer/data/cutlass/include/cutlass/gemm/threadblock/mma_layernorm_mainloop_fusion_multistage.h' 2026-04-14T21:03:04,715 adding 'flashinfer/data/cutlass/include/cutlass/gemm/threadblock/mma_multistage.h' 2026-04-14T21:03:04,718 adding 'flashinfer/data/cutlass/include/cutlass/gemm/threadblock/mma_multistage_blockwise.h' 2026-04-14T21:03:04,720 adding 'flashinfer/data/cutlass/include/cutlass/gemm/threadblock/mma_pipelined.h' 2026-04-14T21:03:04,722 adding 'flashinfer/data/cutlass/include/cutlass/gemm/threadblock/mma_planar_complex_base.h' 2026-04-14T21:03:04,725 adding 'flashinfer/data/cutlass/include/cutlass/gemm/threadblock/mma_planar_complex_multistage.h' 2026-04-14T21:03:04,727 adding 'flashinfer/data/cutlass/include/cutlass/gemm/threadblock/mma_planar_complex_pipelined.h' 2026-04-14T21:03:04,729 adding 'flashinfer/data/cutlass/include/cutlass/gemm/threadblock/mma_singlestage.h' 2026-04-14T21:03:04,732 adding 'flashinfer/data/cutlass/include/cutlass/gemm/threadblock/mma_softmax_mainloop_fusion_multistage.h' 2026-04-14T21:03:04,734 adding 'flashinfer/data/cutlass/include/cutlass/gemm/threadblock/mma_sparse_base.h' 2026-04-14T21:03:04,737 adding 'flashinfer/data/cutlass/include/cutlass/gemm/threadblock/mma_sparse_multistage.h' 2026-04-14T21:03:04,740 adding 'flashinfer/data/cutlass/include/cutlass/gemm/threadblock/mma_with_reduction_multistage.h' 2026-04-14T21:03:04,742 adding 'flashinfer/data/cutlass/include/cutlass/gemm/threadblock/threadblock_swizzle.h' 2026-04-14T21:03:04,745 adding 'flashinfer/data/cutlass/include/cutlass/gemm/threadblock/threadblock_swizzle_streamk.h' 2026-04-14T21:03:04,749 adding 'flashinfer/data/cutlass/include/cutlass/gemm/warp/default_mma_complex_tensor_op.h' 2026-04-14T21:03:04,750 adding 'flashinfer/data/cutlass/include/cutlass/gemm/warp/default_mma_sparse_tensor_op.h' 2026-04-14T21:03:04,752 adding 'flashinfer/data/cutlass/include/cutlass/gemm/warp/default_mma_tensor_op.h' 2026-04-14T21:03:04,754 adding 'flashinfer/data/cutlass/include/cutlass/gemm/warp/default_mma_tensor_op_sm80.h' 2026-04-14T21:03:04,756 adding 'flashinfer/data/cutlass/include/cutlass/gemm/warp/default_mma_with_reduction_tensor_op.h' 2026-04-14T21:03:04,757 adding 'flashinfer/data/cutlass/include/cutlass/gemm/warp/default_mma_wmma_tensor_op.h' 2026-04-14T21:03:04,759 adding 'flashinfer/data/cutlass/include/cutlass/gemm/warp/layernorm_scale_bias_transform.h' 2026-04-14T21:03:04,760 adding 'flashinfer/data/cutlass/include/cutlass/gemm/warp/mma.h' 2026-04-14T21:03:04,763 adding 'flashinfer/data/cutlass/include/cutlass/gemm/warp/mma_complex_tensor_op.h' 2026-04-14T21:03:04,766 adding 'flashinfer/data/cutlass/include/cutlass/gemm/warp/mma_complex_tensor_op_fast_f32.h' 2026-04-14T21:03:04,771 adding 'flashinfer/data/cutlass/include/cutlass/gemm/warp/mma_complex_tensor_op_tile_iterator_sm80.h' 2026-04-14T21:03:04,774 adding 'flashinfer/data/cutlass/include/cutlass/gemm/warp/mma_gaussian_complex_tensor_op.h' 2026-04-14T21:03:04,776 adding 'flashinfer/data/cutlass/include/cutlass/gemm/warp/mma_gaussian_complex_tensor_op_tile_iterator_sm80.h' 2026-04-14T21:03:04,778 adding 'flashinfer/data/cutlass/include/cutlass/gemm/warp/mma_mixed_input_tensor_op.h' 2026-04-14T21:03:04,780 adding 'flashinfer/data/cutlass/include/cutlass/gemm/warp/mma_planar_complex.h' 2026-04-14T21:03:04,782 adding 'flashinfer/data/cutlass/include/cutlass/gemm/warp/mma_simt.h' 2026-04-14T21:03:04,784 adding 'flashinfer/data/cutlass/include/cutlass/gemm/warp/mma_simt_policy.h' 2026-04-14T21:03:04,787 adding 'flashinfer/data/cutlass/include/cutlass/gemm/warp/mma_simt_tile_iterator.h' 2026-04-14T21:03:04,790 adding 'flashinfer/data/cutlass/include/cutlass/gemm/warp/mma_sparse_tensor_op.h' 2026-04-14T21:03:04,792 adding 'flashinfer/data/cutlass/include/cutlass/gemm/warp/mma_tensor_op.h' 2026-04-14T21:03:04,794 adding 'flashinfer/data/cutlass/include/cutlass/gemm/warp/mma_tensor_op_fast_f32.h' 2026-04-14T21:03:04,796 adding 'flashinfer/data/cutlass/include/cutlass/gemm/warp/mma_tensor_op_fragment_iterator.h' 2026-04-14T21:03:04,798 adding 'flashinfer/data/cutlass/include/cutlass/gemm/warp/mma_tensor_op_policy.h' 2026-04-14T21:03:04,800 adding 'flashinfer/data/cutlass/include/cutlass/gemm/warp/mma_tensor_op_sm70.h' 2026-04-14T21:03:04,802 adding 'flashinfer/data/cutlass/include/cutlass/gemm/warp/mma_tensor_op_tile_access_iterator.h' 2026-04-14T21:03:04,811 adding 'flashinfer/data/cutlass/include/cutlass/gemm/warp/mma_tensor_op_tile_iterator.h' 2026-04-14T21:03:04,818 adding 'flashinfer/data/cutlass/include/cutlass/gemm/warp/mma_tensor_op_tile_iterator_sm70.h' 2026-04-14T21:03:04,822 adding 'flashinfer/data/cutlass/include/cutlass/gemm/warp/mma_tensor_op_tile_iterator_sm80.h' 2026-04-14T21:03:04,825 adding 'flashinfer/data/cutlass/include/cutlass/gemm/warp/mma_tensor_op_tile_iterator_sparse.h' 2026-04-14T21:03:04,827 adding 'flashinfer/data/cutlass/include/cutlass/gemm/warp/mma_tensor_op_tile_iterator_wmma.h' 2026-04-14T21:03:04,829 adding 'flashinfer/data/cutlass/include/cutlass/gemm/warp/mma_tensor_op_wmma.h' 2026-04-14T21:03:04,831 adding 'flashinfer/data/cutlass/include/cutlass/gemm/warp/mma_with_reduction_tensor_op.h' 2026-04-14T21:03:04,833 adding 'flashinfer/data/cutlass/include/cutlass/gemm/warp/scale_bias_tile_iterator.h' 2026-04-14T21:03:04,835 adding 'flashinfer/data/cutlass/include/cutlass/gemm/warp/softmax_scale_bias_transform.h' 2026-04-14T21:03:04,837 adding 'flashinfer/data/cutlass/include/cutlass/gemm/warp/tile_iterator_planar_complex.h' 2026-04-14T21:03:04,839 adding 'flashinfer/data/cutlass/include/cutlass/layout/layout.h' 2026-04-14T21:03:04,842 adding 'flashinfer/data/cutlass/include/cutlass/layout/matrix.h' 2026-04-14T21:03:04,844 adding 'flashinfer/data/cutlass/include/cutlass/layout/permute.h' 2026-04-14T21:03:04,846 adding 'flashinfer/data/cutlass/include/cutlass/layout/pitch_linear.h' 2026-04-14T21:03:04,848 adding 'flashinfer/data/cutlass/include/cutlass/layout/tensor.h' 2026-04-14T21:03:04,850 adding 'flashinfer/data/cutlass/include/cutlass/layout/tensor_op_multiplicand_sm70.h' 2026-04-14T21:03:04,853 adding 'flashinfer/data/cutlass/include/cutlass/layout/tensor_op_multiplicand_sm75.h' 2026-04-14T21:03:04,855 adding 'flashinfer/data/cutlass/include/cutlass/layout/tensor_op_multiplicand_sm80.h' 2026-04-14T21:03:04,856 adding 'flashinfer/data/cutlass/include/cutlass/layout/vector.h' 2026-04-14T21:03:04,858 adding 'flashinfer/data/cutlass/include/cutlass/pipeline/pipeline.hpp' 2026-04-14T21:03:04,862 adding 'flashinfer/data/cutlass/include/cutlass/pipeline/sm100_pipeline.hpp' 2026-04-14T21:03:04,866 adding 'flashinfer/data/cutlass/include/cutlass/pipeline/sm90_pipeline.hpp' 2026-04-14T21:03:04,870 adding 'flashinfer/data/cutlass/include/cutlass/platform/platform.h' 2026-04-14T21:03:04,872 adding 'flashinfer/data/cutlass/include/cutlass/reduction/threadblock_swizzle.h' 2026-04-14T21:03:04,874 adding 'flashinfer/data/cutlass/include/cutlass/reduction/device/reduce_split_k.h' 2026-04-14T21:03:04,875 adding 'flashinfer/data/cutlass/include/cutlass/reduction/device/tensor_reduce.h' 2026-04-14T21:03:04,877 adding 'flashinfer/data/cutlass/include/cutlass/reduction/device/tensor_reduce_affine_contiguous.h' 2026-04-14T21:03:04,879 adding 'flashinfer/data/cutlass/include/cutlass/reduction/device/tensor_reduce_affine_strided.h' 2026-04-14T21:03:04,882 adding 'flashinfer/data/cutlass/include/cutlass/reduction/kernel/reduce_softmax_final.h' 2026-04-14T21:03:04,883 adding 'flashinfer/data/cutlass/include/cutlass/reduction/kernel/reduce_split_k.h' 2026-04-14T21:03:04,886 adding 'flashinfer/data/cutlass/include/cutlass/reduction/kernel/tensor_reduce_affine_contiguous.h' 2026-04-14T21:03:04,888 adding 'flashinfer/data/cutlass/include/cutlass/reduction/kernel/tensor_reduce_affine_strided.h' 2026-04-14T21:03:04,890 adding 'flashinfer/data/cutlass/include/cutlass/reduction/thread/reduce.h' 2026-04-14T21:03:04,892 adding 'flashinfer/data/cutlass/include/cutlass/reduction/thread/reduction_operators.h' 2026-04-14T21:03:04,894 adding 'flashinfer/data/cutlass/include/cutlass/thread/matrix.h' 2026-04-14T21:03:04,898 adding 'flashinfer/data/cutlass/include/cutlass/transform/pitch_linear_thread_map.h' 2026-04-14T21:03:04,901 adding 'flashinfer/data/cutlass/include/cutlass/transform/collective/sm90_wgmma_transpose.hpp' 2026-04-14T21:03:04,904 adding 'flashinfer/data/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp' 2026-04-14T21:03:04,906 adding 'flashinfer/data/cutlass/include/cutlass/transform/kernel/filter_format_transformer.hpp' 2026-04-14T21:03:04,909 adding 'flashinfer/data/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp' 2026-04-14T21:03:04,911 adding 'flashinfer/data/cutlass/include/cutlass/transform/kernel/sparse_gemm_compressor.hpp' 2026-04-14T21:03:04,913 adding 'flashinfer/data/cutlass/include/cutlass/transform/thread/transpose.h' 2026-04-14T21:03:04,915 adding 'flashinfer/data/cutlass/include/cutlass/transform/thread/unary_op.h' 2026-04-14T21:03:04,917 adding 'flashinfer/data/cutlass/include/cutlass/transform/threadblock/ell_iterator.h' 2026-04-14T21:03:04,921 adding 'flashinfer/data/cutlass/include/cutlass/transform/threadblock/ell_predicated_tile_access_iterator.h' 2026-04-14T21:03:04,924 adding 'flashinfer/data/cutlass/include/cutlass/transform/threadblock/ell_predicated_tile_iterator.h' 2026-04-14T21:03:04,927 adding 'flashinfer/data/cutlass/include/cutlass/transform/threadblock/predicated_scale_bias_vector_access_iterator.h' 2026-04-14T21:03:04,929 adding 'flashinfer/data/cutlass/include/cutlass/transform/threadblock/predicated_scale_bias_vector_iterator.h' 2026-04-14T21:03:04,933 adding 'flashinfer/data/cutlass/include/cutlass/transform/threadblock/predicated_tile_access_iterator.h' 2026-04-14T21:03:04,936 adding 'flashinfer/data/cutlass/include/cutlass/transform/threadblock/predicated_tile_access_iterator_2dthreadtile.h' 2026-04-14T21:03:04,938 adding 'flashinfer/data/cutlass/include/cutlass/transform/threadblock/predicated_tile_access_iterator_params.h' 2026-04-14T21:03:04,941 adding 'flashinfer/data/cutlass/include/cutlass/transform/threadblock/predicated_tile_access_iterator_triangular_matrix.h' 2026-04-14T21:03:04,945 adding 'flashinfer/data/cutlass/include/cutlass/transform/threadblock/predicated_tile_iterator.h' 2026-04-14T21:03:04,948 adding 'flashinfer/data/cutlass/include/cutlass/transform/threadblock/predicated_tile_iterator_2dthreadtile.h' 2026-04-14T21:03:04,950 adding 'flashinfer/data/cutlass/include/cutlass/transform/threadblock/predicated_tile_iterator_triangular_matrix.h' 2026-04-14T21:03:04,952 adding 'flashinfer/data/cutlass/include/cutlass/transform/threadblock/predicated_vector_access_iterator.h' 2026-04-14T21:03:04,954 adding 'flashinfer/data/cutlass/include/cutlass/transform/threadblock/regular_scale_bias_vector_access_iterator.h' 2026-04-14T21:03:04,955 adding 'flashinfer/data/cutlass/include/cutlass/transform/threadblock/regular_tile_access_iterator.h' 2026-04-14T21:03:04,957 adding 'flashinfer/data/cutlass/include/cutlass/transform/threadblock/regular_tile_access_iterator_pitch_linear.h' 2026-04-14T21:03:04,959 adding 'flashinfer/data/cutlass/include/cutlass/transform/threadblock/regular_tile_access_iterator_pitch_linear_direct_conv.h' 2026-04-14T21:03:04,961 adding 'flashinfer/data/cutlass/include/cutlass/transform/threadblock/regular_tile_access_iterator_tensor_op.h' 2026-04-14T21:03:04,964 adding 'flashinfer/data/cutlass/include/cutlass/transform/threadblock/regular_tile_access_iterator_tensor_op_sm80.h' 2026-04-14T21:03:04,966 adding 'flashinfer/data/cutlass/include/cutlass/transform/threadblock/regular_tile_iterator.h' 2026-04-14T21:03:04,968 adding 'flashinfer/data/cutlass/include/cutlass/transform/threadblock/regular_tile_iterator_pitch_linear.h' 2026-04-14T21:03:04,970 adding 'flashinfer/data/cutlass/include/cutlass/transform/threadblock/regular_tile_iterator_pitch_linear_2dthreadtile.h' 2026-04-14T21:03:04,972 adding 'flashinfer/data/cutlass/include/cutlass/transform/threadblock/regular_tile_iterator_tensor_op.h' 2026-04-14T21:03:04,975 adding 'flashinfer/data/cutlass/include/cutlass/transform/threadblock/regular_tile_iterator_tensor_op_sm70.h' 2026-04-14T21:03:04,977 adding 'flashinfer/data/cutlass/include/cutlass/transform/threadblock/vector_iterator.h' 2026-04-14T21:03:04,979 adding 'flashinfer/data/cutlass/include/cutlass/transform/warp/vector_fragment_iterator.h' 2026-04-14T21:03:04,981 adding 'flashinfer/data/cutlass/python/setup_cutlass.py' 2026-04-14T21:03:04,982 adding 'flashinfer/data/cutlass/python/setup_library.py' 2026-04-14T21:03:04,984 adding 'flashinfer/data/cutlass/python/setup_pycute.py' 2026-04-14T21:03:04,986 adding 'flashinfer/data/cutlass/python/CuTeDSL/prep_editable_install.py' 2026-04-14T21:03:04,988 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/__init__.py' 2026-04-14T21:03:04,990 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/impl_utils.py' 2026-04-14T21:03:04,992 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/torch.py' 2026-04-14T21:03:04,994 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/__init__.py' 2026-04-14T21:03:04,995 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/arch.py' 2026-04-14T21:03:04,998 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/ast_helpers.py' 2026-04-14T21:03:05,009 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/ast_preprocessor.py' 2026-04-14T21:03:05,012 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/cache_helpers.py' 2026-04-14T21:03:05,014 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/common.py' 2026-04-14T21:03:05,017 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/compiler.py' 2026-04-14T21:03:05,026 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/dsl.py' 2026-04-14T21:03:05,029 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/env_manager.py' 2026-04-14T21:03:05,034 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/jit_executor.py' 2026-04-14T21:03:05,042 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/typing.py' 2026-04-14T21:03:05,044 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/version_info.py' 2026-04-14T21:03:05,045 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/_mlir_helpers/__init__.py' 2026-04-14T21:03:05,048 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/_mlir_helpers/arith.py' 2026-04-14T21:03:05,049 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/_mlir_helpers/gpu.py' 2026-04-14T21:03:05,050 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/_mlir_helpers/lru_cache_ir.py' 2026-04-14T21:03:05,052 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/_mlir_helpers/op.py' 2026-04-14T21:03:05,053 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/export/__init__.py' 2026-04-14T21:03:05,055 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/export/c_header_generator.py' 2026-04-14T21:03:05,057 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/export/export.py' 2026-04-14T21:03:05,059 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/export/external_binary_module.py' 2026-04-14T21:03:05,060 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/runtime/__init__.py' 2026-04-14T21:03:05,063 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/runtime/cuda.py' 2026-04-14T21:03:05,065 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/runtime/device_tensor.py' 2026-04-14T21:03:05,066 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/runtime/dlpack_types.py' 2026-04-14T21:03:05,068 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/runtime/jit_arg_adapters.py' 2026-04-14T21:03:05,069 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/runtime/stream_adapter.py' 2026-04-14T21:03:05,071 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/runtime/tensor_descriptor.py' 2026-04-14T21:03:05,072 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/tvm_ffi_builder/__init__.py' 2026-04-14T21:03:05,075 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/tvm_ffi_builder/call_provider.py' 2026-04-14T21:03:05,077 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/tvm_ffi_builder/mlir_builder.py' 2026-04-14T21:03:05,080 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/tvm_ffi_builder/spec.py' 2026-04-14T21:03:05,088 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/tvm_ffi_builder/tvm_ffi_builder.py' 2026-04-14T21:03:05,090 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/utils/__init__.py' 2026-04-14T21:03:05,092 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/utils/logger.py' 2026-04-14T21:03:05,093 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/utils/numpy.py' 2026-04-14T21:03:05,095 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/utils/stacktrace.py' 2026-04-14T21:03:05,096 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/utils/timer.py' 2026-04-14T21:03:05,099 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/base_dsl/utils/tree_utils.py' 2026-04-14T21:03:05,102 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/__init__.py' 2026-04-14T21:03:05,104 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/_tvm_ffi_args_spec_converter.py' 2026-04-14T21:03:05,107 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/algorithm.py' 2026-04-14T21:03:05,112 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/atom.py' 2026-04-14T21:03:05,132 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/core.py' 2026-04-14T21:03:05,134 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/ffi.py' 2026-04-14T21:03:05,136 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/math.py' 2026-04-14T21:03:05,140 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/runtime.py' 2026-04-14T21:03:05,153 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/tensor.py' 2026-04-14T21:03:05,162 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/testing.py' 2026-04-14T21:03:05,165 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/tuple.py' 2026-04-14T21:03:05,168 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/typing.py' 2026-04-14T21:03:05,171 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/arch/__init__.py' 2026-04-14T21:03:05,173 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/arch/clc.py' 2026-04-14T21:03:05,175 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/arch/elect.py' 2026-04-14T21:03:05,177 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/arch/mbar.py' 2026-04-14T21:03:05,179 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/arch/numeric_conversion.py' 2026-04-14T21:03:05,190 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/arch/nvvm_wrappers.py' 2026-04-14T21:03:05,192 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/arch/smem.py' 2026-04-14T21:03:05,194 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/arch/tmem.py' 2026-04-14T21:03:05,197 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/experimental/__init__.py' 2026-04-14T21:03:05,198 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/experimental/algorithm.py' 2026-04-14T21:03:05,200 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/experimental/core.py' 2026-04-14T21:03:05,202 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/experimental/math.py' 2026-04-14T21:03:05,204 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/experimental/memory.py' 2026-04-14T21:03:05,208 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/experimental/pipeline.py' 2026-04-14T21:03:05,210 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/experimental/utils.py' 2026-04-14T21:03:05,213 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/export/__init__.py' 2026-04-14T21:03:05,215 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/export/aot_config.py' 2026-04-14T21:03:05,217 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/export/c_header_generator.py' 2026-04-14T21:03:05,219 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/export/export.py' 2026-04-14T21:03:05,220 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/export/load.py' 2026-04-14T21:03:05,223 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/__init__.py' 2026-04-14T21:03:05,225 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/common.py' 2026-04-14T21:03:05,227 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/helpers.py' 2026-04-14T21:03:05,230 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/cpasync/__init__.py' 2026-04-14T21:03:05,233 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/cpasync/copy.py' 2026-04-14T21:03:05,236 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/cpasync/helpers.py' 2026-04-14T21:03:05,239 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/tcgen05/__init__.py' 2026-04-14T21:03:05,242 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/tcgen05/copy.py' 2026-04-14T21:03:05,245 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/tcgen05/helpers.py' 2026-04-14T21:03:05,248 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/tcgen05/mma.py' 2026-04-14T21:03:05,250 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/warp/__init__.py' 2026-04-14T21:03:05,251 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/warp/copy.py' 2026-04-14T21:03:05,253 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/warp/mma.py' 2026-04-14T21:03:05,255 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/warpgroup/__init__.py' 2026-04-14T21:03:05,256 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/warpgroup/helpers.py' 2026-04-14T21:03:05,258 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/cute/nvgpu/warpgroup/mma.py' 2026-04-14T21:03:05,260 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/cutlass_dsl/__init__.py' 2026-04-14T21:03:05,262 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/cutlass_dsl/cuda_jit_executor.py' 2026-04-14T21:03:05,263 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/cutlass_dsl/cuda_stream_adapter.py' 2026-04-14T21:03:05,272 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/cutlass_dsl/cutlass.py' 2026-04-14T21:03:05,276 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/cutlass_dsl/cutlass_ast_decorators.py' 2026-04-14T21:03:05,279 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/cutlass_dsl/tvm_ffi_provider.py' 2026-04-14T21:03:05,281 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/jax/__init__.py' 2026-04-14T21:03:05,283 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/jax/compile.py' 2026-04-14T21:03:05,284 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/jax/ffi.py' 2026-04-14T21:03:05,286 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/jax/primitive.py' 2026-04-14T21:03:05,288 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/jax/testing.py' 2026-04-14T21:03:05,290 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/jax/types.py' 2026-04-14T21:03:05,292 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/pipeline/__init__.py' 2026-04-14T21:03:05,294 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/pipeline/helpers.py' 2026-04-14T21:03:05,298 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/pipeline/sm100.py' 2026-04-14T21:03:05,303 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/pipeline/sm90.py' 2026-04-14T21:03:05,305 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils/__init__.py' 2026-04-14T21:03:05,309 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils/blackwell_helpers.py' 2026-04-14T21:03:05,311 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils/blockscaled_layout.py' 2026-04-14T21:03:05,312 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils/distributed.py' 2026-04-14T21:03:05,314 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils/dynamic_persistent_tile_scheduler.py' 2026-04-14T21:03:05,318 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils/grouped_gemm_persistent_tile_scheduler.py' 2026-04-14T21:03:05,321 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils/grouped_gemm_tile_scheduler_helper.py' 2026-04-14T21:03:05,322 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils/hardware_info.py' 2026-04-14T21:03:05,325 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils/hopper_helpers.py' 2026-04-14T21:03:05,326 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils/layout.py' 2026-04-14T21:03:05,330 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils/mixed_input_helpers.py' 2026-04-14T21:03:05,332 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils/print_latex.py' 2026-04-14T21:03:05,334 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils/smem_allocator.py' 2026-04-14T21:03:05,337 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils/static_persistent_tile_scheduler.py' 2026-04-14T21:03:05,339 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils/tensor_helpers.py' 2026-04-14T21:03:05,341 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils/tensormap_manager.py' 2026-04-14T21:03:05,343 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils/tmem_allocator.py' 2026-04-14T21:03:05,345 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils/gemm/__init__.py' 2026-04-14T21:03:05,348 adding 'flashinfer/data/cutlass/python/CuTeDSL/cutlass/utils/gemm/sm100.py' 2026-04-14T21:03:05,351 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/__init__.py' 2026-04-14T21:03:05,354 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/library_defaults.py' 2026-04-14T21:03:05,356 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/shape.py' 2026-04-14T21:03:05,357 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/swizzle.py' 2026-04-14T21:03:05,359 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/backend/__init__.py' 2026-04-14T21:03:05,361 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/backend/arguments.py' 2026-04-14T21:03:05,364 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/backend/c_types.py' 2026-04-14T21:03:05,366 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/backend/compiler.py' 2026-04-14T21:03:05,369 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/backend/conv2d_operation.py' 2026-04-14T21:03:05,372 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/backend/epilogue.py' 2026-04-14T21:03:05,374 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/backend/frontend.py' 2026-04-14T21:03:05,381 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/backend/gemm_operation.py' 2026-04-14T21:03:05,384 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/backend/library.py' 2026-04-14T21:03:05,385 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/backend/memory_manager.py' 2026-04-14T21:03:05,387 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/backend/operation.py' 2026-04-14T21:03:05,389 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/backend/reduction_operation.py' 2026-04-14T21:03:05,390 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/backend/type_hint.py' 2026-04-14T21:03:05,392 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/__init__.py' 2026-04-14T21:03:05,394 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/epilogue.py' 2026-04-14T21:03:05,396 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/backend/__init__.py' 2026-04-14T21:03:05,398 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/backend/emitter_base.py' 2026-04-14T21:03:05,399 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/backend/sm100_emitter.py' 2026-04-14T21:03:05,401 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/backend/sm100_nodes.py' 2026-04-14T21:03:05,402 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/backend/sm80_emitter.py' 2026-04-14T21:03:05,403 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/backend/sm80_nodes.py' 2026-04-14T21:03:05,405 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/backend/sm90_emitter.py' 2026-04-14T21:03:05,407 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/backend/sm90_nodes.py' 2026-04-14T21:03:05,409 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/frontend/__init__.py' 2026-04-14T21:03:05,411 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/frontend/frontend_base.py' 2026-04-14T21:03:05,412 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/frontend/python_ast.py' 2026-04-14T21:03:05,414 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/ir/__init__.py' 2026-04-14T21:03:05,416 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/ir/compute_nodes.py' 2026-04-14T21:03:05,418 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/ir/dag_ir.py' 2026-04-14T21:03:05,420 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/ir/layout_algorithm.py' 2026-04-14T21:03:05,422 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/ir/layout_nodes.py' 2026-04-14T21:03:05,423 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/ir/load_nodes.py' 2026-04-14T21:03:05,425 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/ir/node.py' 2026-04-14T21:03:05,427 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/ir/store_nodes.py' 2026-04-14T21:03:05,429 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/ir/tensor.py' 2026-04-14T21:03:05,431 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/passes/__init__.py' 2026-04-14T21:03:05,432 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/passes/graph_drawer.py' 2026-04-14T21:03:05,434 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/passes/pass_argument_type.py' 2026-04-14T21:03:05,435 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/passes/pass_dag_2_tree.py' 2026-04-14T21:03:05,437 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/passes/pass_fix_element_d.py' 2026-04-14T21:03:05,438 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/passes/pass_get_impl.py' 2026-04-14T21:03:05,440 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/passes/pass_layout_elimination.py' 2026-04-14T21:03:05,442 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/passes/pass_manager.py' 2026-04-14T21:03:05,443 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/passes/pass_no_op_elimination.py' 2026-04-14T21:03:05,444 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/passes/pass_preprocess_red.py' 2026-04-14T21:03:05,446 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/passes/pass_shape_type_propagation.py' 2026-04-14T21:03:05,448 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/passes/smem_size_calculator.py' 2026-04-14T21:03:05,449 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/backend/evt/passes/util.py' 2026-04-14T21:03:05,451 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/backend/utils/__init__.py' 2026-04-14T21:03:05,452 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/backend/utils/device.py' 2026-04-14T21:03:05,454 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/emit/__init__.py' 2026-04-14T21:03:05,456 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/emit/common.py' 2026-04-14T21:03:05,460 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/emit/pytorch.py' 2026-04-14T21:03:05,462 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/epilogue/__init__.py' 2026-04-14T21:03:05,463 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/epilogue/epilogue.py' 2026-04-14T21:03:05,465 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/epilogue/evt_ops.py' 2026-04-14T21:03:05,467 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/op/__init__.py' 2026-04-14T21:03:05,471 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/op/conv.py' 2026-04-14T21:03:05,475 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/op/gemm.py' 2026-04-14T21:03:05,478 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/op/gemm_grouped.py' 2026-04-14T21:03:05,480 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/op/op.py' 2026-04-14T21:03:05,482 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/utils/__init__.py' 2026-04-14T21:03:05,484 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/utils/check.py' 2026-04-14T21:03:05,486 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/utils/datatypes.py' 2026-04-14T21:03:05,487 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/utils/lazy_import.py' 2026-04-14T21:03:05,489 adding 'flashinfer/data/cutlass/python/cutlass_cppgen/utils/profiler.py' 2026-04-14T21:03:05,491 adding 'flashinfer/data/cutlass/python/cutlass_library/__init__.py' 2026-04-14T21:03:05,494 adding 'flashinfer/data/cutlass/python/cutlass_library/conv2d_operation.py' 2026-04-14T21:03:05,497 adding 'flashinfer/data/cutlass/python/cutlass_library/conv3d_operation.py' 2026-04-14T21:03:05,499 adding 'flashinfer/data/cutlass/python/cutlass_library/conv3x_emitter.py' 2026-04-14T21:03:05,503 adding 'flashinfer/data/cutlass/python/cutlass_library/emit_kernel_listing.py' 2026-04-14T21:03:05,509 adding 'flashinfer/data/cutlass/python/cutlass_library/gemm_operation.py' 2026-04-14T21:03:05,536 adding 'flashinfer/data/cutlass/python/cutlass_library/generator.py' 2026-04-14T21:03:05,541 adding 'flashinfer/data/cutlass/python/cutlass_library/heuristics.py' 2026-04-14T21:03:05,543 adding 'flashinfer/data/cutlass/python/cutlass_library/heuristics_provider.py' 2026-04-14T21:03:05,549 adding 'flashinfer/data/cutlass/python/cutlass_library/library.py' 2026-04-14T21:03:05,553 adding 'flashinfer/data/cutlass/python/cutlass_library/manifest.py' 2026-04-14T21:03:05,555 adding 'flashinfer/data/cutlass/python/cutlass_library/rank_2k_operation.py' 2026-04-14T21:03:05,557 adding 'flashinfer/data/cutlass/python/cutlass_library/rank_k_operation.py' 2026-04-14T21:03:05,559 adding 'flashinfer/data/cutlass/python/cutlass_library/sm100_shapes.py' 2026-04-14T21:03:05,561 adding 'flashinfer/data/cutlass/python/cutlass_library/sm100_utils.py' 2026-04-14T21:03:05,563 adding 'flashinfer/data/cutlass/python/cutlass_library/sm90_shapes.py' 2026-04-14T21:03:05,566 adding 'flashinfer/data/cutlass/python/cutlass_library/sm90_utils.py' 2026-04-14T21:03:05,569 adding 'flashinfer/data/cutlass/python/cutlass_library/symm_operation.py' 2026-04-14T21:03:05,571 adding 'flashinfer/data/cutlass/python/cutlass_library/trmm_operation.py' 2026-04-14T21:03:05,574 adding 'flashinfer/data/cutlass/python/docs_src/source/conf.py' 2026-04-14T21:03:05,575 adding 'flashinfer/data/cutlass/python/pycute/__init__.py' 2026-04-14T21:03:05,577 adding 'flashinfer/data/cutlass/python/pycute/int_tuple.py' 2026-04-14T21:03:05,579 adding 'flashinfer/data/cutlass/python/pycute/layout.py' 2026-04-14T21:03:05,581 adding 'flashinfer/data/cutlass/python/pycute/swizzle.py' 2026-04-14T21:03:05,582 adding 'flashinfer/data/cutlass/python/pycute/typing.py' 2026-04-14T21:03:05,585 adding 'flashinfer/data/cutlass/test/examples/CuTeDSL/conftest.py' 2026-04-14T21:03:05,587 adding 'flashinfer/data/cutlass/test/examples/CuTeDSL/sm_100a/conftest.py' 2026-04-14T21:03:05,589 adding 'flashinfer/data/cutlass/test/examples/CuTeDSL/sm_100a/test_dense_blockscaled_gemm_persistent_prefetch.py' 2026-04-14T21:03:05,591 adding 'flashinfer/data/cutlass/test/examples/CuTeDSL/sm_100a/test_dense_gemm_persistent_prefetch.py' 2026-04-14T21:03:05,592 adding 'flashinfer/data/cutlass/test/examples/CuTeDSL/sm_100a/test_rmsnorm.py' 2026-04-14T21:03:05,594 adding 'flashinfer/data/cutlass/test/examples/CuTeDSL/sm_100a/test_tutorial_gemm.py' 2026-04-14T21:03:05,596 adding 'flashinfer/data/cutlass/test/python/cutlass/installation.py' 2026-04-14T21:03:05,599 adding 'flashinfer/data/cutlass/test/python/cutlass/conv2d/conv2d_problem_sizes.py' 2026-04-14T21:03:05,601 adding 'flashinfer/data/cutlass/test/python/cutlass/conv2d/conv2d_sm80.py' 2026-04-14T21:03:05,603 adding 'flashinfer/data/cutlass/test/python/cutlass/conv2d/conv2d_test_utils.py' 2026-04-14T21:03:05,604 adding 'flashinfer/data/cutlass/test/python/cutlass/conv2d/run_all_tests.py' 2026-04-14T21:03:05,607 adding 'flashinfer/data/cutlass/test/python/cutlass/emit/pytorch.py' 2026-04-14T21:03:05,609 adding 'flashinfer/data/cutlass/test/python/cutlass/evt/evt_compute_sm80_90.py' 2026-04-14T21:03:05,610 adding 'flashinfer/data/cutlass/test/python/cutlass/evt/evt_layout_sm80_90.py' 2026-04-14T21:03:05,612 adding 'flashinfer/data/cutlass/test/python/cutlass/evt/evt_load_sm80_90.py' 2026-04-14T21:03:05,614 adding 'flashinfer/data/cutlass/test/python/cutlass/evt/evt_mixed_sm80_90.py' 2026-04-14T21:03:05,615 adding 'flashinfer/data/cutlass/test/python/cutlass/evt/evt_store_sm80_90.py' 2026-04-14T21:03:05,617 adding 'flashinfer/data/cutlass/test/python/cutlass/evt/run_all_tests.py' 2026-04-14T21:03:05,619 adding 'flashinfer/data/cutlass/test/python/cutlass/evt/utils/evt_testbed.py' 2026-04-14T21:03:05,621 adding 'flashinfer/data/cutlass/test/python/cutlass/gemm/gemm_batched.py' 2026-04-14T21:03:05,623 adding 'flashinfer/data/cutlass/test/python/cutlass/gemm/gemm_f16_sm80.py' 2026-04-14T21:03:05,625 adding 'flashinfer/data/cutlass/test/python/cutlass/gemm/gemm_f16_sm90.py' 2026-04-14T21:03:05,626 adding 'flashinfer/data/cutlass/test/python/cutlass/gemm/gemm_f32_sm80.py' 2026-04-14T21:03:05,628 adding 'flashinfer/data/cutlass/test/python/cutlass/gemm/gemm_f64_sm80.py' 2026-04-14T21:03:05,629 adding 'flashinfer/data/cutlass/test/python/cutlass/gemm/gemm_f64_sm90.py' 2026-04-14T21:03:05,630 adding 'flashinfer/data/cutlass/test/python/cutlass/gemm/gemm_f8_sm90.py' 2026-04-14T21:03:05,632 adding 'flashinfer/data/cutlass/test/python/cutlass/gemm/gemm_mixed_sm80.py' 2026-04-14T21:03:05,633 adding 'flashinfer/data/cutlass/test/python/cutlass/gemm/gemm_s8_sm80.py' 2026-04-14T21:03:05,634 adding 'flashinfer/data/cutlass/test/python/cutlass/gemm/gemm_s8_sm90.py' 2026-04-14T21:03:05,637 adding 'flashinfer/data/cutlass/test/python/cutlass/gemm/gemm_testbed.py' 2026-04-14T21:03:05,638 adding 'flashinfer/data/cutlass/test/python/cutlass/gemm/run_all_tests.py' 2026-04-14T21:03:05,640 adding 'flashinfer/data/cutlass/test/python/cutlass/gemm/utils.py' 2026-04-14T21:03:05,642 adding 'flashinfer/data/cutlass/test/python/cutlass/interface/conv2d_interface.py' 2026-04-14T21:03:05,644 adding 'flashinfer/data/cutlass/test/python/cutlass/interface/evt_interface.py' 2026-04-14T21:03:05,646 adding 'flashinfer/data/cutlass/test/python/cutlass/interface/gemm_interface.py' 2026-04-14T21:03:05,648 adding 'flashinfer/data/cutlass/test/python/cutlass/interface/utils.py' 2026-04-14T21:03:05,650 adding 'flashinfer/data/cutlass/test/python/pycute/run_all_tests.py' 2026-04-14T21:03:05,651 adding 'flashinfer/data/cutlass/test/python/pycute/test_coalesce.py' 2026-04-14T21:03:05,653 adding 'flashinfer/data/cutlass/test/python/pycute/test_complement.py' 2026-04-14T21:03:05,654 adding 'flashinfer/data/cutlass/test/python/pycute/test_composition.py' 2026-04-14T21:03:05,655 adding 'flashinfer/data/cutlass/test/python/pycute/test_int_tuple.py' 2026-04-14T21:03:05,657 adding 'flashinfer/data/cutlass/test/python/pycute/test_left_inverse.py' 2026-04-14T21:03:05,658 adding 'flashinfer/data/cutlass/test/python/pycute/test_right_inverse.py' 2026-04-14T21:03:05,660 adding 'flashinfer/data/cutlass/test/python/pycute/test_typing.py' 2026-04-14T21:03:05,663 adding 'flashinfer/data/cutlass/test/unit/gemm/device/simt_sm50.py' 2026-04-14T21:03:05,667 adding 'flashinfer/data/cutlass/test/utils/test_sharding.py' 2026-04-14T21:03:05,671 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/GPU_Clock.hpp' 2026-04-14T21:03:05,673 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/command_line.h' 2026-04-14T21:03:05,675 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/cublas_wrappers.hpp' 2026-04-14T21:03:05,676 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/debug.h' 2026-04-14T21:03:05,678 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/device_dump.h' 2026-04-14T21:03:05,680 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/device_groupnorm.h' 2026-04-14T21:03:05,683 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/device_layernorm.h' 2026-04-14T21:03:05,685 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/device_memory.h' 2026-04-14T21:03:05,686 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/device_nchw_to_nhwc.h' 2026-04-14T21:03:05,688 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/device_nhwc_padding.h' 2026-04-14T21:03:05,691 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/device_nhwc_pooling.h' 2026-04-14T21:03:05,693 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/device_nhwc_to_nchw.h' 2026-04-14T21:03:05,694 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/device_rmsnorm.h' 2026-04-14T21:03:05,696 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/device_utils.h' 2026-04-14T21:03:05,697 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/distribution.h' 2026-04-14T21:03:05,699 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/exceptions.h' 2026-04-14T21:03:05,701 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/gett_commandline.hpp' 2026-04-14T21:03:05,702 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/helper_cuda.hpp' 2026-04-14T21:03:05,704 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/host_reorder.h' 2026-04-14T21:03:05,706 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/host_tensor.h' 2026-04-14T21:03:05,709 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/host_tensor_planar_complex.h' 2026-04-14T21:03:05,710 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/host_uncompress.h' 2026-04-14T21:03:05,712 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/index_sequence.h' 2026-04-14T21:03:05,715 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/mixed_dtype_utils.hpp' 2026-04-14T21:03:05,717 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/packed_stride.hpp' 2026-04-14T21:03:05,719 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/print_error.hpp' 2026-04-14T21:03:05,721 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/tensor_view_io.h' 2026-04-14T21:03:05,723 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/type_traits.h' 2026-04-14T21:03:05,725 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/detail/inner_product.h' 2026-04-14T21:03:05,727 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/detail/linear_to_coordinate.h' 2026-04-14T21:03:05,731 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/device/convolution.h' 2026-04-14T21:03:05,733 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/device/gemm.h' 2026-04-14T21:03:05,734 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/device/gemm_complex.h' 2026-04-14T21:03:05,736 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/device/gemm_planar_complex.h' 2026-04-14T21:03:05,738 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/device/gett.hpp' 2026-04-14T21:03:05,740 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/device/rank_2k_complex.h' 2026-04-14T21:03:05,741 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/device/tensor_compare.h' 2026-04-14T21:03:05,745 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/device/tensor_fill.h' 2026-04-14T21:03:05,747 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/device/tensor_foreach.h' 2026-04-14T21:03:05,749 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/device/tensor_reduce.h' 2026-04-14T21:03:05,750 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/device/tensor_relu.h' 2026-04-14T21:03:05,753 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/device/kernel/gemm.h' 2026-04-14T21:03:05,754 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/device/kernel/tensor_elementwise.h' 2026-04-14T21:03:05,756 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/device/kernel/tensor_foreach.h' 2026-04-14T21:03:05,758 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/device/thread/gemm.h' 2026-04-14T21:03:05,761 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host/conv.hpp' 2026-04-14T21:03:05,764 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host/convolution.h' 2026-04-14T21:03:05,765 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host/error_metrics.h' 2026-04-14T21:03:05,767 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host/gemm.h' 2026-04-14T21:03:05,769 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host/gemm_complex.h' 2026-04-14T21:03:05,771 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host/gemm_planar_complex.h' 2026-04-14T21:03:05,775 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host/gett.hpp' 2026-04-14T21:03:05,777 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host/rank_2k.h' 2026-04-14T21:03:05,779 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host/rank_2k_complex.h' 2026-04-14T21:03:05,780 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host/rank_k_complex.h' 2026-04-14T21:03:05,782 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host/symm.h' 2026-04-14T21:03:05,784 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host/symm_complex.h' 2026-04-14T21:03:05,786 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host/tensor_compare.h' 2026-04-14T21:03:05,787 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host/tensor_compare.hpp' 2026-04-14T21:03:05,789 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host/tensor_copy.h' 2026-04-14T21:03:05,790 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host/tensor_elementwise.h' 2026-04-14T21:03:05,794 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host/tensor_fill.h' 2026-04-14T21:03:05,796 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host/tensor_fill.hpp' 2026-04-14T21:03:05,798 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host/tensor_foreach.h' 2026-04-14T21:03:05,799 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host/tensor_norm.h' 2026-04-14T21:03:05,801 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host/tensor_reduce.h' 2026-04-14T21:03:05,802 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host/tensor_reduce.hpp' 2026-04-14T21:03:05,804 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host/trmm.h' 2026-04-14T21:03:05,806 adding 'flashinfer/data/cutlass/tools/util/include/cutlass/util/reference/host/trmm_complex.h' 2026-04-14T21:03:05,808 adding 'flashinfer/data/cutlass/tools/util/scripts/split_test_cmake.py' 2026-04-14T21:03:05,811 adding 'flashinfer/data/include/flashinfer/activation.cuh' 2026-04-14T21:03:05,814 adding 'flashinfer/data/include/flashinfer/air_top_p.cuh' 2026-04-14T21:03:05,815 adding 'flashinfer/data/include/flashinfer/allocator.h' 2026-04-14T21:03:05,817 adding 'flashinfer/data/include/flashinfer/arch_condition.h' 2026-04-14T21:03:05,818 adding 'flashinfer/data/include/flashinfer/attention_impl.cuh' 2026-04-14T21:03:05,820 adding 'flashinfer/data/include/flashinfer/concat_mla.cuh' 2026-04-14T21:03:05,821 adding 'flashinfer/data/include/flashinfer/cp_async.cuh' 2026-04-14T21:03:05,822 adding 'flashinfer/data/include/flashinfer/cubin_loader.h' 2026-04-14T21:03:05,824 adding 'flashinfer/data/include/flashinfer/cutlass_utils.cuh' 2026-04-14T21:03:05,825 adding 'flashinfer/data/include/flashinfer/exception.h' 2026-04-14T21:03:05,827 adding 'flashinfer/data/include/flashinfer/fastdiv.cuh' 2026-04-14T21:03:05,828 adding 'flashinfer/data/include/flashinfer/fp16.h' 2026-04-14T21:03:05,830 adding 'flashinfer/data/include/flashinfer/fp4_layout.cuh' 2026-04-14T21:03:05,831 adding 'flashinfer/data/include/flashinfer/frag_layout_swizzle.cuh' 2026-04-14T21:03:05,833 adding 'flashinfer/data/include/flashinfer/layout.cuh' 2026-04-14T21:03:05,834 adding 'flashinfer/data/include/flashinfer/logging.h' 2026-04-14T21:03:05,835 adding 'flashinfer/data/include/flashinfer/math.cuh' 2026-04-14T21:03:05,838 adding 'flashinfer/data/include/flashinfer/mma.cuh' 2026-04-14T21:03:05,841 adding 'flashinfer/data/include/flashinfer/norm.cuh' 2026-04-14T21:03:05,844 adding 'flashinfer/data/include/flashinfer/page.cuh' 2026-04-14T21:03:05,846 adding 'flashinfer/data/include/flashinfer/permuted_smem.cuh' 2026-04-14T21:03:05,851 adding 'flashinfer/data/include/flashinfer/pos_enc.cuh' 2026-04-14T21:03:05,853 adding 'flashinfer/data/include/flashinfer/profiler.cuh' 2026-04-14T21:03:05,855 adding 'flashinfer/data/include/flashinfer/quantization.cuh' 2026-04-14T21:03:05,861 adding 'flashinfer/data/include/flashinfer/sampling.cuh' 2026-04-14T21:03:05,874 adding 'flashinfer/data/include/flashinfer/topk.cuh' 2026-04-14T21:03:05,877 adding 'flashinfer/data/include/flashinfer/utils.cuh' 2026-04-14T21:03:05,882 adding 'flashinfer/data/include/flashinfer/vec_dtypes.cuh' 2026-04-14T21:03:05,886 adding 'flashinfer/data/include/flashinfer/attention/batch_pod.cuh' 2026-04-14T21:03:05,889 adding 'flashinfer/data/include/flashinfer/attention/cascade.cuh' 2026-04-14T21:03:05,891 adding 'flashinfer/data/include/flashinfer/attention/cutlass_mla.cuh' 2026-04-14T21:03:05,896 adding 'flashinfer/data/include/flashinfer/attention/decode.cuh' 2026-04-14T21:03:05,899 adding 'flashinfer/data/include/flashinfer/attention/decode_mla_cute_sm80.cuh' 2026-04-14T21:03:05,900 adding 'flashinfer/data/include/flashinfer/attention/default_decode_params.cuh' 2026-04-14T21:03:05,902 adding 'flashinfer/data/include/flashinfer/attention/default_prefill_params.cuh' 2026-04-14T21:03:05,904 adding 'flashinfer/data/include/flashinfer/attention/heap.h' 2026-04-14T21:03:05,905 adding 'flashinfer/data/include/flashinfer/attention/hopper.cuh' 2026-04-14T21:03:05,907 adding 'flashinfer/data/include/flashinfer/attention/mask.cuh' 2026-04-14T21:03:05,911 adding 'flashinfer/data/include/flashinfer/attention/mla.cuh' 2026-04-14T21:03:05,915 adding 'flashinfer/data/include/flashinfer/attention/mla_hopper.cuh' 2026-04-14T21:03:05,917 adding 'flashinfer/data/include/flashinfer/attention/mla_params.cuh' 2026-04-14T21:03:05,921 adding 'flashinfer/data/include/flashinfer/attention/persistent.cuh' 2026-04-14T21:03:05,922 adding 'flashinfer/data/include/flashinfer/attention/persistent_template.cuh' 2026-04-14T21:03:05,925 adding 'flashinfer/data/include/flashinfer/attention/pod.cuh' 2026-04-14T21:03:05,934 adding 'flashinfer/data/include/flashinfer/attention/prefill.cuh' 2026-04-14T21:03:05,942 adding 'flashinfer/data/include/flashinfer/attention/scheduler.cuh' 2026-04-14T21:03:05,944 adding 'flashinfer/data/include/flashinfer/attention/state.cuh' 2026-04-14T21:03:05,946 adding 'flashinfer/data/include/flashinfer/attention/variant_helper.cuh' 2026-04-14T21:03:05,947 adding 'flashinfer/data/include/flashinfer/attention/variants.cuh' 2026-04-14T21:03:05,950 adding 'flashinfer/data/include/flashinfer/attention/blackwell/fmha_cutlass_sm100.cuh' 2026-04-14T21:03:05,951 adding 'flashinfer/data/include/flashinfer/attention/blackwell/plan.cuh' 2026-04-14T21:03:05,954 adding 'flashinfer/data/include/flashinfer/attention/blackwell/collective/fmha_common.hpp' 2026-04-14T21:03:05,955 adding 'flashinfer/data/include/flashinfer/attention/blackwell/collective/fmha_fusion.hpp' 2026-04-14T21:03:05,957 adding 'flashinfer/data/include/flashinfer/attention/blackwell/collective/sm100_fmha_fwd_epilogue_tma_warpspecialized.hpp' 2026-04-14T21:03:05,961 adding 'flashinfer/data/include/flashinfer/attention/blackwell/collective/sm100_fmha_fwd_mainloop_tma_warpspecialized.hpp' 2026-04-14T21:03:05,963 adding 'flashinfer/data/include/flashinfer/attention/blackwell/collective/sm100_fmha_gen_epilogue_warpspecialized.hpp' 2026-04-14T21:03:05,968 adding 'flashinfer/data/include/flashinfer/attention/blackwell/collective/sm100_fmha_gen_mainloop_warpspecialized.hpp' 2026-04-14T21:03:05,971 adding 'flashinfer/data/include/flashinfer/attention/blackwell/collective/sm100_fmha_load_cpasync_warpspecialized.hpp' 2026-04-14T21:03:05,972 adding 'flashinfer/data/include/flashinfer/attention/blackwell/collective/sm100_fmha_load_tma_warpspecialized.hpp' 2026-04-14T21:03:05,975 adding 'flashinfer/data/include/flashinfer/attention/blackwell/common/pow_2.hpp' 2026-04-14T21:03:05,977 adding 'flashinfer/data/include/flashinfer/attention/blackwell/device/fmha.hpp' 2026-04-14T21:03:05,979 adding 'flashinfer/data/include/flashinfer/attention/blackwell/device/sm100_mla.hpp' 2026-04-14T21:03:05,981 adding 'flashinfer/data/include/flashinfer/attention/blackwell/kernel/fmha_options.hpp' 2026-04-14T21:03:05,982 adding 'flashinfer/data/include/flashinfer/attention/blackwell/kernel/fmha_tile_scheduler.hpp' 2026-04-14T21:03:05,984 adding 'flashinfer/data/include/flashinfer/attention/blackwell/kernel/gather_tensor.hpp' 2026-04-14T21:03:05,986 adding 'flashinfer/data/include/flashinfer/attention/blackwell/kernel/sm100_fmha_fwd_kernel_tma_warpspecialized.hpp' 2026-04-14T21:03:05,989 adding 'flashinfer/data/include/flashinfer/attention/blackwell/kernel/sm100_fmha_gen_kernel_warpspecialized.hpp' 2026-04-14T21:03:05,991 adding 'flashinfer/data/include/flashinfer/attention/blackwell/kernel/sm100_fmha_mla_reduction.hpp' 2026-04-14T21:03:05,998 adding 'flashinfer/data/include/flashinfer/attention/blackwell/kernel/sm100_fmha_mla_tma_warpspecialized.hpp' 2026-04-14T21:03:06,000 adding 'flashinfer/data/include/flashinfer/attention/blackwell/kernel/sm100_mla_tile_scheduler.hpp' 2026-04-14T21:03:06,002 adding 'flashinfer/data/include/flashinfer/attention/hopper/attention_updater.cuh' 2026-04-14T21:03:06,004 adding 'flashinfer/data/include/flashinfer/attention/hopper/default_params.cuh' 2026-04-14T21:03:06,006 adding 'flashinfer/data/include/flashinfer/attention/hopper/epilogue.cuh' 2026-04-14T21:03:06,007 adding 'flashinfer/data/include/flashinfer/attention/hopper/kernel_traits.cuh' 2026-04-14T21:03:06,009 adding 'flashinfer/data/include/flashinfer/attention/hopper/mainloop.cuh' 2026-04-14T21:03:06,011 adding 'flashinfer/data/include/flashinfer/attention/hopper/mainloop_mma.cuh' 2026-04-14T21:03:06,013 adding 'flashinfer/data/include/flashinfer/attention/hopper/named_barrier.cuh' 2026-04-14T21:03:06,016 adding 'flashinfer/data/include/flashinfer/attention/hopper/prefill_sm90.cuh' 2026-04-14T21:03:06,018 adding 'flashinfer/data/include/flashinfer/attention/hopper/sparse_mainloop.cuh' 2026-04-14T21:03:06,020 adding 'flashinfer/data/include/flashinfer/attention/hopper/tile_scheduler.cuh' 2026-04-14T21:03:06,022 adding 'flashinfer/data/include/flashinfer/attention/hopper/utils.cuh' 2026-04-14T21:03:06,023 adding 'flashinfer/data/include/flashinfer/attention/hopper/variant_helper.cuh' 2026-04-14T21:03:06,025 adding 'flashinfer/data/include/flashinfer/attention/hopper/variants.cuh' 2026-04-14T21:03:06,027 adding 'flashinfer/data/include/flashinfer/attention/hopper/quantization/epilogue.cuh' 2026-04-14T21:03:06,029 adding 'flashinfer/data/include/flashinfer/attention/hopper/quantization/kernel_traits.cuh' 2026-04-14T21:03:06,031 adding 'flashinfer/data/include/flashinfer/attention/hopper/quantization/mainloop_load.cuh' 2026-04-14T21:03:06,033 adding 'flashinfer/data/include/flashinfer/attention/hopper/quantization/mainloop_mma.cuh' 2026-04-14T21:03:06,036 adding 'flashinfer/data/include/flashinfer/attention/hopper/quantization/mainloop_sparse_load.cuh' 2026-04-14T21:03:06,039 adding 'flashinfer/data/include/flashinfer/attention/hopper/quantization/prefill_sm90.cuh' 2026-04-14T21:03:06,046 adding 'flashinfer/data/include/flashinfer/comm/trtllm_allreduce.cuh' 2026-04-14T21:03:06,052 adding 'flashinfer/data/include/flashinfer/comm/trtllm_allreduce_fusion.cuh' 2026-04-14T21:03:06,056 adding 'flashinfer/data/include/flashinfer/comm/trtllm_alltoall.cuh' 2026-04-14T21:03:06,058 adding 'flashinfer/data/include/flashinfer/comm/trtllm_alltoall_prepare.cuh' 2026-04-14T21:03:06,063 adding 'flashinfer/data/include/flashinfer/comm/trtllm_mnnvl_allreduce.cuh' 2026-04-14T21:03:06,068 adding 'flashinfer/data/include/flashinfer/comm/trtllm_moe_allreduce_fusion.cuh' 2026-04-14T21:03:06,071 adding 'flashinfer/data/include/flashinfer/comm/vllm_custom_all_reduce.cuh' 2026-04-14T21:03:06,073 adding 'flashinfer/data/include/flashinfer/flat/common.hpp' 2026-04-14T21:03:06,075 adding 'flashinfer/data/include/flashinfer/flat/cute_ext.hpp' 2026-04-14T21:03:06,076 adding 'flashinfer/data/include/flashinfer/flat/debug.hpp' 2026-04-14T21:03:06,077 adding 'flashinfer/data/include/flashinfer/flat/math.hpp' 2026-04-14T21:03:06,079 adding 'flashinfer/data/include/flashinfer/flat/math_order_barrier.hpp' 2026-04-14T21:03:06,080 adding 'flashinfer/data/include/flashinfer/flat/type_traits.hpp' 2026-04-14T21:03:06,081 adding 'flashinfer/data/include/flashinfer/flat/unused.hpp' 2026-04-14T21:03:06,085 adding 'flashinfer/data/include/flashinfer/flat/ampere/collective/flat_collective_inverse.hpp' 2026-04-14T21:03:06,086 adding 'flashinfer/data/include/flashinfer/flat/ampere/collective/flat_collective_load.hpp' 2026-04-14T21:03:06,089 adding 'flashinfer/data/include/flashinfer/flat/hopper/collective/flat_collective_load.hpp' 2026-04-14T21:03:06,091 adding 'flashinfer/data/include/flashinfer/flat/hopper/collective/flat_collective_store.hpp' 2026-04-14T21:03:06,096 adding 'flashinfer/data/include/flashinfer/flat/hopper/collective/flat_collective_tma_warpspecialized_delta_rule.hpp' 2026-04-14T21:03:06,098 adding 'flashinfer/data/include/flashinfer/flat/hopper/collective/flat_common.hpp' 2026-04-14T21:03:06,100 adding 'flashinfer/data/include/flashinfer/flat/hopper/collective/flat_named_barriers.hpp' 2026-04-14T21:03:06,102 adding 'flashinfer/data/include/flashinfer/flat/hopper/device/device_universal.hpp' 2026-04-14T21:03:06,104 adding 'flashinfer/data/include/flashinfer/flat/hopper/kernel/flat_kernel_builder_delta_rule.hpp' 2026-04-14T21:03:06,106 adding 'flashinfer/data/include/flashinfer/flat/hopper/kernel/flat_kernel_tma_warpspecialized_delta_rule.hpp' 2026-04-14T21:03:06,108 adding 'flashinfer/data/include/flashinfer/flat/hopper/kernel/flat_options.hpp' 2026-04-14T21:03:06,109 adding 'flashinfer/data/include/flashinfer/flat/hopper/kernel/flat_tile_scheduler.hpp' 2026-04-14T21:03:06,111 adding 'flashinfer/data/include/flashinfer/flat/prefill/prefill_kernel.hpp' 2026-04-14T21:03:06,112 adding 'flashinfer/data/include/flashinfer/flat/prefill/prefill_kernel_delta_rule_sm90.cuh' 2026-04-14T21:03:06,115 adding 'flashinfer/data/include/flashinfer/gemm/bf16_gemm_cutlass.h' 2026-04-14T21:03:06,116 adding 'flashinfer/data/include/flashinfer/gemm/bf16_gemm_cutlass_template.h' 2026-04-14T21:03:06,118 adding 'flashinfer/data/include/flashinfer/gemm/bf16_gemm_template_sm100.h' 2026-04-14T21:03:06,120 adding 'flashinfer/data/include/flashinfer/gemm/bmm_fp8.cuh' 2026-04-14T21:03:06,122 adding 'flashinfer/data/include/flashinfer/gemm/cutlass_gemm_configs.h' 2026-04-14T21:03:06,124 adding 'flashinfer/data/include/flashinfer/gemm/dsv3_router_gemm.cuh' 2026-04-14T21:03:06,125 adding 'flashinfer/data/include/flashinfer/gemm/fp4_gemm_cutlass.h' 2026-04-14T21:03:06,127 adding 'flashinfer/data/include/flashinfer/gemm/fp4_gemm_cutlass_template.h' 2026-04-14T21:03:06,129 adding 'flashinfer/data/include/flashinfer/gemm/fp4_gemm_cutlass_template_sm103.h' 2026-04-14T21:03:06,131 adding 'flashinfer/data/include/flashinfer/gemm/fp4_gemm_cutlass_template_sm120.h' 2026-04-14T21:03:06,134 adding 'flashinfer/data/include/flashinfer/gemm/fp4_gemm_template_sm100.h' 2026-04-14T21:03:06,136 adding 'flashinfer/data/include/flashinfer/gemm/fp4_gemm_template_sm103.h' 2026-04-14T21:03:06,139 adding 'flashinfer/data/include/flashinfer/gemm/fp4_gemm_template_sm120.h' 2026-04-14T21:03:06,140 adding 'flashinfer/data/include/flashinfer/gemm/fp8_gemm_cutlass.h' 2026-04-14T21:03:06,142 adding 'flashinfer/data/include/flashinfer/gemm/fp8_gemm_cutlass_template.h' 2026-04-14T21:03:06,144 adding 'flashinfer/data/include/flashinfer/gemm/fp8_gemm_template_sm100.h' 2026-04-14T21:03:06,146 adding 'flashinfer/data/include/flashinfer/gemm/gemm_groupwise_sm100.cuh' 2026-04-14T21:03:06,147 adding 'flashinfer/data/include/flashinfer/gemm/gemm_groupwise_sm120.cuh' 2026-04-14T21:03:06,149 adding 'flashinfer/data/include/flashinfer/gemm/group_gemm.cuh' 2026-04-14T21:03:06,151 adding 'flashinfer/data/include/flashinfer/gemm/group_gemm_fp8_groupwise_sm100.cuh' 2026-04-14T21:03:06,153 adding 'flashinfer/data/include/flashinfer/gemm/group_gemm_fp8_groupwise_sm120.cuh' 2026-04-14T21:03:06,154 adding 'flashinfer/data/include/flashinfer/gemm/group_gemm_lora.cuh' 2026-04-14T21:03:06,156 adding 'flashinfer/data/include/flashinfer/gemm/group_gemm_mxfp4_groupwise_sm100.cuh' 2026-04-14T21:03:06,159 adding 'flashinfer/data/include/flashinfer/gemm/group_gemm_mxfp4_groupwise_sm120.cuh' 2026-04-14T21:03:06,162 adding 'flashinfer/data/include/flashinfer/gemm/group_gemm_nvfp4_groupwise_sm120.cuh' 2026-04-14T21:03:06,164 adding 'flashinfer/data/include/flashinfer/gemm/group_gemm_sm90.cuh' 2026-04-14T21:03:06,165 adding 'flashinfer/data/include/flashinfer/gemm/group_gemv.cuh' 2026-04-14T21:03:06,166 adding 'flashinfer/data/include/flashinfer/gemm/mxfp8_gemm_cutlass.h' 2026-04-14T21:03:06,168 adding 'flashinfer/data/include/flashinfer/gemm/mxfp8_gemm_cutlass_template.h' 2026-04-14T21:03:06,170 adding 'flashinfer/data/include/flashinfer/gemm/mxfp8_gemm_cutlass_template_sm120.h' 2026-04-14T21:03:06,172 adding 'flashinfer/data/include/flashinfer/gemm/mxfp8_gemm_template_sm100.h' 2026-04-14T21:03:06,174 adding 'flashinfer/data/include/flashinfer/gemm/mxfp8_gemm_template_sm120.h' 2026-04-14T21:03:06,184 adding 'flashinfer/data/include/flashinfer/gemm/tgv_gemm.cuh' 2026-04-14T21:03:06,185 adding 'flashinfer/data/include/flashinfer/gemm/tgv_gemm_configs.h' 2026-04-14T21:03:06,187 adding 'flashinfer/data/include/flashinfer/gemm/tgv_gemm_template.h' 2026-04-14T21:03:06,189 adding 'flashinfer/data/include/flashinfer/mamba/common.cuh' 2026-04-14T21:03:06,191 adding 'flashinfer/data/include/flashinfer/mamba/conversion.cuh' 2026-04-14T21:03:06,192 adding 'flashinfer/data/include/flashinfer/mamba/create_tensor_map.cuh' 2026-04-14T21:03:06,195 adding 'flashinfer/data/include/flashinfer/mamba/invoke_selective_state_update_mtp.cuh' 2026-04-14T21:03:06,197 adding 'flashinfer/data/include/flashinfer/mamba/kernel_selective_state_update_mtp_async_horizontal.cuh' 2026-04-14T21:03:06,201 adding 'flashinfer/data/include/flashinfer/mamba/kernel_selective_state_update_mtp_horizontal.cuh' 2026-04-14T21:03:06,204 adding 'flashinfer/data/include/flashinfer/mamba/kernel_selective_state_update_mtp_simple.cuh' 2026-04-14T21:03:06,207 adding 'flashinfer/data/include/flashinfer/mamba/kernel_selective_state_update_mtp_vertical.cuh' 2026-04-14T21:03:06,213 adding 'flashinfer/data/include/flashinfer/mamba/kernel_selective_state_update_stp.cuh' 2026-04-14T21:03:06,214 adding 'flashinfer/data/include/flashinfer/mamba/selective_state_update.cuh' 2026-04-14T21:03:06,216 adding 'flashinfer/data/include/flashinfer/mamba/seq_chunk_cumsum.cuh' 2026-04-14T21:03:06,218 adding 'flashinfer/data/include/flashinfer/mamba/ssu_mtp_common.cuh' 2026-04-14T21:03:06,221 adding 'flashinfer/data/include/flashinfer/norm/ln_fwd_silu_kernel.cuh' 2026-04-14T21:03:06,228 adding 'flashinfer/data/include/flashinfer/norm/ln_silu_headers.cuh' 2026-04-14T21:03:06,231 adding 'flashinfer/data/include/flashinfer/trtllm/common.h' 2026-04-14T21:03:06,233 adding 'flashinfer/data/include/flashinfer/trtllm/batched_gemm/KernelRunner.h' 2026-04-14T21:03:06,236 adding 'flashinfer/data/include/flashinfer/trtllm/common/cudaBf16Fallbacks.cuh' 2026-04-14T21:03:06,237 adding 'flashinfer/data/include/flashinfer/trtllm/common/cudaBf16Wrapper.h' 2026-04-14T21:03:06,238 adding 'flashinfer/data/include/flashinfer/trtllm/common/cudaFp8Utils.h' 2026-04-14T21:03:06,240 adding 'flashinfer/data/include/flashinfer/trtllm/common/cudaTypeUtils.cuh' 2026-04-14T21:03:06,243 adding 'flashinfer/data/include/flashinfer/trtllm/common/cudaUtils.h' 2026-04-14T21:03:06,245 adding 'flashinfer/data/include/flashinfer/trtllm/common/reduceKernelUtils.cuh' 2026-04-14T21:03:06,247 adding 'flashinfer/data/include/flashinfer/trtllm/fmha/decoder_impl_common.h' 2026-04-14T21:03:06,249 adding 'flashinfer/data/include/flashinfer/trtllm/fmha/decoder_params.h' 2026-04-14T21:03:06,254 adding 'flashinfer/data/include/flashinfer/trtllm/fmha/fmhaKernels.cuh' 2026-04-14T21:03:06,256 adding 'flashinfer/data/include/flashinfer/trtllm/fmha/fmhaReduction.h' 2026-04-14T21:03:06,257 adding 'flashinfer/data/include/flashinfer/trtllm/fmha/fmhaRunner.cuh' 2026-04-14T21:03:06,259 adding 'flashinfer/data/include/flashinfer/trtllm/fmha/fmhaRunnerParams.h' 2026-04-14T21:03:06,264 adding 'flashinfer/data/include/flashinfer/trtllm/fmha/kernelParams.h' 2026-04-14T21:03:06,265 adding 'flashinfer/data/include/flashinfer/trtllm/fmha/kernelUtils.h' 2026-04-14T21:03:06,267 adding 'flashinfer/data/include/flashinfer/trtllm/fmha/lse.cuh' 2026-04-14T21:03:06,269 adding 'flashinfer/data/include/flashinfer/trtllm/fused_moe/DevKernel.h' 2026-04-14T21:03:06,271 adding 'flashinfer/data/include/flashinfer/trtllm/fused_moe/IntFastDiv.h' 2026-04-14T21:03:06,274 adding 'flashinfer/data/include/flashinfer/trtllm/fused_moe/RoutingCustomPolicy.cuh' 2026-04-14T21:03:06,276 adding 'flashinfer/data/include/flashinfer/trtllm/fused_moe/RoutingDevKernel.h' 2026-04-14T21:03:06,281 adding 'flashinfer/data/include/flashinfer/trtllm/fused_moe/RoutingKernel.cuh' 2026-04-14T21:03:06,283 adding 'flashinfer/data/include/flashinfer/trtllm/fused_moe/RoutingKernel.h' 2026-04-14T21:03:06,285 adding 'flashinfer/data/include/flashinfer/trtllm/fused_moe/RoutingKernelTopK.cuh' 2026-04-14T21:03:06,287 adding 'flashinfer/data/include/flashinfer/trtllm/fused_moe/noAuxTcKernels.h' 2026-04-14T21:03:06,289 adding 'flashinfer/data/include/flashinfer/trtllm/fused_moe/runner.h' 2026-04-14T21:03:06,292 adding 'flashinfer/data/spdlog/include/spdlog/async.h' 2026-04-14T21:03:06,294 adding 'flashinfer/data/spdlog/include/spdlog/async_logger-inl.h' 2026-04-14T21:03:06,295 adding 'flashinfer/data/spdlog/include/spdlog/async_logger.h' 2026-04-14T21:03:06,296 adding 'flashinfer/data/spdlog/include/spdlog/common-inl.h' 2026-04-14T21:03:06,298 adding 'flashinfer/data/spdlog/include/spdlog/common.h' 2026-04-14T21:03:06,300 adding 'flashinfer/data/spdlog/include/spdlog/formatter.h' 2026-04-14T21:03:06,301 adding 'flashinfer/data/spdlog/include/spdlog/fwd.h' 2026-04-14T21:03:06,302 adding 'flashinfer/data/spdlog/include/spdlog/logger-inl.h' 2026-04-14T21:03:06,304 adding 'flashinfer/data/spdlog/include/spdlog/logger.h' 2026-04-14T21:03:06,306 adding 'flashinfer/data/spdlog/include/spdlog/mdc.h' 2026-04-14T21:03:06,310 adding 'flashinfer/data/spdlog/include/spdlog/pattern_formatter-inl.h' 2026-04-14T21:03:06,311 adding 'flashinfer/data/spdlog/include/spdlog/pattern_formatter.h' 2026-04-14T21:03:06,313 adding 'flashinfer/data/spdlog/include/spdlog/spdlog-inl.h' 2026-04-14T21:03:06,315 adding 'flashinfer/data/spdlog/include/spdlog/spdlog.h' 2026-04-14T21:03:06,316 adding 'flashinfer/data/spdlog/include/spdlog/stopwatch.h' 2026-04-14T21:03:06,317 adding 'flashinfer/data/spdlog/include/spdlog/tweakme.h' 2026-04-14T21:03:06,319 adding 'flashinfer/data/spdlog/include/spdlog/version.h' 2026-04-14T21:03:06,321 adding 'flashinfer/data/spdlog/include/spdlog/cfg/argv.h' 2026-04-14T21:03:06,322 adding 'flashinfer/data/spdlog/include/spdlog/cfg/env.h' 2026-04-14T21:03:06,323 adding 'flashinfer/data/spdlog/include/spdlog/cfg/helpers-inl.h' 2026-04-14T21:03:06,325 adding 'flashinfer/data/spdlog/include/spdlog/cfg/helpers.h' 2026-04-14T21:03:06,327 adding 'flashinfer/data/spdlog/include/spdlog/details/backtracer-inl.h' 2026-04-14T21:03:06,329 adding 'flashinfer/data/spdlog/include/spdlog/details/backtracer.h' 2026-04-14T21:03:06,330 adding 'flashinfer/data/spdlog/include/spdlog/details/circular_q.h' 2026-04-14T21:03:06,331 adding 'flashinfer/data/spdlog/include/spdlog/details/console_globals.h' 2026-04-14T21:03:06,333 adding 'flashinfer/data/spdlog/include/spdlog/details/file_helper-inl.h' 2026-04-14T21:03:06,334 adding 'flashinfer/data/spdlog/include/spdlog/details/file_helper.h' 2026-04-14T21:03:06,336 adding 'flashinfer/data/spdlog/include/spdlog/details/fmt_helper.h' 2026-04-14T21:03:06,337 adding 'flashinfer/data/spdlog/include/spdlog/details/log_msg-inl.h' 2026-04-14T21:03:06,338 adding 'flashinfer/data/spdlog/include/spdlog/details/log_msg.h' 2026-04-14T21:03:06,339 adding 'flashinfer/data/spdlog/include/spdlog/details/log_msg_buffer-inl.h' 2026-04-14T21:03:06,340 adding 'flashinfer/data/spdlog/include/spdlog/details/log_msg_buffer.h' 2026-04-14T21:03:06,342 adding 'flashinfer/data/spdlog/include/spdlog/details/mpmc_blocking_q.h' 2026-04-14T21:03:06,343 adding 'flashinfer/data/spdlog/include/spdlog/details/null_mutex.h' 2026-04-14T21:03:06,345 adding 'flashinfer/data/spdlog/include/spdlog/details/os-inl.h' 2026-04-14T21:03:06,347 adding 'flashinfer/data/spdlog/include/spdlog/details/os.h' 2026-04-14T21:03:06,348 adding 'flashinfer/data/spdlog/include/spdlog/details/periodic_worker-inl.h' 2026-04-14T21:03:06,349 adding 'flashinfer/data/spdlog/include/spdlog/details/periodic_worker.h' 2026-04-14T21:03:06,351 adding 'flashinfer/data/spdlog/include/spdlog/details/registry-inl.h' 2026-04-14T21:03:06,352 adding 'flashinfer/data/spdlog/include/spdlog/details/registry.h' 2026-04-14T21:03:06,354 adding 'flashinfer/data/spdlog/include/spdlog/details/synchronous_factory.h' 2026-04-14T21:03:06,355 adding 'flashinfer/data/spdlog/include/spdlog/details/tcp_client-windows.h' 2026-04-14T21:03:06,357 adding 'flashinfer/data/spdlog/include/spdlog/details/tcp_client.h' 2026-04-14T21:03:06,358 adding 'flashinfer/data/spdlog/include/spdlog/details/thread_pool-inl.h' 2026-04-14T21:03:06,359 adding 'flashinfer/data/spdlog/include/spdlog/details/thread_pool.h' 2026-04-14T21:03:06,361 adding 'flashinfer/data/spdlog/include/spdlog/details/udp_client-windows.h' 2026-04-14T21:03:06,362 adding 'flashinfer/data/spdlog/include/spdlog/details/udp_client.h' 2026-04-14T21:03:06,363 adding 'flashinfer/data/spdlog/include/spdlog/details/windows_include.h' 2026-04-14T21:03:06,365 adding 'flashinfer/data/spdlog/include/spdlog/fmt/bin_to_hex.h' 2026-04-14T21:03:06,367 adding 'flashinfer/data/spdlog/include/spdlog/fmt/chrono.h' 2026-04-14T21:03:06,368 adding 'flashinfer/data/spdlog/include/spdlog/fmt/compile.h' 2026-04-14T21:03:06,369 adding 'flashinfer/data/spdlog/include/spdlog/fmt/fmt.h' 2026-04-14T21:03:06,370 adding 'flashinfer/data/spdlog/include/spdlog/fmt/ostr.h' 2026-04-14T21:03:06,372 adding 'flashinfer/data/spdlog/include/spdlog/fmt/ranges.h' 2026-04-14T21:03:06,373 adding 'flashinfer/data/spdlog/include/spdlog/fmt/std.h' 2026-04-14T21:03:06,374 adding 'flashinfer/data/spdlog/include/spdlog/fmt/xchar.h' 2026-04-14T21:03:06,377 adding 'flashinfer/data/spdlog/include/spdlog/fmt/bundled/args.h' 2026-04-14T21:03:06,384 adding 'flashinfer/data/spdlog/include/spdlog/fmt/bundled/chrono.h' 2026-04-14T21:03:06,388 adding 'flashinfer/data/spdlog/include/spdlog/fmt/bundled/color.h' 2026-04-14T21:03:06,390 adding 'flashinfer/data/spdlog/include/spdlog/fmt/bundled/compile.h' 2026-04-14T21:03:06,403 adding 'flashinfer/data/spdlog/include/spdlog/fmt/bundled/core.h' 2026-04-14T21:03:06,405 adding 'flashinfer/data/spdlog/include/spdlog/fmt/bundled/fmt.license.rst' 2026-04-14T21:03:06,414 adding 'flashinfer/data/spdlog/include/spdlog/fmt/bundled/format-inl.h' 2026-04-14T21:03:06,435 adding 'flashinfer/data/spdlog/include/spdlog/fmt/bundled/format.h' 2026-04-14T21:03:06,437 adding 'flashinfer/data/spdlog/include/spdlog/fmt/bundled/locale.h' 2026-04-14T21:03:06,439 adding 'flashinfer/data/spdlog/include/spdlog/fmt/bundled/os.h' 2026-04-14T21:03:06,441 adding 'flashinfer/data/spdlog/include/spdlog/fmt/bundled/ostream.h' 2026-04-14T21:03:06,444 adding 'flashinfer/data/spdlog/include/spdlog/fmt/bundled/printf.h' 2026-04-14T21:03:06,447 adding 'flashinfer/data/spdlog/include/spdlog/fmt/bundled/ranges.h' 2026-04-14T21:03:06,449 adding 'flashinfer/data/spdlog/include/spdlog/fmt/bundled/std.h' 2026-04-14T21:03:06,451 adding 'flashinfer/data/spdlog/include/spdlog/fmt/bundled/xchar.h' 2026-04-14T21:03:06,454 adding 'flashinfer/data/spdlog/include/spdlog/sinks/android_sink.h' 2026-04-14T21:03:06,455 adding 'flashinfer/data/spdlog/include/spdlog/sinks/ansicolor_sink-inl.h' 2026-04-14T21:03:06,457 adding 'flashinfer/data/spdlog/include/spdlog/sinks/ansicolor_sink.h' 2026-04-14T21:03:06,458 adding 'flashinfer/data/spdlog/include/spdlog/sinks/base_sink-inl.h' 2026-04-14T21:03:06,459 adding 'flashinfer/data/spdlog/include/spdlog/sinks/base_sink.h' 2026-04-14T21:03:06,460 adding 'flashinfer/data/spdlog/include/spdlog/sinks/basic_file_sink-inl.h' 2026-04-14T21:03:06,462 adding 'flashinfer/data/spdlog/include/spdlog/sinks/basic_file_sink.h' 2026-04-14T21:03:06,463 adding 'flashinfer/data/spdlog/include/spdlog/sinks/callback_sink.h' 2026-04-14T21:03:06,465 adding 'flashinfer/data/spdlog/include/spdlog/sinks/daily_file_sink.h' 2026-04-14T21:03:06,466 adding 'flashinfer/data/spdlog/include/spdlog/sinks/dist_sink.h' 2026-04-14T21:03:06,467 adding 'flashinfer/data/spdlog/include/spdlog/sinks/dup_filter_sink.h' 2026-04-14T21:03:06,469 adding 'flashinfer/data/spdlog/include/spdlog/sinks/hourly_file_sink.h' 2026-04-14T21:03:06,470 adding 'flashinfer/data/spdlog/include/spdlog/sinks/kafka_sink.h' 2026-04-14T21:03:06,472 adding 'flashinfer/data/spdlog/include/spdlog/sinks/mongo_sink.h' 2026-04-14T21:03:06,473 adding 'flashinfer/data/spdlog/include/spdlog/sinks/msvc_sink.h' 2026-04-14T21:03:06,474 adding 'flashinfer/data/spdlog/include/spdlog/sinks/null_sink.h' 2026-04-14T21:03:06,476 adding 'flashinfer/data/spdlog/include/spdlog/sinks/ostream_sink.h' 2026-04-14T21:03:06,477 adding 'flashinfer/data/spdlog/include/spdlog/sinks/qt_sinks.h' 2026-04-14T21:03:06,479 adding 'flashinfer/data/spdlog/include/spdlog/sinks/ringbuffer_sink.h' 2026-04-14T21:03:06,480 adding 'flashinfer/data/spdlog/include/spdlog/sinks/rotating_file_sink-inl.h' 2026-04-14T21:03:06,482 adding 'flashinfer/data/spdlog/include/spdlog/sinks/rotating_file_sink.h' 2026-04-14T21:03:06,483 adding 'flashinfer/data/spdlog/include/spdlog/sinks/sink-inl.h' 2026-04-14T21:03:06,484 adding 'flashinfer/data/spdlog/include/spdlog/sinks/sink.h' 2026-04-14T21:03:06,485 adding 'flashinfer/data/spdlog/include/spdlog/sinks/stdout_color_sinks-inl.h' 2026-04-14T21:03:06,487 adding 'flashinfer/data/spdlog/include/spdlog/sinks/stdout_color_sinks.h' 2026-04-14T21:03:06,488 adding 'flashinfer/data/spdlog/include/spdlog/sinks/stdout_sinks-inl.h' 2026-04-14T21:03:06,489 adding 'flashinfer/data/spdlog/include/spdlog/sinks/stdout_sinks.h' 2026-04-14T21:03:06,491 adding 'flashinfer/data/spdlog/include/spdlog/sinks/syslog_sink.h' 2026-04-14T21:03:06,492 adding 'flashinfer/data/spdlog/include/spdlog/sinks/systemd_sink.h' 2026-04-14T21:03:06,494 adding 'flashinfer/data/spdlog/include/spdlog/sinks/tcp_sink.h' 2026-04-14T21:03:06,495 adding 'flashinfer/data/spdlog/include/spdlog/sinks/udp_sink.h' 2026-04-14T21:03:06,497 adding 'flashinfer/data/spdlog/include/spdlog/sinks/win_eventlog_sink.h' 2026-04-14T21:03:06,498 adding 'flashinfer/data/spdlog/include/spdlog/sinks/wincolor_sink-inl.h' 2026-04-14T21:03:06,500 adding 'flashinfer/data/spdlog/include/spdlog/sinks/wincolor_sink.h' 2026-04-14T21:03:06,501 adding 'flashinfer/data/spdlog/scripts/extract_version.py' 2026-04-14T21:03:06,503 adding 'flashinfer/dsv3_ops/__init__.py' 2026-04-14T21:03:06,505 adding 'flashinfer/fused_moe/__init__.py' 2026-04-14T21:03:06,513 adding 'flashinfer/fused_moe/core.py' 2026-04-14T21:03:06,516 adding 'flashinfer/fused_moe/fused_routing_dsv3.py' 2026-04-14T21:03:06,517 adding 'flashinfer/fused_moe/utils.py' 2026-04-14T21:03:06,519 adding 'flashinfer/fused_moe/cute_dsl/__init__.py' 2026-04-14T21:03:06,522 adding 'flashinfer/fused_moe/cute_dsl/blockscaled_contiguous_gather_grouped_gemm_swiglu_fusion.py' 2026-04-14T21:03:06,525 adding 'flashinfer/fused_moe/cute_dsl/blockscaled_contiguous_grouped_gemm_finalize_fusion.py' 2026-04-14T21:03:06,529 adding 'flashinfer/fused_moe/cute_dsl/fused_moe.py' 2026-04-14T21:03:06,532 adding 'flashinfer/fused_moe/cute_dsl/moe_utils.py' 2026-04-14T21:03:06,534 adding 'flashinfer/fused_moe/cute_dsl/tuner.py' 2026-04-14T21:03:06,536 adding 'flashinfer/fused_moe/cute_dsl/blackwell/__init__.py' 2026-04-14T21:03:06,549 adding 'flashinfer/fused_moe/cute_dsl/blackwell/blockscaled_contiguous_gather_grouped_gemm_swiglu_fusion.py' 2026-04-14T21:03:06,560 adding 'flashinfer/fused_moe/cute_dsl/blackwell/blockscaled_contiguous_grouped_gemm_finalize_fusion.py' 2026-04-14T21:03:06,563 adding 'flashinfer/fused_moe/cute_dsl/blackwell/custom_pipeline.py' 2026-04-14T21:03:06,565 adding 'flashinfer/fused_moe/cute_dsl/blackwell/utils.py' 2026-04-14T21:03:06,567 adding 'flashinfer/gdn_kernels/__init__.py' 2026-04-14T21:03:06,575 adding 'flashinfer/gdn_kernels/gdn_decode_bf16_state.py' 2026-04-14T21:03:06,583 adding 'flashinfer/gdn_kernels/gdn_decode_mtp.py' 2026-04-14T21:03:06,586 adding 'flashinfer/gdn_kernels/gdn_decode_nontranspose.py' 2026-04-14T21:03:06,589 adding 'flashinfer/gdn_kernels/gdn_decode_pretranspose.py' 2026-04-14T21:03:06,591 adding 'flashinfer/gdn_kernels/blackwell/__init__.py' 2026-04-14T21:03:06,605 adding 'flashinfer/gdn_kernels/blackwell/gated_delta_net_chunked.py' 2026-04-14T21:03:06,607 adding 'flashinfer/gdn_kernels/blackwell/gated_delta_net_tile_scheduler.py' 2026-04-14T21:03:06,609 adding 'flashinfer/gdn_kernels/blackwell/gdn_prefill.py' 2026-04-14T21:03:06,611 adding 'flashinfer/gemm/__init__.py' 2026-04-14T21:03:06,633 adding 'flashinfer/gemm/gemm_base.py' 2026-04-14T21:03:06,636 adding 'flashinfer/gemm/routergemm.py' 2026-04-14T21:03:06,638 adding 'flashinfer/gemm/kernels/__init__.py' 2026-04-14T21:03:06,646 adding 'flashinfer/gemm/kernels/dense_blockscaled_gemm_sm100.py' 2026-04-14T21:03:06,656 adding 'flashinfer/gemm/kernels/dense_blockscaled_gemm_sm103.py' 2026-04-14T21:03:06,667 adding 'flashinfer/gemm/kernels/grouped_gemm_masked_blackwell.py' 2026-04-14T21:03:06,670 adding 'flashinfer/jit/__init__.py' 2026-04-14T21:03:06,671 adding 'flashinfer/jit/activation.py' 2026-04-14T21:03:06,673 adding 'flashinfer/jit/cascade.py' 2026-04-14T21:03:06,674 adding 'flashinfer/jit/comm.py' 2026-04-14T21:03:06,676 adding 'flashinfer/jit/core.py' 2026-04-14T21:03:06,678 adding 'flashinfer/jit/cpp_ext.py' 2026-04-14T21:03:06,681 adding 'flashinfer/jit/cubin_loader.py' 2026-04-14T21:03:06,682 adding 'flashinfer/jit/dsv3_optimizations.py' 2026-04-14T21:03:06,683 adding 'flashinfer/jit/env.py' 2026-04-14T21:03:06,685 adding 'flashinfer/jit/fp4_kv_dequantization.py' 2026-04-14T21:03:06,686 adding 'flashinfer/jit/fp4_kv_quantization.py' 2026-04-14T21:03:06,687 adding 'flashinfer/jit/fp4_quantization.py' 2026-04-14T21:03:06,688 adding 'flashinfer/jit/fp8_quantization.py' 2026-04-14T21:03:06,690 adding 'flashinfer/jit/fused_moe.py' 2026-04-14T21:03:06,692 adding 'flashinfer/jit/gdn.py' 2026-04-14T21:03:06,693 adding 'flashinfer/jit/mla.py' 2026-04-14T21:03:06,695 adding 'flashinfer/jit/moe_utils.py' 2026-04-14T21:03:06,697 adding 'flashinfer/jit/norm.py' 2026-04-14T21:03:06,698 adding 'flashinfer/jit/page.py' 2026-04-14T21:03:06,699 adding 'flashinfer/jit/quantization.py' 2026-04-14T21:03:06,701 adding 'flashinfer/jit/rmsnorm_silu.py' 2026-04-14T21:03:06,703 adding 'flashinfer/jit/rope.py' 2026-04-14T21:03:06,704 adding 'flashinfer/jit/sampling.py' 2026-04-14T21:03:06,705 adding 'flashinfer/jit/spdlog.py' 2026-04-14T21:03:06,706 adding 'flashinfer/jit/tinygemm2.py' 2026-04-14T21:03:06,708 adding 'flashinfer/jit/tllm_utils.py' 2026-04-14T21:03:06,710 adding 'flashinfer/jit/topk.py' 2026-04-14T21:03:06,711 adding 'flashinfer/jit/utils.py' 2026-04-14T21:03:06,713 adding 'flashinfer/jit/xqa.py' 2026-04-14T21:03:06,720 adding 'flashinfer/jit/attention/__init__.py' 2026-04-14T21:03:06,724 adding 'flashinfer/jit/attention/modules.py' 2026-04-14T21:03:06,729 adding 'flashinfer/jit/attention/utils.py' 2026-04-14T21:03:06,731 adding 'flashinfer/jit/attention/variants.py' 2026-04-14T21:03:06,736 adding 'flashinfer/jit/attention/fmha_v2/fmha_library.py' 2026-04-14T21:03:06,740 adding 'flashinfer/jit/attention/fmha_v2/generate_kernels.py' 2026-04-14T21:03:06,769 adding 'flashinfer/jit/attention/fmha_v2/generator_utils.py' 2026-04-14T21:03:06,774 adding 'flashinfer/jit/attention/fmha_v2/utils.py' 2026-04-14T21:03:06,778 adding 'flashinfer/jit/gemm/__init__.py' 2026-04-14T21:03:06,780 adding 'flashinfer/jit/gemm/core.py' 2026-04-14T21:03:06,781 adding 'flashinfer/jit/gemm/deepgemm.py' 2026-04-14T21:03:06,783 adding 'flashinfer/jit/gemm/fp8_blockscale.py' 2026-04-14T21:03:06,784 adding 'flashinfer/jit/gemm/cutlass/__init__.py' 2026-04-14T21:03:06,788 adding 'flashinfer/jit/gemm/cutlass/cutlass_library.py' 2026-04-14T21:03:06,793 adding 'flashinfer/jit/gemm/cutlass/generate_kernels.py' 2026-04-14T21:03:06,795 adding 'flashinfer/jit/mamba/__init__.py' 2026-04-14T21:03:06,796 adding 'flashinfer/jit/mamba/selective_state_update.py' 2026-04-14T21:03:06,798 adding 'flashinfer/jit/mamba/seq_chunk_cumsum.py' 2026-04-14T21:03:06,800 adding 'flashinfer/logits_processor/__init__.py' 2026-04-14T21:03:06,801 adding 'flashinfer/logits_processor/compiler.py' 2026-04-14T21:03:06,803 adding 'flashinfer/logits_processor/fusion_rules.py' 2026-04-14T21:03:06,805 adding 'flashinfer/logits_processor/legalization.py' 2026-04-14T21:03:06,806 adding 'flashinfer/logits_processor/op.py' 2026-04-14T21:03:06,808 adding 'flashinfer/logits_processor/operators.py' 2026-04-14T21:03:06,809 adding 'flashinfer/logits_processor/pipeline.py' 2026-04-14T21:03:06,811 adding 'flashinfer/logits_processor/processors.py' 2026-04-14T21:03:06,813 adding 'flashinfer/logits_processor/types.py' 2026-04-14T21:03:06,814 adding 'flashinfer/logits_processor/validators.py' 2026-04-14T21:03:06,816 adding 'flashinfer/mamba/__init__.py' 2026-04-14T21:03:06,818 adding 'flashinfer/mamba/selective_state_update.py' 2026-04-14T21:03:06,821 adding 'flashinfer/mamba/ssd_combined.py' 2026-04-14T21:03:06,835 adding 'flashinfer/mamba/ssd_kernel.py' 2026-04-14T21:03:06,838 adding 'flashinfer/mamba/ssd_tile_scheduler.py' 2026-04-14T21:03:06,839 adding 'flashinfer/mla/__init__.py' 2026-04-14T21:03:06,843 adding 'flashinfer/mla/_core.py' 2026-04-14T21:03:06,847 adding 'flashinfer/norm/__init__.py' 2026-04-14T21:03:06,850 adding 'flashinfer/norm/utils.py' 2026-04-14T21:03:06,852 adding 'flashinfer/norm/kernels/__init__.py' 2026-04-14T21:03:06,855 adding 'flashinfer/norm/kernels/fused_add_rmsnorm.py' 2026-04-14T21:03:06,857 adding 'flashinfer/norm/kernels/layernorm.py' 2026-04-14T21:03:06,861 adding 'flashinfer/norm/kernels/rmsnorm.py' 2026-04-14T21:03:06,863 adding 'flashinfer/profiler/__init__.py' 2026-04-14T21:03:06,865 adding 'flashinfer/quantization/__init__.py' 2026-04-14T21:03:06,870 adding 'flashinfer/quantization/fp4_quantization.py' 2026-04-14T21:03:06,872 adding 'flashinfer/quantization/fp8_quantization.py' 2026-04-14T21:03:06,874 adding 'flashinfer/quantization/packbits.py' 2026-04-14T21:03:06,878 adding 'flashinfer/quantization/quantization_cute_dsl_utils.py' 2026-04-14T21:03:06,880 adding 'flashinfer/quantization/kernels/__init__.py' 2026-04-14T21:03:06,884 adding 'flashinfer/quantization/kernels/mxfp4_quantize.py' 2026-04-14T21:03:06,887 adding 'flashinfer/quantization/kernels/mxfp8_quantize.py' 2026-04-14T21:03:06,893 adding 'flashinfer/quantization/kernels/nvfp4_quantize.py' 2026-04-14T21:03:06,895 adding 'flashinfer/testing/__init__.py' 2026-04-14T21:03:06,901 adding 'flashinfer/testing/utils.py' 2026-04-14T21:03:06,903 adding 'flashinfer/triton/__init__.py' 2026-04-14T21:03:06,904 adding 'flashinfer/triton/activation.py' 2026-04-14T21:03:06,906 adding 'flashinfer/triton/cascade.py' 2026-04-14T21:03:06,907 adding 'flashinfer/triton/gemm.py' 2026-04-14T21:03:06,908 adding 'flashinfer/triton/norm.py' 2026-04-14T21:03:06,910 adding 'flashinfer/triton/page.py' 2026-04-14T21:03:06,911 adding 'flashinfer/triton/sm_constraint_gemm.py' 2026-04-14T21:03:06,913 adding 'flashinfer/triton/utils.py' 2026-04-14T21:03:06,915 adding 'flashinfer/triton/kernels/__init__.py' 2026-04-14T21:03:06,916 adding 'flashinfer/triton/kernels/activation.py' 2026-04-14T21:03:06,918 adding 'flashinfer/triton/kernels/cascade.py' 2026-04-14T21:03:06,919 adding 'flashinfer/triton/kernels/norm.py' 2026-04-14T21:03:06,920 adding 'flashinfer/triton/kernels/quant.py' 2026-04-14T21:03:06,922 adding 'flashinfer/triton/kernels/sm_constraint_gemm.py' 2026-04-14T21:03:06,924 adding 'flashinfer/triton/kernels/ssd_chunk_state.py' 2026-04-14T21:03:06,925 adding 'flashinfer/tuning_configs/v0_1_trtllm_fused_moe_NVIDIA_B200.py' 2026-04-14T21:03:06,927 adding 'flashinfer/tuning_configs/v0_1_trtllm_fused_moe_NVIDIA_GB200.py' 2026-04-14T21:03:06,930 adding 'flashinfer_python-0.6.8rc1.dist-info/licenses/LICENSE' 2026-04-14T21:03:06,932 adding 'flashinfer_python-0.6.8rc1.dist-info/METADATA' 2026-04-14T21:03:06,933 adding 'flashinfer_python-0.6.8rc1.dist-info/WHEEL' 2026-04-14T21:03:06,934 adding 'flashinfer_python-0.6.8rc1.dist-info/entry_points.txt' 2026-04-14T21:03:06,935 adding 'flashinfer_python-0.6.8rc1.dist-info/top_level.txt' 2026-04-14T21:03:06,977 adding 'flashinfer_python-0.6.8rc1.dist-info/RECORD' 2026-04-14T21:03:07,095 removing build/bdist.linux-armv7l/wheel 2026-04-14T21:03:07,811 Building wheel for flashinfer-python (pyproject.toml): finished with status 'done' 2026-04-14T21:03:08,040 Created wheel for flashinfer-python: filename=flashinfer_python-0.6.8rc1-py3-none-any.whl size=9383164 sha256=03391613bde22d44aa6cc1c6e53ba651f46f264a4dee2b4b6b1088a35f872db7 2026-04-14T21:03:08,042 Stored in directory: /tmp/pip-ephem-wheel-cache-r285t4ef/wheels/7b/a9/52/a1b53269a7b7838897e3dfd41299715fcec4d0e6dbea2fc30a 2026-04-14T21:03:08,123 Successfully built flashinfer-python 2026-04-14T21:03:08,356 Removed build tracker: '/tmp/pip-build-tracker-vct4xkk1'