A separate contribution was noted in which a user built a fused GEMM for int4 that is efficient for training with fixed sequence lengths, offering the fastest option.
Tweet from Robert Graham (@ErrataRob): nVidia is in exactly the same posture as Sun Microsystems was during the early days of the dot-com bubble. Sun experie