{"author":"Francis Lam","author_email":"flam@alum.mit.edu","author_time":1711243062,"commit_time":1711243062,"committer":"GitHub","committer_email":"noreply@github.com","hash":"0145366323116568d0b1a02d3df2e14a307f00c7","message":"wmma: fix the AMD TC threads to split the first 16 threads (#3904)\n\npreviously it was incorrectly aliasing 16 into the size 8 upcast\r\non the store alias.  now it splits it properly into 8 and the\r\nremaining 2 into the correct local stride","parents":["7c3632fd1e6dd95e39fcce18687c7f1d8016b9b8"],"tree_hash":"13e1a9a770f2361b69f35fb4341cf19f79869030"}