Commits on master
- af56f0e fix HSA/KFD load for system-wide installation (#4218) 1 year ago
- 12339f6 disable cuda test in ci (#4630) 1 year ago
- 9a9963b Remove uops deepcopy from PTX (#4671) 1 year ago
- 47aba47 update Torch.gather api (#4692) 1 year ago
- 792a494 fix various examples (#4691) 1 year ago
- 30b07f3 reduce ops (#4690) 1 year ago
- a46be6c docs for transpose (#4689) 1 year ago
- 86da83f move movement op docs (#4688) 1 year ago
- 498cf3e fuzzer path search for DEFINE_ACC (#4656) 1 year ago
- f11a81f isolated test for BEAM=2 llama wrong uops toposort (#4687) 1 year ago
- 6020595 more tensor.py docs (#4686) 1 year ago
- 721f9f6 test/external/verify_kernel: fix LOGKERNS variable name in comments (#4685) 1 year ago
- f8f9756 remove File Specific Variables from env_vars.md (#4684) 1 year ago
- 225dcab prepend `_` to broadcast_shape and deepwalk (#4683) 1 year ago
- c5f5755 correctness test for multireduce nested locals (#4682) 1 year ago
- bc9be39 set timeout in search _try_compile_linearized_w_idx (#4677) 1 year ago
- d12d412 revert uops dtype in pattern matcher (#4681) 1 year ago
- acc0039 Resume fix + scheduler for non weight decay params (#4679) 1 year ago
- 0f21aa0 example kernel that triggers Memory access fault for resnet on red (#4678) 1 year ago
- 5f84cbb keep UOps.CAST in PHI-GEP fold for unmatching dtypes (#4674) 1 year ago
- 458a396 catch compile errors in uops tests (#4672) 1 year ago
- 0043249 feat: tinyboxgreen (#4366) 1 year ago
- de733d7 Multireduce Linearizer Tests (#4665) 1 year ago
- 5e3fbbb llama3 example add manual seed and log seed (#4667) 1 year ago
- 8c99cc1 remove link to old adding_new_accelerators.md (#4666) 1 year ago
- c4089d1 update BEAM_LOCAL_MAX to 1024 (#4664) 1 year ago
- 704cb1d fix conversation.py quantize (#4663) 1 year ago
- ae86132 update llama sample for mac 32 input buffer limit (#4662) 1 year ago
- 993091a loss scaler + nan fixes (#4661) 1 year ago
- b33c827 UOps.RANGE toposort spec (#4660) 1 year ago