FAQ
Flash attention is an option for accelerating training and inference. Only NVIDIA GPUs of Turing, Ampere, Ada, and Hopper architecture, e.g., H100, A100, RTX 3090, T4, RTX 2080, can support flash atte...
Explore
125,247 skills indexed with the new KISS metadata standard.
Flash attention is an option for accelerating training and inference. Only NVIDIA GPUs of Turing, Ampere, Ada, and Hopper architecture, e.g., H100, A100, RTX 3090, T4, RTX 2080, can support flash atte...
*.so
*.so
/test
*.swp
中文README.
Read this in English.
In order to make the contribution process as smooth as possible, we have established some
generic skill
pycache/
We are happy to accept your contributions to make this repo better and more awesome! To avoid unnecessary work on either
English | 中文
English | 中文
🇨🇳中文 | 🌐English | 📖文档/Docs | ❓提问/Issues | 💬讨论/Discussions | [⚔️
🇨🇳中文 | 🌐English | 📖文档/Docs | ❓提问/Issues | 💬讨论/Discussions | [⚔️
*/.DS_Store
generic skill
为了保证文件的完整性,请一定要检查下列文件SHA256值的一致性。
🇨🇳中文 | 🌐English | 📖文档/Docs | ❓提问/Issues | 💬讨论/Discussions | [⚔️竞技场/Ar
Version 2.0, January 2004