To enable more open-source research on instruction following large language models, we use generate 52K instruction-followng demonstrations using OpenAI's text-davinci-003 model.

Feb 1, 2026

General

PromptBeginner5 minmarkdownQuality: 45

Alpaca Model Card

Organization developing the model

Feb 1, 2026

General

PromptBeginner5 minmarkdownQuality: 45

repos:

- repo: https://github.com/pre-commit/pre-commit-hooks

Feb 1, 2026

General

PromptBeginner5 minmarkdownQuality: 68

!# LLaMA Factory

Feb 1, 2026

General

PromptBeginner5 minmarkdownQuality: 68

!# LLaMA Factory

Feb 1, 2026

General

PromptBeginner5 minmarkdownQuality: 45

Byte-compiled / optimized / DLL files

pycache/

Feb 1, 2026

General

PromptBeginner5 minmarkdownQuality: 37

.vscode

.git

Feb 1, 2026

General

PromptBeginner5 minmarkdownQuality: 41

Auto detect text files and perform LF normalization

* text=auto

Feb 1, 2026

General

PromptBeginner5 minmarkdownQuality: 68

Tokenization

注：作为术语的“tokenization”在中文中尚无共识的概念对应，本文档采用英文表达以利说明。

Feb 1, 2026

General

PromptBeginner5 minmarkdownQuality: 68

Untitled Skill

中文&nbsp ｜ &nbspEnglish&nbsp ｜ &nbsp日本語｜ &nbspFrançais ｜ &nbspEspañol

Feb 1, 2026

General

PromptBeginner5 minmarkdownQuality: 63

Introducing Qwen-7B: Open foundation and human-aligned models (of the state-of-the-arts)

Large language models have recently attracted an extremely large amount of

Feb 1, 2026

General

PromptBeginner5 minmarkdownQuality: 68

Tokenization

Qwen-7B uses BPE tokenization on UTF-8 bytes using the tiktoken package.

Feb 1, 2026

General

PromptBeginner5 minmarkdownQuality: 62

トークン化

Qwen-7B は tiktoken パッケージを使用して、UTF-8 バイトを BPE トークン化します。

Feb 1, 2026

General

PromptBeginner5 minmarkdownQuality: 68

Untitled Skill

中文&nbsp ｜ &nbspEnglish&nbsp ｜ &nbsp日本語｜ &nbspFrançais ｜ &nbspEspañol

Feb 1, 2026

General

PromptBeginner5 minmarkdownQuality: 68

Untitled Skill

中文&nbsp ｜ &nbspEnglish&nbsp ｜ &nbsp日本語&nbsp ｜ &nbspFrançais ｜ &nbspEspañol

Feb 1, 2026

General

PromptBeginner5 minmarkdownQuality: 68

Untitled Skill

中文&nbsp ｜ &nbspEnglish&nbsp ｜ &nbsp日本語｜ &nbspFrançais ｜ &nbspEspañol

Feb 1, 2026

General

PromptBeginner5 minmarkdownQuality: 50

FAQ

Flash attention is an option for accelerating training and inference. Only NVIDIA GPUs of Turing, Ampere, Ada, and Hopper architecture, e.g., H100, A100, RTX 3090, T4, RTX 2080, can support flash atte...

Feb 1, 2026

General

PromptBeginner5 minmarkdownQuality: 68

Untitled Skill

中文&nbsp ｜ &nbspEnglish&nbsp ｜ &nbsp日本語｜ &nbspFrançais ｜ &nbspEspañol

Feb 1, 2026

General

PromptBeginner5 minmarkdownQuality: 50

FAQ

flash attention是一个用于加速模型训练推理的可选项，且仅适用于Turing、Ampere、Ada、Hopper架构的Nvidia GPU显卡（如H100、A100、RTX 3090、T4、RTX 2080），您可以在不安装flash attention的情况下正常使用模型进行推理。

Feb 1, 2026