Skip to content
This repository was archived by the owner on May 1, 2025. It is now read-only.

Commit 2a08824

Browse files
committed
Add RWKV models
1 parent c794017 commit 2a08824

File tree

2 files changed

+25
-0
lines changed

2 files changed

+25
-0
lines changed

README.md

Lines changed: 1 addition & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -71,6 +71,7 @@ The following table shows the supported models with sizes and the tasks that the
7171
| GPT-NeoX | 20B | Pretrained |
7272
| GPT-Neo | 1.3B | Pretrained |
7373
| GPT-J | 6B | Pretrained |
74+
| RWKV | 169M, 430M, 1.5B, 3B, 7B, 14B 14B | Pretrained |
7475
| Incoder | 6B | Pretrained |
7576
| CodeParrot | Small-python (110M), Small-multi(110M), 1.5B | Pretrained |
7677
| CodeBERT | CodeBERT-base, UnixCoder-base, CodeBERTa-small | Pretrained |

codetf/configs/inference/causal_lm.yaml

Lines changed: 24 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -68,4 +68,28 @@ causallm-codegen2-7B-pretrained:
6868
causallm-codegen2-16B-pretrained:
6969
huggingface_url: "Salesforce/codegen2-16B"
7070
tokenizer_url: "Salesforce/codegen2-16B"
71+
max_prediction_length: 512
72+
causallm-rwkv-169M-pretrained:
73+
huggingface_url: "RWKV/rwkv-4-169m-pile"
74+
tokenizer_url: "RWKV/rwkv-4-169m-pile"
75+
max_prediction_length: 512
76+
causallm-rwkv-430M-pretrained:
77+
huggingface_url: "RWKV/rwkv-4-430m-pile"
78+
tokenizer_url: "RWKV/rwkv-4-430m-pile"
79+
max_prediction_length: 512
80+
causallm-rwkv-1.5B-pretrained:
81+
huggingface_url: "RWKV/rwkv-raven-1b5"
82+
tokenizer_url: "RWKV/rwkv-raven-1b5"
83+
max_prediction_length: 512
84+
causallm-rwkv-3B-pretrained:
85+
huggingface_url: "RWKV/rwkv-raven-3b"
86+
tokenizer_url: "RWKV/rwkv-raven-3b"
87+
max_prediction_length: 512
88+
causallm-rwkv-7B-pretrained:
89+
huggingface_url: "RWKV/rwkv-raven-7b"
90+
tokenizer_url: "RWKV/rwkv-raven-7b"
91+
max_prediction_length: 512
92+
causallm-rwkv-14B-pretrained:
93+
huggingface_url: "RWKV/rwkv-raven-14b"
94+
tokenizer_url: "RWKV/rwkv-raven-14b"
7195
max_prediction_length: 512

0 commit comments

Comments
 (0)