Commit 53ec629

character
1 parent 0cf4abf commit 53ec629

1 file changed, +7 -10 lines changed

README.md

@@ -141,6 +141,8 @@ python gpt\show_attention.py
 ## gpt character numpy
 in directory `gpt_character`
 
+just 100 steps training, it will merge.
+
 **Train and predict**
 ```Bash []
 python gpt_character\gpt_train_english_char.py
@@ -149,21 +151,16 @@ python gpt_character\gpt_charpredict.py
 
 **Result**
 ```
-'m: nopqrstuv',
-'p: qrstuvwxy',
-'w: xyz abcde',
-'w: xyz abcde',
-'x: yz abcdef',
-'f: ghijklmno',
-'t: uvwxyz ab',
-'p: qrstuvwxy',
-'y: z abcdefg',
-'w: xyz abcde'
+'cdefg: cdefghijklmnopqrstuvwxyzabcdefghijk',
 ```
 
 ## blogs
 [ViT (vision transformer) implemented in NumPy on MNIST-https://zhuanlan.zhihu.com/p/645326689](https://zhuanlan.zhihu.com/p/645326689)<br>
 
+[Multi-head attention as used inside transformer networks](https://zhuanlan.zhihu.com/p/695771281)
+
+[The mask used by attention inside transformer networks](https://zhuanlan.zhihu.com/p/645442074?)
+
 
 The following layers are implemented in total:
 
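To read the new **Result** line: the text before the colon appears to be the prompt, and the text after it the model's output, which repeats the prompt and then continues the lowercase alphabet, wrapping from `z` back to `a`. A minimal sketch of that expected pattern, assuming a 26-letter cyclic vocabulary with no space character. `expected_output` is a hypothetical helper for illustration, not a function from the repository.

```python
import string

# Assumption: the vocabulary is 'a'..'z' only and wraps around after 'z'.
ALPHABET = string.ascii_lowercase


def expected_output(prompt: str, n_new: int) -> str:
    """Return the prompt followed by the next n_new letters of the cyclic alphabet."""
    if not prompt or prompt[-1] not in ALPHABET:
        raise ValueError("prompt must end with a lowercase ASCII letter")
    # Index of the letter that should follow the last prompt character.
    start = ALPHABET.index(prompt[-1]) + 1
    continuation = "".join(
        ALPHABET[(start + i) % len(ALPHABET)] for i in range(n_new)
    )
    return prompt + continuation


if __name__ == "__main__":
    # A 5-letter prompt plus 30 further letters gives the 35-character
    # string shown after the colon in the README result.
    print(expected_output("cdefg", 30))
```

Comparing a real prediction against such a reference string is one simple way to check whether the model has learned the sequence after the 100 training steps mentioned in the README.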

0 commit comments
