File tree 1 file changed +7
-10
lines changed
1 file changed +7
-10
lines changed Original file line number Diff line number Diff line change @@ -141,6 +141,8 @@ python gpt\show_attention.py
141
141
## gpt character numpy
142
142
in directory ` gpt_character `
143
143
144
+ just 100 steps training, it will merge.
145
+
144
146
** Train and predict**
145
147
``` Bash []
146
148
python gpt_character\g pt_train_english_char.py
@@ -149,21 +151,16 @@ python gpt_character\gpt_charpredict.py
149
151
150
152
** Result**
151
153
```
152
- 'm: nopqrstuv',
153
- 'p: qrstuvwxy',
154
- 'w: xyz abcde',
155
- 'w: xyz abcde',
156
- 'x: yz abcdef',
157
- 'f: ghijklmno',
158
- 't: uvwxyz ab',
159
- 'p: qrstuvwxy',
160
- 'y: z abcdefg',
161
- 'w: xyz abcde'
154
+ 'cdefg: cdefghijklmnopqrstuvwxyzabcdefghijk',
162
155
```
163
156
164
157
## blogs
165
158
[ numpy实现VIT vision transformer在MNIST-https://zhuanlan.zhihu.com/p/645326689 ] ( https://zhuanlan.zhihu.com/p/645326689 ) <br >
166
159
160
+ [ transformer网络内attention使用的multi-head] ( https://zhuanlan.zhihu.com/p/695771281 )
161
+
162
+ [ transformer网络内attention使用的mask] ( https://zhuanlan.zhihu.com/p/645442074? )
163
+
167
164
168
165
总共实现了这几个层:
169
166
You can’t perform that action at this time.
0 commit comments