We break the sentence down into words
I, want, a, glass, of, orange
take the one-hot o vectors (subscript = word's index in the vocabulary)
o4343, o9665, o1, o3852, o6163, o6257
multiply each by the embedding matrix E and get 300D e vectors
e4343, e9665, e1, e3852, e6163, e6257
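
A minimal sketch of the o -> e step, to show that multiplying E by a one-hot vector just picks out one column of E. Assumptions not in the notes: the vocabulary has 10,000 words (taken from the 10k softmax outputs), E is stored as 300 x 10,000, the word indices are used directly as 0-based array positions, and E is filled with random numbers as a stand-in for learned values.

```python
import numpy as np

VOCAB_SIZE = 10_000  # assumption: same as the 10k softmax outputs
EMBED_DIM = 300      # "300D e vectors" from the notes

rng = np.random.default_rng(0)
# Random stand-in for a learned embedding matrix (assumption).
E = rng.standard_normal((EMBED_DIM, VOCAB_SIZE))

def one_hot(index, size=VOCAB_SIZE):
    """Return a vector of zeros with a single 1 at position `index`."""
    o = np.zeros(size)
    o[index] = 1.0
    return o

# I, want, a, glass, of, orange
indices = [4343, 9665, 1, 3852, 6163, 6257]

# e_j = E @ o_j, which equals column j of E.
e_vectors = [E @ one_hot(j) for j in indices]
```

In practice the matrix product is usually skipped and the column is read directly (E[:, j]), since the one-hot vector is almost entirely zeros.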
feed all of them into NN layer W1,b1
which feeds into softmax W2,b2
softmax classifies among 10k possible outputs
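
The whole pipeline above can be sketched as one forward pass. This is an illustration under assumptions, not the exact network from the notes: the six e vectors are concatenated into one 1800D input, the hidden layer has 128 units with tanh (both hypothetical choices), and all weights are random stand-ins for learned parameters.

```python
import numpy as np

VOCAB_SIZE = 10_000  # 10k possible outputs
EMBED_DIM = 300      # 300D e vectors
CONTEXT = 6          # six input words
HIDDEN = 128         # assumption: hidden size is not given in the notes

rng = np.random.default_rng(0)
# Random stand-ins for learned parameters (assumption).
E = rng.standard_normal((EMBED_DIM, VOCAB_SIZE)) * 0.01
W1 = rng.standard_normal((HIDDEN, CONTEXT * EMBED_DIM)) * 0.01
b1 = np.zeros(HIDDEN)
W2 = rng.standard_normal((VOCAB_SIZE, HIDDEN)) * 0.01
b2 = np.zeros(VOCAB_SIZE)

def softmax(z):
    """Numerically stable softmax over a 1-D vector."""
    z = z - z.max()
    exp_z = np.exp(z)
    return exp_z / exp_z.sum()

def forward(indices):
    """Word indices -> probability distribution over the vocabulary."""
    # Look up each e vector (column of E) and stack them: shape (1800,).
    x = np.concatenate([E[:, j] for j in indices])
    # NN layer W1, b1 (tanh activation is an assumption).
    h = np.tanh(W1 @ x + b1)
    # Softmax layer W2, b2 over the 10k outputs.
    return softmax(W2 @ h + b2)

# I, want, a, glass, of, orange
probs = forward([4343, 9665, 1, 3852, 6163, 6257])
```

probs has one entry per possible output and sums to 1; the index of the largest entry is the class the softmax picks.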