



[1]VASWANI A, SHAZEER N, PARMAR N, et al. Attention is all you need [J]. arXiv, 2023-8-2 (2023-10-27).
[2]MCCULLOCH W S, PITTS W. A logical calculus of ideas immanent in nervous activity [J]. Bulletin of Mathematical Biophysics, 1943, 5 (4): 127-147.
[3]ROSENBLATT F. The perceptron: A probabilistic model for information storage and organization in the brain [J]. Psychological Review, 1958, 65 (6): 386-408.
[4]RUMELHART D E, HINTON G E, WILLIAMS R J. Learning representations by back-propagating errors [J]. Nature, 1986, 323 (6088): 533-536.
[5]KRIZHEVSKY A, SUTSKEVER I, HINTON G E. ImageNet classification with deep convolutional neural networks [J]. NIPS, 2012 (1): 1097-1105.