A neural network architecture that revolutionized natural language processing by using attention mechanisms to process sequential data more effectively than earlier recurrent approaches such as RNNs and LSTMs. Transformers form the foundation of large language models such as GPT and BERT. Because they process all positions of a sequence in parallel rather than step by step, they train more efficiently and are better at capturing long-range dependencies in the data.
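The attention mechanism at the heart of a transformer can be sketched in a few lines. The following is a minimal NumPy illustration of scaled dot-product attention, not a full transformer: in a real model, the query, key, and value matrices come from learned linear projections of the input, whereas here the raw embeddings stand in for all three, and the token count and embedding size are arbitrary illustrative choices.

```python
import numpy as np

def scaled_dot_product_attention(Q, K, V):
    """Compute softmax(Q K^T / sqrt(d_k)) V over a sequence."""
    d_k = Q.shape[-1]
    scores = Q @ K.T / np.sqrt(d_k)                  # pairwise similarities, shape (seq, seq)
    scores -= scores.max(axis=-1, keepdims=True)     # numerical stability
    weights = np.exp(scores)
    weights /= weights.sum(axis=-1, keepdims=True)   # softmax: rows sum to 1
    return weights @ V, weights

# Toy sequence: 4 tokens with 8-dimensional embeddings (illustrative sizes)
rng = np.random.default_rng(0)
x = rng.normal(size=(4, 8))

# A real transformer would use learned projections W_q @ x, W_k @ x, W_v @ x;
# using x directly keeps the sketch self-contained.
out, weights = scaled_dot_product_attention(x, x, x)
print(out.shape)  # each token's output mixes information from all 4 tokens
```

Note that every token attends to every other token in a single matrix multiplication; this is what lets transformers process the whole sequence at once instead of iterating through it position by position.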
