Abstract :
[en] The human brain’s remarkable ability to focus on relevant information amidst a sea of sensory inputs has inspired a new wave of research in artificial intelligence. In this chapter, we delve into the computational modeling of attention in machine learning, where neural networks are trained to selectively pay attention to specific parts of their input to produce accurate outputs. From natural language processing to computer vision, from recommender systems to market forecasting, attention-based models have achieved state-of-the-art performance on a wide range of tasks. We will guide readers through the history and evolution of attention in machine learning, from its early implementations to recent breakthroughs with the Transformer architecture. Through a step-by-step introduction to neural networks and sequence learning, we will explain the motivation behind computational attention, explore its implementations, and provide a comparison with human attention. With this overview of the attention landscape in machine learning, readers will gain insight into how this computational concept has transformed AI research.
Scopus citations®
without self-citations
0