Fast Inference Methods in Natural Language Processing
le 5 novembre 2025
13h15Campus de Beaulieu Campus de Beaulieu Amphi P - bât. 12D
Intervention de Caio Corro enseignant-chercheur à l'INSA Rennes et rattaché au laboratoire IRISA, membre de l'équipe LinkMedia, dans le cadre des séminaires du département Informatique.
Résumé :
Modern natural language processing models are base on very large neural networks, meaning that inference is usually slow, even using modern GPUs.
In this talk, I will quickly overview several research topics that aim to leverage the architecture of modern GPUs to develop fast inference methods.
I will then quickly present two of my recent works on the topic.
References :
- Bregman Conditional Random Fields: Sequence Labeling with Parallelizable Inference Algorithms (Caio Corro, Mathieu Lacroix, Joseph Le Roux) https://arxiv.org/abs/2506.00732
- KAD: A Framework for Proxy-based Test-time Alignment with Knapsack Approximation Deferral (Ayoub Hammal, Pierre Zweigenbaum, Caio Corro)
- Thématique(s)
- Formation, Recherche - Valorisation
- Contact
Mise à jour le 17 novembre 2025