Traitement du langage naturel (NLP)

Définition

Le NLP couvre les tâches sur le texte : classification, NER, QA, résumé, traduction et génération. Modern NLP is dominated by pretrained transformers (BERT, GPT, etc.) and LLMs.

Inputs are discrete (tokens); models learn from large corpora and are then adapted via fine-tuning or prompting. RAG and agents add récupération and tools on top of NLP models for grounded QA and task completion.

Comment ça fonctionne

Text is tokenized (divisé en sous-mots ou mots) and optionally normalized. The model (par ex. BERT, GPT) processes token IDs through embeddings and transformer layers to produce contextual representations. A task output head (par ex. classifier, span predictor, or next-token decoder) maps those to the final prediction. Models sont pré-entraînés sur large corpora (masked LM or prédiction du prochain token), then fine-tuned or prompted for downstream tasks. Pipelines often combine tokenization, embedding, and task-specific heads; LLMs can do many tasks with a single model and the right prompt.

Cas d'utilisation

NLP applies to any product or pipeline that needs to understand or generate text at scale.

Machine translation, summarization, and question answering
Named entity recognition, sentiment analysis, and text classification
Chatbots, code generation, and document understanding

Traitement du langage naturel (NLP)

Définition

Comment ça fonctionne

Cas d'utilisation

Documentation externe

Voir aussi

Définition​

Comment ça fonctionne​

Cas d'utilisation​

Documentation externe​

Voir aussi​

Définition

Comment ça fonctionne

Cas d'utilisation

Documentation externe

Voir aussi