Matcher Example - Spacy Typing CST Test
Matcher Example: spaCy Code
Matches specific token patterns in text using the spaCy Matcher. The pattern here matches any single token whose lowercase form is "nlp".
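import spacy
from spacy.matcher import Matcher

# Load the small English pipeline and process a sample sentence
nlp = spacy.load('en_core_web_sm')
doc = nlp('I love NLP and machine learning')

# Create a matcher that shares the pipeline's vocabulary
matcher = Matcher(nlp.vocab)

# Match any single token whose lowercase form is "nlp"
pattern = [{'LOWER': 'nlp'}]
matcher.add('NLP_PATTERN', [pattern])

# Each match is a (match_id, start, end) tuple of token indices
matches = matcher(doc)
for match_id, start, end in matches:
    print(doc[start:end].text)  # prints "NLP"

spaCy Language Guide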
spaCy is an open-source Python library for advanced natural language processing (NLP). It provides efficient tools for text parsing, tokenization, named entity recognition, part-of-speech tagging, and integration with machine learning workflows.
Primary Use Cases
- Tokenization, lemmatization, and text normalization
- Named entity recognition (NER) and part-of-speech tagging
- Dependency parsing and syntactic analysis
- Text classification and sentiment analysis
- Integration with machine learning pipelines for NLP tasks
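The first use case above, tokenization and text normalization, can be sketched without a trained model. This is a minimal sketch, assuming spacy.blank("en") (a tokenizer-only English pipeline) is enough for the task; lemmatization, tagging, NER, and parsing need a trained pipeline such as en_core_web_sm, which is not loaded here. The sample sentence and the names tokens and normalized are illustrative.

```python
import spacy

# Tokenizer-only English pipeline; no trained model is required.
nlp = spacy.blank("en")
doc = nlp("I love NLP and machine learning!")

# Split the text into tokens.
tokens = [token.text for token in doc]

# Simple normalization: lowercase each token, then drop
# punctuation and stop words using lexical attributes.
normalized = [
    token.lower_
    for token in doc
    if not token.is_punct and not token.is_stop
]

print(tokens)
print(normalized)
```

Because only lexical attributes are used, this runs quickly and has no model dependency. The stop-word list is the one that ships with spaCy's English language data.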
Notable Features
- Industrial-strength performance and speed
- Pre-trained models for multiple languages
- Rule-based matching and custom pipelines
- Integration with deep learning frameworks
- Extensible with custom components and vectors
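Rule-based matching and custom pipeline components from the list above can be combined in one pipeline. The following is a minimal sketch, assuming the spaCy v3 API (string-named components passed to nlp.add_pipe). The component name token_counter, the TOPIC label, and the n_tokens key are hypothetical names chosen for illustration.

```python
import spacy
from spacy.language import Language


# Hypothetical custom component: stores the token count on the Doc.
@Language.component("token_counter")
def token_counter(doc):
    doc.user_data["n_tokens"] = len(doc)
    return doc


# Start from a blank English pipeline (tokenizer only).
nlp = spacy.blank("en")

# Built-in rule-based entity matcher; patterns use the same
# token-attribute syntax as the Matcher.
ruler = nlp.add_pipe("entity_ruler")
ruler.add_patterns([
    {"label": "TOPIC", "pattern": [{"LOWER": "machine"}, {"LOWER": "learning"}]},
    {"label": "TOPIC", "pattern": [{"LOWER": "nlp"}]},
])

# Append the custom component after the entity ruler.
nlp.add_pipe("token_counter", last=True)

doc = nlp("I love NLP and machine learning")
entities = [(ent.text, ent.label_) for ent in doc.ents]

print(nlp.pipe_names)             # ['entity_ruler', 'token_counter']
print(entities)                   # [('NLP', 'TOPIC'), ('machine learning', 'TOPIC')]
print(doc.user_data["n_tokens"])  # 6
```

Components run in the order shown by nlp.pipe_names, so the custom step sees the entities the ruler has already set on the Doc.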
Origin & Creator
spaCy was created by Matthew Honnibal and Ines Montani in 2015, aiming to provide industrial-strength NLP in Python with speed and accuracy.
Industrial Note
spaCy is widely used in chatbots, text analytics, sentiment analysis, information extraction, recommendation systems, and any application that requires structured NLP pipelines.