Skip to content

Navigation Menu

Sign in
Appearance settings

Search code, repositories, users, issues, pull requests...

Provide feedback

We read every piece of feedback, and take your input very seriously.

Saved searches

Use saved searches to filter your results more quickly

Appearance settings
Discussion options

Our org has a requirement for detecting spanish dates that are not in perfect format like dd/mm/yyyy (Predefined DateRecognizer is more then enough for that). But stuff like - "El acontecimiento histórico tuvo lugar el 23 de abril de 2021.". The english translation of it ("The historical event took place on April 23, 2021.") works quite well with en-core-web-lg, spacy identifies it well.

For spanish I am using es-core-news-lg version 3.8.0, but no luck with any kind of dates with months like the one above. Is the model not trained on such date datasets?

The following is my setup -

import spacy

nlp = spacy.load("es_core_news_lg")

doc = nlp("El acontecimiento histórico tuvo lugar el 23 de abril de 2021.")


print(doc.ents)
for ent in doc.ents:
    print(ent.text, ent.label_)
You must be logged in to vote

Replies: 0 comments

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
1 participant
Morty Proxy This is a proxified and sanitized view of the page, visit original site.