v0.17.2
Changelog
Added
- Handling intra-word linebreak as pollution : adds a pollution pattern that detects intra-word linebreak, which can then be removed in the
get_textmethod - Qualifiers can process
SpanorDoc: this feature especially makes it easier to nest qualifiers components in other components - New label_weights parameter in eds.span_classifier`, which allows the user to set per label-value loss weights during training
- New
edsnlp.data.converters.MarkupToDocConverterto convert Markdown or XML-like markup to documents, which is particularly useful to create annotated documents from scratch (e.g., for testing purposes). - New Metrics documentation page to document the available metrics and how to use them.
Fixed
- Various disorders/behaviors patches
Changed
- Deduplicate spans between doc.ents and doc.spans during train: previously, a
span_getterrequesting entities from bothentsandspanscould yield duplicates.
Pull Requests
- feat: Various patches by @Thomzoy in #391
- Metrics doc by @percevalw in #417
- chore: bump version to 0.17.2 by @percevalw in #424
Full Changelog: v0.17.1...v0.17.2