
v0.17.2


@percevalw percevalw released this 25 Jun 18:19
· 49 commits to master since this release

Changelog

Added

  • Handle intra-word line breaks as pollution: a new pollution pattern detects line breaks that fall inside a word, so that they can be removed by the get_text method
  • Qualifiers can process either a Span or a Doc: this makes it easier to nest qualifier components inside other components
  • New label_weights parameter in eds.span_classifier, which lets the user set per-label-value loss weights during training
  • New edsnlp.data.converters.MarkupToDocConverter to convert Markdown or XML-like markup to documents, which is particularly useful to create annotated documents from scratch (e.g., for testing purposes).
  • New Metrics documentation page to document the available metrics and how to use them.
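
To illustrate the intra-word line break item above, here is a minimal standalone sketch of the idea. It does not use edsnlp and is not the pattern shipped in the library: the regular expression and the names INTRA_WORD_LINEBREAK, find_pollution and clean_text are hypothetical, chosen only for this example. The heuristic assumed here treats a line break sitting directly between two letters as pollution, which also merges two distinct words separated only by a line break, so a real pattern would likely need to be stricter.

```python
import re

# Hypothetical pattern (not the one used by edsnlp): a line break that sits
# directly between two letters, with no space or punctuation around it.
INTRA_WORD_LINEBREAK = re.compile(r"(?<=[^\W\d_])\n(?=[^\W\d_])")


def find_pollution(text):
    """Return the (start, end) character offsets of each suspicious line break."""
    return [(m.start(), m.end()) for m in INTRA_WORD_LINEBREAK.finditer(text)]


def clean_text(text):
    """Remove the detected line breaks so that the split word is joined again."""
    return INTRA_WORD_LINEBREAK.sub("", text)


text = "Le patient est hospi\ntalisé depuis hier.\nPas de fièvre."
offsets = find_pollution(text)
cleaned = clean_text(text)
print(offsets)  # [(20, 21)]
print(cleaned)
```

Only the break inside "hospi\ntalisé" is flagged; the one after "hier." is preceded by a period and is left untouched.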

Fixed

  • Various disorders/behaviors patches

Changed

  • Deduplicate spans between doc.ents and doc.spans during training: previously, a span_getter requesting entities from both ents and spans could yield duplicates.
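
The deduplication described above can be pictured with a small standalone sketch. It does not use edsnlp or spaCy objects: SpanKey and dedup_spans are hypothetical names, and the assumption that two spans are duplicates when they share the same start, end and label is ours, not something stated in the changelog.

```python
from typing import Iterable, List, NamedTuple


class SpanKey(NamedTuple):
    """Hypothetical stand-in for a span: token offsets plus a label."""
    start: int
    end: int
    label: str


def dedup_spans(*groups: Iterable[SpanKey]) -> List[SpanKey]:
    """Merge several span collections, keeping the first occurrence of each span."""
    seen = set()
    merged = []
    for group in groups:
        for span in group:
            if span not in seen:  # assumed identity: same start, end and label
                seen.add(span)
                merged.append(span)
    return merged


# The same "disease" span is present both in the entities and in a span group.
ents = [SpanKey(0, 2, "disease"), SpanKey(5, 6, "drug")]
span_group = [SpanKey(0, 2, "disease"), SpanKey(8, 9, "date")]

merged = dedup_spans(ents, span_group)
print(len(merged))  # 3
```

Without the `seen` check, the shared span would be yielded twice and counted twice during training.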


Full Changelog: v0.17.1...v0.17.2