@clokep clokep commented Dec 11, 2025

Use beautifulsoup4 instead of lxml for URL previews. This offers some nicer APIs for parsing HTML and avoids using libxml2, which is unmaintained.

I haven’t done a full regression against commonly previewed sites, but I expect this will give similar (or better) results.

beautifulsoup4 also handles decoding the charset for us, which means less custom code.
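
For reviewers unfamiliar with the library, here is a rough sketch of the kind of parsing this enables. It is not the code from this PR: `extract_open_graph` is a hypothetical helper, the choice of the stdlib `html.parser` backend is an assumption, and reading Open Graph `<meta>` tags is only an illustrative use case. The point is that raw bytes go straight into `BeautifulSoup`, which picks up the charset declared in the document.

```python
from bs4 import BeautifulSoup


def extract_open_graph(body: bytes) -> dict[str, str]:
    """Hypothetical helper: collect og:* <meta> properties from raw HTML bytes."""
    # Passing bytes lets BeautifulSoup detect the encoding itself (for example
    # from a <meta charset=...> declaration), so no manual decode step is needed.
    # "html.parser" is the stdlib-backed parser; it does not require lxml.
    soup = BeautifulSoup(body, "html.parser")

    og: dict[str, str] = {}
    for tag in soup.find_all("meta", attrs={"property": True}):
        prop = tag["property"]
        content = tag.get("content")
        # Keep only Open Graph properties that actually carry a value.
        if prop.startswith("og:") and content:
            og[prop] = content
    return og


if __name__ == "__main__":
    # Latin-1 encoded page: b"\xe9" is "é" in that charset.
    page = (
        b'<html><head><meta charset="iso-8859-1">'
        b'<meta property="og:title" content="Caf\xe9">'
        b"</head></html>"
    )
    print(extract_open_graph(page))
```

If the server sends a charset in its `Content-Type` header, it could be passed via the `from_encoding` argument of `BeautifulSoup` as a hint; whether this PR does so is not stated here.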

@clokep clokep requested a review from a team as a code owner December 11, 2025 13:42