Skip to content

Commit f3ce8ca

Browse files
authored
fix: ensure the decoded text from the document is utf-8 compat (#223)
resolves #222 --------- Signed-off-by: Anupam Kumar <[email protected]>
1 parent e31bfc0 commit f3ce8ca

File tree

2 files changed

+2
-2
lines changed

2 files changed

+2
-2
lines changed

.github/workflows/integration-test.yml

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -61,7 +61,7 @@ jobs:
6161
# do not stop on another job's failure
6262
fail-fast: false
6363
matrix:
64-
php-versions: [ '8.1' ]
64+
php-versions: [ '8.2' ]
6565
databases: [ 'pgsql' ]
6666
server-versions: [ 'stable32', 'master' ]
6767

context_chat_backend/chain/ingest/doc_loader.py

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -129,7 +129,7 @@ def decode_source(source: UploadFile) -> str | None:
129129
if _loader_map.get(mimetype):
130130
result = _loader_map[mimetype](source.file)
131131
source.file.close()
132-
return result
132+
return result.encode('utf-8', 'ignore').decode('utf-8', 'ignore')
133133

134134
result = source.file.read().decode('utf-8', 'ignore')
135135
source.file.close()

0 commit comments

Comments
 (0)