You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Device Information (please complete the following information):
OS: Linux x86 64bit, Ubuntu Server, Docker
Deployment: Docker compose
SIST2 Version: 3.4.1
Elasticsearch Version (if relevant) : 7.17.18
Describe the bug
Scan never continues once previous scan has been interrupted by closing the program. Next scan instead of continuing on files that previous scan didn't managed to complete, only operates on newly detected files. Clicking on index doesn't help. Clicking full reindex helps although rescans all files over again.
Steps To Reproduce
Run docker sist2, configure ES7 backed in sist-admin, add job with OCR of images and ebooks, add schedule, create frontend.
Start indexing.
In admin open tasks window to see scan happening. CPU usage is high and logs indicate OCR happening.
After few files scanned and still many ahead shutdown docker gently.
Start the application (container) again.
In admin task is not running, it looks completed.
Next auto-indexing doesn't reach for files that weren't scanned before. Frontend never lets you search those files content.
Expected behavior
Next scan task "sees" files that weren't scanned and OCR them. It should go back to first file that weren't scanned by the last scan task.
Actual Behavior
Next scan task ignores actual status of the files.
Additional context
Files were mixture of jpegs and pdfs (with text and sometimes also/only images inside). Around 100 files in total in directory and subdirectories.
The text was updated successfully, but these errors were encountered:
Device Information (please complete the following information):
Linux x86 64bit, Ubuntu Server, Docker
Docker compose
3.4.1
7.17.18
Describe the bug
Scan never continues once previous scan has been interrupted by closing the program. Next scan instead of continuing on files that previous scan didn't managed to complete, only operates on newly detected files. Clicking on index doesn't help. Clicking full reindex helps although rescans all files over again.
Steps To Reproduce
Expected behavior
Next scan task "sees" files that weren't scanned and OCR them. It should go back to first file that weren't scanned by the last scan task.
Actual Behavior
Next scan task ignores actual status of the files.
Additional context
Files were mixture of jpegs and pdfs (with text and sometimes also/only images inside). Around 100 files in total in directory and subdirectories.
The text was updated successfully, but these errors were encountered: