Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Scan task once halted never continues #468

Open
MarioSob opened this issue Feb 29, 2024 · 0 comments
Open

Scan task once halted never continues #468

MarioSob opened this issue Feb 29, 2024 · 0 comments
Labels
bug Something isn't working

Comments

@MarioSob
Copy link

Device Information (please complete the following information):

  • OS: Linux x86 64bit, Ubuntu Server, Docker
  • Deployment: Docker compose
  • SIST2 Version: 3.4.1
  • Elasticsearch Version (if relevant) : 7.17.18

Describe the bug
Scan never continues once previous scan has been interrupted by closing the program. Next scan instead of continuing on files that previous scan didn't managed to complete, only operates on newly detected files. Clicking on index doesn't help. Clicking full reindex helps although rescans all files over again.

Steps To Reproduce

  1. Run docker sist2, configure ES7 backed in sist-admin, add job with OCR of images and ebooks, add schedule, create frontend.
  2. Start indexing.
  3. In admin open tasks window to see scan happening. CPU usage is high and logs indicate OCR happening.
  4. After few files scanned and still many ahead shutdown docker gently.
  5. Start the application (container) again.
  6. In admin task is not running, it looks completed.
  7. Next auto-indexing doesn't reach for files that weren't scanned before. Frontend never lets you search those files content.

Expected behavior
Next scan task "sees" files that weren't scanned and OCR them. It should go back to first file that weren't scanned by the last scan task.

Actual Behavior
Next scan task ignores actual status of the files.

Additional context
Files were mixture of jpegs and pdfs (with text and sometimes also/only images inside). Around 100 files in total in directory and subdirectories.

@MarioSob MarioSob added the bug Something isn't working label Feb 29, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
bug Something isn't working
Projects
None yet
Development

No branches or pull requests

1 participant