[PROD] Ingest Appeal Docs #2007

szabozoltan69 · 2024-01-17T15:33:03Z

Issue

Recently the ingest_appeal_docs job does not run fine.
Without a header hack (of personal cookie data) the scraping of www.ifrc.org/appeals/ gives:
'reason': 'Forbidden',
'status': 403
(Also pip install brotlipy is needed for the successful decompression of the data.)

The bigger issue is that why is that failure invisible in cronjob items? Normally an erroneous run should be seen there and give a big warning message.

@batpad @thenav56

szabozoltan69 · 2024-01-18T09:32:03Z

We could use such API endpoints instead of scraping:
https://go-api.ifrc.org/api/PublicSiteAppeals?Appealnumber=MDRKZ012&Hidden=false

tovari · 2024-01-23T15:38:42Z

@arunissun, would you mind to compare staging and prod data of the 'appeal_document' endpoint?

szabozoltan69 · 2024-01-23T16:31:49Z

Some differences can be; prod and staging count:

postgres=> select count(*) from api_appealdocument;
 count 
-------
  8126

postgres=> select count(*) from api_appealdocument;
 count 
-------
  8133

arunissun · 2024-01-23T18:42:28Z

@tovari I will check the staging and prod data for appeal documents

tovari · 2024-01-26T09:34:30Z

Thanks @arunissun for the analysis. Looks like the scraper doesn't work well anymore on prod (there are missing docs from 2024).

szabozoltan69 self-assigned this Jan 17, 2024

szabozoltan69 added the Must Fix label Jan 17, 2024

szabozoltan69 mentioned this issue Jan 22, 2024

Add no-scraping appealdocument reader #2014

Merged

tovari closed this as completed Mar 4, 2024

nanometrenat mentioned this issue Jan 28, 2025

Can't easily differentiate between which appeal document is which type IFRCGo/go-web-app#1640

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[PROD] Ingest Appeal Docs #2007

[PROD] Ingest Appeal Docs #2007

szabozoltan69 commented Jan 17, 2024 •

edited

Loading

szabozoltan69 commented Jan 18, 2024

tovari commented Jan 23, 2024

szabozoltan69 commented Jan 23, 2024

arunissun commented Jan 23, 2024

tovari commented Jan 26, 2024

[PROD] Ingest Appeal Docs #2007

[PROD] Ingest Appeal Docs #2007

Comments

szabozoltan69 commented Jan 17, 2024 • edited Loading

Issue

szabozoltan69 commented Jan 18, 2024

tovari commented Jan 23, 2024

szabozoltan69 commented Jan 23, 2024

arunissun commented Jan 23, 2024

tovari commented Jan 26, 2024

szabozoltan69 commented Jan 17, 2024 •

edited

Loading