Issues with search_comments and search_submission_comment_ids #44

Open
Jhann024 opened this issue Apr 1, 2022 · 3 comments

Comments

@Jhann024

Jhann024 commented Apr 1, 2022

I've been working on a project that gathers large amounts of comment data, and I have run into two issues, one with each of these functions. The first is with search_submission_comment_ids: when I look up the comment ids for a post, it returns zero comments for any post made after November 26th, 2021, and intermittently for posts before that date (although I haven't tested earlier dates extensively).

To work around this, I check whether search_submission_comment_ids returns any comment data and, if it does not, fall back to search_comments. This did work, and I was able to find comment data for posts after November 26th, 2021, but every API request made through search_comments gave the warning "Not all Pushshift shards are active. Query Results may be incomplete.". When I investigated with api.metadata_.get('Shards'), I got results such as:

{'failed': 0, 'skipped': 0, 'successful': 67, 'total': 74}
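To make the gap in that output explicit, here is a minimal sketch of the arithmetic. The helper name `unaccounted_shards` is hypothetical and not part of the library; it only assumes the dictionary has the four keys shown above.

```python
def unaccounted_shards(shards):
    """Return how many shards are counted in 'total' but are not
    reported as successful, failed, or skipped."""
    reported = shards["successful"] + shards["failed"] + shards["skipped"]
    return shards["total"] - reported


# The metadata copied from the output above.
shards = {'failed': 0, 'skipped': 0, 'successful': 67, 'total': 74}

missing = unaccounted_shards(shards)
print(f"{missing} of {shards['total']} shards unaccounted for")
# prints "7 of 74 shards unaccounted for"
```

So 7 of the 74 shards do not show up in any of the three status counts, which is presumably what triggers the warning.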

If anybody has any idea why either of these issues is occurring, or why the shard metadata shows the missing shards as neither failed nor skipped, and is willing to share, I'd greatly appreciate it.
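For anyone wanting to reproduce the workaround, this is roughly the fallback logic I described, as a sketch only. `fetch_with_fallback` and the two stub functions are hypothetical names used for illustration; in real use the two callables would wrap search_submission_comment_ids and search_comments, and the exact arguments passed to those are left out here because they depend on your setup.

```python
def fetch_with_fallback(post_id, primary, fallback):
    """Try the primary lookup first; if it returns nothing,
    use the fallback lookup instead.

    Returns a (results, source) tuple so the caller can tell
    which lookup actually produced the data.
    """
    results = list(primary(post_id))
    if results:
        return results, "primary"
    return list(fallback(post_id)), "fallback"


# Stand-ins for the real API calls (assumptions, no network access).
def empty_id_lookup(post_id):
    # Mimics the comment-id lookup returning nothing for a recent post.
    return []


def stub_comment_search(post_id):
    # Mimics a comment search that does find data for the post.
    return [f"{post_id}_c1", f"{post_id}_c2"]


comments, source = fetch_with_fallback("abc123", empty_id_lookup, stub_comment_search)
print(source, comments)
# prints "fallback ['abc123_c1', 'abc123_c2']"
```

Note that the two real functions return different things (comment ids versus comment data), so tracking the source matters if later code treats the results differently.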

@shaunmb98

Having the same issue with comments after November 26th. Everything before that works fine; everything after is blank using these methods.

@spooky-ape-butt-42069

Same. Gathering comment ids coming up empty when they really shouldn't be coming up empty.

@reinoldus

It seems to be an API problem: pushshift/api#92
