
PushShift Bug : Not all PushShift shards are active #149

Open
OdelyaBeker opened this issue Jun 13, 2024 · 1 comment

Comments

@OdelyaBeker

Every time I call PushshiftAPI(), I get the following error: pmaw.PushshiftAPIBase: Not all PushShift shards are active. Query results may be incomplete.
Do you know what I can do about it?

@AnjaliTajane

This message means that some Pushshift API shards are down or inactive, so the results returned may be incomplete. This can happen during maintenance or under high server traffic.

The issue may be temporary, so the simplest approach is to wait and try again later. If you are querying a large dataset, consider breaking your queries into smaller time periods or more specific subsets of data, which reduces the load per request and may avoid the inactive shards. You can also add a retry mechanism to your code that repeats the query after a short delay when the error occurs, which helps if the problem is intermittent.

Also check the Pushshift status page for updates on any ongoing outages. If the issue persists, consider alternatives such as Reddit's official API or other third-party libraries for retrieving Reddit data.
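As a rough illustration of the two code-level suggestions above (smaller time windows and retrying after a delay), here is a minimal sketch in plain Python. It does not call pmaw itself: `split_time_range` and `retry_with_backoff` are hypothetical helper names, and the sketch assumes your own query function raises an exception when a request fails. How the shard message surfaces in your setup (warning, log line, or exception) is something you would need to check and adapt.

```python
import time
from typing import Callable, Iterator, Tuple, TypeVar

T = TypeVar("T")


def split_time_range(start: int, end: int, step: int) -> Iterator[Tuple[int, int]]:
    """Yield half-open (since, until) epoch-second windows covering [start, end)."""
    if step <= 0:
        raise ValueError("step must be positive")
    since = start
    while since < end:
        until = min(since + step, end)
        yield since, until
        since = until


def retry_with_backoff(
    fetch: Callable[[], T],
    attempts: int = 3,
    delay: float = 1.0,
    sleep: Callable[[float], None] = time.sleep,
) -> T:
    """Call fetch() up to `attempts` times, doubling the delay after each failure.

    Assumption: fetch() raises an exception when the query fails.
    The last failure is re-raised once all attempts are used up.
    """
    if attempts < 1:
        raise ValueError("attempts must be at least 1")
    for attempt in range(1, attempts + 1):
        try:
            return fetch()
        except Exception:
            if attempt == attempts:
                raise
            sleep(delay)
            delay *= 2
    raise AssertionError("unreachable")
```

To use it, you would loop over `split_time_range(start, end, step)` and wrap your own per-window query (for example, a function that calls the API with the window's `since` and `until` values) in `retry_with_backoff`. The `sleep` parameter exists only so the delay can be replaced in tests.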
