Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Sampling from Large Graphs #45

Open
wagpa opened this issue Feb 22, 2023 · 1 comment
Open

Sampling from Large Graphs #45

wagpa opened this issue Feb 22, 2023 · 1 comment
Labels
source This issue contains or is a source used for the thesis

Comments

@wagpa
Copy link
Owner

wagpa commented Feb 22, 2023

https://cs.stanford.edu/people/jure/pubs/sampling-kdd06.pdf

@wagpa wagpa added the source This issue contains or is a source used for the thesis label Feb 22, 2023
@wagpa
Copy link
Owner Author

wagpa commented Feb 22, 2023

Notes

  • "Sampling strategies based on edge selection do not perform well; simple uniform random node selection performs surprisingly well. Overall, best performing methods are the ones based on random-walks and “forest fire”; they match very accurately both static as well as evolutionary graph patterns, with sample sizes down to about 15% of the original graph"
  • Scale-down goal, and Back-in-time goal
  • "One could also use ideas from graph clustering and graph partitioning. These algorithms are usually computationally intensive and do not scale well to very large graphs. Graph sampling becomes important with massive graphs, where real-world algorithms become too expensive and one has to reside to sampling. Thus simplicity of sampling is essential."

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
source This issue contains or is a source used for the thesis
Projects
None yet
Development

No branches or pull requests

1 participant