A Python package for scraping Twitter's front end with Selenium.
This code was developed to extract information from Twitter without using the API, because the official Twitter API has usage limits and costs. You can search by keyword and time frame (in minutes), and there is no limit on the number of tweets you can extract.
- Python 3.6+
- Browser: Firefox
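As a quick way to confirm the two requirements above before installing, here is a minimal sketch. The helper name `check_requirements` is hypothetical and not part of this package. It assumes Firefox is exposed on the `PATH` as `firefox`, which is often not the case on Windows or macOS, so a `False` result for the browser does not prove Firefox is missing.

```python
import shutil
import sys


def check_requirements(min_python=(3, 6), browser="firefox"):
    """Report whether each prerequisite appears to be met (hypothetical helper)."""
    return {
        # True when the running interpreter is at least min_python.
        "python": sys.version_info >= min_python,
        # True only when an executable with this name is found on PATH.
        "browser": shutil.which(browser) is not None,
    }


status = check_requirements()
print(status)
```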
You can install from the source code using
git clone https://github.com/HamedMinaeizaeim/twitter_scraper_without_API.git
and then run
python setup.py install
or, to install the dependencies listed in requirements.txt, run
pip install -r requirements.txt
Alternatively, you can install from PyPI:
pip install twitter_scraper_without_API
To use this library, import the TwitterScraper class and specify your search keyword. By default, it returns all tweets from the last minute. You can change this to extract tweets from the last n minutes. The following code does that:
from src.twitter_scraper_without_api import TwitterScraper
twitter = TwitterScraper('bitcoin')
twitter.last_n_mins = 3
twitter.fetch_data()
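To make the idea of a "last n minutes" window concrete, here is a minimal sketch of the filtering logic in plain Python. It is not the library's implementation: the function `within_last_n_minutes` and the record layout (a `created_at` timestamp and a `text` field) are assumptions made purely for illustration.

```python
from datetime import datetime, timedelta, timezone


def within_last_n_minutes(tweets, n_minutes, now=None):
    """Keep records whose 'created_at' falls inside the last n_minutes."""
    now = now or datetime.now(timezone.utc)
    cutoff = now - timedelta(minutes=n_minutes)
    return [t for t in tweets if cutoff <= t["created_at"] <= now]


# Hypothetical records with a fixed reference time so the result is repeatable.
now = datetime(2022, 1, 1, 12, 0, tzinfo=timezone.utc)
sample = [
    {"text": "recent", "created_at": now - timedelta(minutes=1)},
    {"text": "older", "created_at": now - timedelta(minutes=10)},
]

# With a 3-minute window only the first record is kept.
recent = within_last_n_minutes(sample, n_minutes=3, now=now)
print([t["text"] for t in recent])
```

Passing `now` explicitly keeps the sketch deterministic; in real use the current time would be the natural default.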
You can export the data as JSON, a pandas DataFrame, or CSV:
df = twitter.store_data('dataFrame')
# Names chosen to avoid shadowing Python's built-in csv and json modules
csv_data = twitter.store_data('csv')
json_data = twitter.store_data('json')
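For readers unfamiliar with these formats, the sketch below shows what a JSON and a CSV rendering of the same records could look like, using only the standard library. The record fields (`user`, `text`) and the helpers `to_json` and `to_csv` are hypothetical; the exact structure returned by `store_data` is not documented here and may differ.

```python
import csv
import io
import json

# Hypothetical scraped records; the real field names may differ.
records = [
    {"user": "alice", "text": "bitcoin is up"},
    {"user": "bob", "text": "bitcoin is down"},
]


def to_json(rows):
    """Serialize the records as a JSON array of objects."""
    return json.dumps(rows, ensure_ascii=False, indent=2)


def to_csv(rows):
    """Serialize the records as CSV text with a header row."""
    if not rows:
        return ""
    buffer = io.StringIO()
    writer = csv.DictWriter(buffer, fieldnames=list(rows[0].keys()))
    writer.writeheader()
    writer.writerows(rows)
    return buffer.getvalue()


json_text = to_json(records)
csv_text = to_csv(records)
print(json_text)
print(csv_text)
```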
This library does not raise privacy issues, because searches are based only on publicly available information.