Error when scraping captions #8

joaanna · 2017-07-03T17:06:01Z

Hey, so far I crawled followers smoothly, but I have 2 issues:

I get this when I try to crawl the captions
python instagramcrawler.py -d data -q 'viralnova365' -c -n 10
dir_prefix: data, query: viralnova365, crawl_type: photos, number: 10, caption: True
posts: 1660, number: 10
Scraping photo links...
Number of photo_links: 25
Scraping captions...
Traceback (most recent call last):
File "instagramcrawler.py", line 297, in
main()
File "instagramcrawler.py", line 293, in main
caption=args.caption)
File "instagramcrawler.py", line 85, in crawl
self.click_and_scrape_captions(number)
File "instagramcrawler.py", line 161, in click_and_scrape_captions
FIREFOX_FIRST_POST_PATH).click()
File "/InstagramCrawler/crawl/lib/python3.4/site-packages/selenium/webdriver/remote/webdriver.py", line 313, in find_element_by_xpath
return self.find_element(by=By.XPATH, value=xpath)
File "InstagramCrawler/crawl/lib/python3.4/site-packages/selenium/webdriver/remote/webdriver.py", line 791, in find_element
'value': value})['value']
File 'InstagramCrawler/crawl/lib/python3.4/site-packages/selenium/webdriver/remote/webdriver.py", line 256, in execute
self.error_handler.check_response(response)
File "InstagramCrawler/crawl/lib/python3.4/site-packages/selenium/webdriver/remote/errorhandler.py", line 194, in check_response
raise exception_class(message, screen, stacktrace)
selenium.common.exceptions.NoSuchElementException: Message: Unable to locate element: //a[contains(@Class, '_8mlbc _vbtk2 _t5r8b')]
also I would like to crawl all the images, but it never downloades the number specifed by -n, do you have any suggestions?

tzuhsial · 2017-07-04T03:06:14Z

Hi @joaanna ,
Thank you for telling me!
I'll look into this when I have time...

tzuhsial · 2017-07-07T12:15:39Z

@joaanna
I think I fixed the path to caption, that makes captions crawlable now.
(Guess I'll have to do this everytime whenever Instagram updates)

And about the number issue,
I am still looking for a robust way to detect if new posts are loaded.
Any help is appreciated!

anfiallos · 2017-08-22T23:41:43Z

Hi. I have the same problem. Error with values on label.
FIREFOX_FIRST_POST_PATH
Any suggestion please?

anakmalank · 2018-09-29T05:31:36Z

hi, i got this problem too.
selenium.common.exceptions.NoSuchElementException: Message: Unable to locate element: //div[contains(@Class, '_8mlbc _vbtk2 _t5r8b')]

tzuhsial added bug enhancement labels Jul 7, 2017

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Error when scraping captions #8

Error when scraping captions #8

joaanna commented Jul 3, 2017

tzuhsial commented Jul 4, 2017

tzuhsial commented Jul 7, 2017

anfiallos commented Aug 22, 2017

anakmalank commented Sep 29, 2018

Error when scraping captions #8

Error when scraping captions #8

Comments

joaanna commented Jul 3, 2017

tzuhsial commented Jul 4, 2017

tzuhsial commented Jul 7, 2017

anfiallos commented Aug 22, 2017

anakmalank commented Sep 29, 2018