Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[Idea] Not recording "viewing time" by bots #6703

Closed
walt93 opened this issue Jun 2, 2022 · 6 comments
Closed

[Idea] Not recording "viewing time" by bots #6703

walt93 opened this issue Jun 2, 2022 · 6 comments

Comments

@walt93
Copy link
Collaborator

walt93 commented Jun 2, 2022

Is your feature request related to a problem? Please describe.
As an amusing side note, I've read in online forums that Eric Dubay, very popular Flat Earth video maker, is a CIA disinformation operation.

I inadvertently uploaded the most recent Eric Dubay Flat Earth video (deceptively titled), and within 90 minutes, my site was swarmed with thirty bots watching the video over-and-over-and-over, driving that garbage up to #7 of all time on my site before I disabled it.

The bots are continuing to move through the site, on some random journey. They're easy to spot. They do this (check the time, the bot is watching the video at 2X playback speed):

Screen Shot 2022-06-02 at 4 02 27 PM

New upload. 3 views. One bot looping...

Describe the solution you'd like

  1. Lets stop logging viewing minutes from their IP address after the length of the video is exceeded. When this condition occurs, record IP address/video ID/timestamp in a table, and expire the entry after 24 hours.
  2. Let's detect this bot behavior and log their IP address in another table. I'd like to map this bot network and traceroute them. 😈
@DanielnetoDotCom
Copy link
Member

As I can see in the code it does not suppose to save bots views

if(isBot()){
die('Bot Not Allowed');
}

but the 1 day time long, maybe means a video in loop

@walt93
Copy link
Collaborator Author

walt93 commented Jun 4, 2022

Daniel, these aren't normal web scraper bots.

These bots purposefully inflate view counts. They watch at 2x speed, and they hit the play button again. And again. And again.

Any human watching a video 12 times in 24 hours at 2x speed, we should not record either. Once is enough.

I'm surprised and impressed that we have bots in the ecosystem that do this TBH.

@DanielnetoDotCom
Copy link
Member

I am not sure how to deal with them.

one option is if they do have a specific user-agent that we can block

@walt93
Copy link
Collaborator Author

walt93 commented Jun 8, 2022

How would I get their user agent? I think the swarm has quieted down/stopped now…

One way to fix is just based on their behavior (watching over-and-over at 2X speed):

  • after user watches full duration of video,
  • stop logging view minutes for 24 hours (also - stop serving VAST ads - I'm testing monetizing with them)

@JoshWho
Copy link
Contributor

JoshWho commented Jun 8, 2022

Only solution would be to not log guest views. Bots can be made to look like humans using chrome easily nowadays. You will be fighting a uphill battle trying to figure them all out.

Another solution would be to somehow include a captcha code before the player loads.

For more security and better bot defense, Install crowdsec and the IPtables bouncer, then get all the configs for it installed, for it and it will cut down on a huge portion of known bots and new bots it detects. https://crowdsec.net/?mtm_campaign=JoshWho

@DanielnetoDotCom
Copy link
Member

you can get the user agents on the apache access logs

@walt93 walt93 closed this as completed Jun 11, 2022
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

3 participants