-
Notifications
You must be signed in to change notification settings - Fork 13
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Days with no CRAN downloads #54
Comments
Maybe related to this: the download counts for 16th and 17th of Jan. are 0 as well. cranlogs::cran_downloads(
packages = "ggplot2",
from = "2020-01-12",
to = Sys.Date()
)
#> date count package
#> 1 2020-01-12 23692 ggplot2
#> 2 2020-01-13 41793 ggplot2
#> 3 2020-01-14 42412 ggplot2
#> 4 2020-01-15 40575 ggplot2
#> 5 2020-01-16 0 ggplot2
#> 6 2020-01-17 0 ggplot2
#> 7 2020-01-18 19643 ggplot2
#> 8 2020-01-19 0 ggplot2 Created on 2020-01-19 by the reprex package (v0.3.0.9001) |
I have fixed most of these, except for the ones in 2012, for which my parser fails, so I'll need to take a closer look to these.... |
These three days are really missing, because the 2012-12-29 file contains the data for 2012-12-26, etc., but then from 2013-01-01 the files names actually refer to the correct day. So these three days are lost forever. IDK if we should document this somewhere or do something else about it. |
Use a warning() to flag those dates in cranlogs::cran_downloads()? |
Yeah, possibly. |
Logs for 2012, which start on Oct 1, need some TLC. They are fixable but the last three days of 2012 do indeed seem to be lost.
Details later if you want. FWIW, with 'packageRank' 0.3.0.9026, you can check these with:
|
Thanks! I don't think there is much to fix, I don't actually use the filenames when updating the db, only the data in the files. |
That's interesting (is that part of the code on GitHub?). I actually make use of the filenames. So I can "fix" it on my end. But do you think this would be something worth informing RStudio about? |
Yeah, it is here: https://github.com/r-hub/cranlogs.app/blob/master/db/update.sh These logs are gone, I am pretty sure, so there is nothing anyone can do about these three days. |
I meant updating the filenames so they point to the correct log file. |
Ah, I see. I am not sure if it is worth changing it. People might have their own workarounds already, and then we'll break them. |
Then, it's probably worth just noting the missing days in the README/webpage. |
I am just posting here it because I wonder if the low download count has anything to do with the date being 29th Feb! cranlogs::cran_downloads(
packages = "ggplot2",
from = "2020-02-25",
to = Sys.Date()
)
#> date count package
#> 1 2020-02-25 42860 ggplot2
#> 2 2020-02-26 44631 ggplot2
#> 3 2020-02-27 42154 ggplot2
#> 4 2020-02-28 34426 ggplot2
#> 5 2020-02-29 5554 ggplot2
#> 6 2020-03-01 0 ggplot2 |
My guess is that part of the reason is that scripts used to do automated downloads may not have accounted for the leap day. That said, the last available leap day, in 2016, wasn't particularly unusual:
|
Also source for R v3.6.3 was released on 2020-02-29. |
Wow. This is probably an oversimplification, but maybe there weren't that many automated downloads back in 2016. IDK if the release date has anything to do with it, but that's easy to check, the other release dates are these: https://rversions.r-pkg.org/r-versions |
The 2020-03-26 was also with 0 downloads. cranlogs::cran_downloads(
packages = "ggplot2",
from = "2020-03-25",
to = Sys.Date()
)
#> date count package
#> 1 2020-03-25 63129 ggplot2
#> 2 2020-03-26 0 ggplot2
#> 3 2020-03-27 63344 ggplot2 |
Download counts are also 0 for 2nd and 3rd of April. cranlogs::cran_downloads(
packages = "ggplot2",
from = "2020-03-31",
to = Sys.Date()
)
#> date count package
#> 1 2020-03-31 66205 ggplot2
#> 2 2020-04-01 65428 ggplot2
#> 3 2020-04-02 0 ggplot2
#> 4 2020-04-03 0 ggplot2
#> 5 2020-04-04 50522 ggplot2 |
2020-04-20 also had 0 downloads. cranlogs::cran_downloads(
packages = "ggplot2",
from = "2020-04-18",
to = Sys.Date()
)
#> date count package
#> 1 2020-04-18 52350 ggplot2
#> 2 2020-04-19 48923 ggplot2
#> 3 2020-04-20 0 ggplot2
#> 4 2020-04-21 63808 ggplot2
#> 5 2020-04-22 0 ggplot2 Created on 2020-04-22 by the reprex package (v0.3.0.9001) |
Haven't seen any downloads for the last week.
|
Seems to have updated; when I rerun the above command, I get more days filled in. Still missing the most recent 2 days though.
|
The log for the current day (e.g. 2020-06-25) isn't be available till the next day (e.g. 2020-06-26). Regarding the 24th, I think they're moving servers/services so my understanding is that they've been manually running the script of late (time zones may come into play as well). FWIW, if you really want the latest counts, you can fetch the logs directly (http://cran-logs.rstudio.com/) or use packages/functions that do so. |
Logs are getting hung up again:
It's weird that the service is so patchy. I'd have thought it's just a daily cron job or something, so that updates "just work". |
Logs seem to be hung up again:
|
No download count for cranlogs::cran_downloads(
packages = "ggplot2",
from = "2020-09-26",
to = Sys.Date()
)
#> date count package
#> 1 2020-09-26 43607 ggplot2
#> 2 2020-09-27 45068 ggplot2
#> 3 2020-09-28 60917 ggplot2
#> 4 2020-09-29 63517 ggplot2
#> 5 2020-09-30 64071 ggplot2
#> 6 2020-10-01 60625 ggplot2
#> 7 2020-10-02 56791 ggplot2
#> 8 2020-10-03 0 ggplot2
#> 9 2020-10-04 43545 ggplot2 Created on 2020-10-06 by the reprex package (v0.3.0.9001) |
There are five days in 2020 that cranlogs::cran_downloads() still reports as having zero downloads:
Would it be possible to fix these? |
The count is 0 also for cranlogs::cran_downloads(
packages = "ggplot2",
from = "2021-11-18",
to = "2021-11-22"
)
#> date count package
#> 1 2021-11-18 115004 ggplot2
#> 2 2021-11-19 106105 ggplot2
#> 3 2021-11-20 0 ggplot2
#> 4 2021-11-21 86233 ggplot2
#> 5 2021-11-22 110980 ggplot2 Created on 2021-11-27 by the reprex package (v2.0.1) |
FWIW, the RStudio logs were posted "late" that day. When that happens, 'cranlogs' will return a zero count. |
There are 43 days when cranlogs::cran_downloads() reports that there were zero package downloads. I've checked a couple of logs at http://cran-logs.rstudio.com/; they seem to disagree.
I'm guessing these will be fixed when you update the DB script (#45).
FWIW
The text was updated successfully, but these errors were encountered: