-
Notifications
You must be signed in to change notification settings - Fork 3
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
community curation response rate #651
Comments
I mentioned today in the group meeting that this had gone up to 43.6% recently last week... I think it's statistically significant because If we plotted the response rate I'm sure it is a continuously upward trajectory... which is basically what we are interested in...I want to get to 50% this year... @kimrutherford is it easy to include this as a graph in the stats? It would be much nicer than the number. It's not urgent but it might be a nice quick task if you want something "alternative" to the big browser elephant.... Does that all make sense? |
All the data is available so it wouldn't be too hard. There are some edge cases to think about. Like this session which was sent out twice, in different years: |
I envisaged that we would just use the ratio of the ones which are sent out vs. the one sent back. So, the numbers To date 1361 publications have been assigned to community members for curation. 597 are finished and are either in the main PomBase database or are currently being checked by the PomBase curators. That's a response rate of 43.8%. so its always the first date sent out (things which are sent out multiple times are just reminders). I envisage that the graph will look like this: i.e goes up continually but very slowly. I'm keep it going up by sending out enough reminders to sustain an increase. I don't send out too many at once as we would be swamped... Eventually it will plateau when we are just left with the people who will never do any. We are a long way from that yet.... I'm still getting lots of "sorry I will do it" and a good uptake when I send reminders, even for old sessions... |
y axis is % |
I might be wrong because I don't know what the graph would look like at the start when the number of session was low! Actually I think it may begin at about 30%. Certainly for the past few years it has been going up slowly (this is partially due to the fact that the uptake on new papers is usually more immediate, it's old ones that are stagnating....) |
44.1%. .....we will get to 50% by the end of the year I'm sure..... |
44.3%..... |
It was 32% when I did this presentation: |
This comment has been minimized.
This comment has been minimized.
This comment has been minimized.
This comment has been minimized.
This comment has been minimized.
This comment has been minimized.
Will keep this open, would nice to see the cumulative increase on the stats page: |
Is that true? If you sent out a bunch of sessions won't the response rate (temporarily) drop? |
actually, that isn't the response rate graph, its the other one (2B), they look similar. I would upload it but i need to swap laptops and mail it to myself because I can't upload to github on the other laptop. |
I've done some querying in Chado. I think the numbers don't match up with the 50% response rate shown in Canto because not all of the publications in Canto are exported to Chado. There are community sessions triaged as "Erratum" and "Wrong organism" for example which aren't exported. I've made a new report "uncuratable publications with a community session" to help work this out: Is a session is approved, the Canto details are exported to Chado regardless of the triage status. This publication is an Erratum, but has an approved session: Here are the numbers from Chado:
Note to self, query with:
|
Ah OK. PMID:31579888 is the one which had 2 PMIDs. This ID will be deleted. Some are methods papers. Occasionally people get annotations from methods papers. We want to class these as "methods" & "curated" One day we need to sort the classification so the "publication type" and " curation status" are separate |
This comment has been minimized.
This comment has been minimized.
This comment has been minimized.
This comment has been minimized.
This comment has been minimized.
This comment has been minimized.
53.9% still increasing |
Latest query result:
|
I had the query wrong and it was making a mess of the older sessions.
|
I've added a curation response rate graph. Hopefully it will be on the main site in the morning but I've just had to restart the load so we'll see. In the meantime it available on my desktop version: https://desktop.kmr.nz/curation_stats |
The load finished after a few false starts. GitHub was returning errors when the load script trying to check for the latest Mondo. https://pombase.org/curation_stats
I'm still not 100% sure I have it right so I plan to check it again tomorrow after a good sleep. :-) |
Unfortunately the date stamps needed from Canto only go back to mid 2013.
It's calculated using the submitted date. It does that so that it matches the Canto stats page which uses the number of submitted sessions. |
I'm going to look at this again in the morning because I've just spotted another problem. Currently it counts submitted sessions up to a given year and then divides by sessions sent out up to the same year. But it's going to get this wrong for sessions that were submitted in a different year to the year they were sent out. There are quite a few of those. Whoops. |
Should the years in the graph be the year sent out or the year submitted? Or year approved? |
submitted I think (the gap between submission and 1st approval should be less than a week 90% of the time so these numbers should be very similar) |
This comment was marked as outdated.
This comment was marked as outdated.
I think the graph from the paper might be wrong but let's have a chat about this on the next call. I've double checked the query that generates the current graph and I think it's correct. But it could be that it's not asking the right question. |
For Kim: find backup from December 2012 to add response rate for that year |
After a bit of digging, the response rate for 2012 was 91.6% There were 12 community sessions sent out and 11 were submitted. Did you send them to people you knew would respond?
|
Yes, I think that was probably the pilot project sessions. I put them all through later as community curated (or we changed them to community curated), I don't quite remember. |
Some sessions are approved before they are accepted which shouldn't be possible, so use the oldest date. Refs pombase/pombase-chado#651
I'll close this as it's getting long and I think it's done. |
In the stats, we report the response rate as a percentage (currently around 42%). It goes up, but very slowly. It would be nice to have a cumulative graph showing the growth over time eventually (the only way is up)
The text was updated successfully, but these errors were encountered: