Skip to content

starrydata/starrydata_datasets

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

42 Commits
 
 
 
 
 
 
 
 

Repository files navigation

starrydata_datasets

Dataset Repository

Repository Description Update Schedule Period
Google Drive Latest dataset only Twice daily at 00:00 and 12:00 from 2024/06/13
Figshare Past datasets Daily until 2024/06/06, then monthly from 2022/12/22
Github Past datasets As needed from 2019/7/11 until 2022/12/22

Changelog

2024/07/04

  • Excluded datasets with the data type "calculation" in the descriptor from the sample dataset and curve dataset. As of 2024/07/01 12:00:01 UTC+0900 (JST), there were 346 samples.

2024/06/26

  • Changed dataset file name prefix from "all" to "starrydata". For example, all_curves.csv is now starrydata_curves.csv.
  • Changed the file extension of the paper dataset from JSON to CSV for availability.
  • Reduced the columns in the paper dataset to only those necessary for citation, reducing the file size from 400MB to about 50MB.
  • Added project_names and created_at to the paper dataset.

2024/06/13

  • The latest datasets are now uploaded to Google Drive.

2024/06/06

  • Fixed the character corruption issue when users open all_samples.csv in certain applications, such as Excel, by adding a BOM.
  • The upload schedule to Figshare has been changed from daily to monthly.

2024/05/22

  • Fixed the incorrect timestamp format in the dataset. For example, corrected "2024-05-17 00:00:01 JST+0900" to "2024-05-17 00:00:01 GMT+0900 (JST)".

2024/05/21

  • The values in the XY value list were originally strings enclosed in double quotations. These double quotations were removed for easier analysis.
  • e.g. ["299.8597", "324.8683"] -> [299.8597, 324.8683]

2024/05/16

  • Added updated_at, created_at, and composition_details to all_samples.csv.

2022/12/22

  • The dataset location was changed from this GitHub repository to Figshare.