This repository has been archived by the owner on Jul 18, 2024. It is now read-only.

[DataCap Application] FileDrive Labs - Smithsonian Open Access #1688

Closed
1 of 2 tasks
laurarenpanda opened this issue Feb 27, 2023 · 101 comments

@laurarenpanda

Data Owner Name

FileDrive Labs

Data Owner Country/Region

China

Data Owner Industry

Life Science / Healthcare

Website

https://filedrive.io/

Social Media

Twitter: https://twitter.com/FileDrive1
Medium: https://medium.com/@FileDrive1
WeChat Official Account: FileDrive

Total amount of DataCap being requested

5PiB

Weekly allocation of DataCap requested

500TiB

On-chain address for first allocation

f1udumyw3yjzxuu5co4rateaq6czubrwbyy2t4jiq

Custom multisig

  • Use Custom Multisig

Identifier

No response

Share a brief history of your project and organization

FileDrive Datasets Landing Plan is a project for onboarding more valuable public datasets onto the Filecoin network. Through several phases, we plan to bring 10 PiB of data and promote 100 PiB of storage power growth on Filecoin.


About FileDrive Datasets

FileDrive Datasets is a platform that connects the large storage market Filecoin has built with publishers of public datasets.
The Filecoin network provides reliable, secure, and affordable decentralized storage services, and FileDrive Labs wants to deliver these benefits to end users by building a public dataset platform.
It is challenging to attract traditional cloud storage and object-based storage users to the Filecoin network so that they can benefit from it. Developers in the Filecoin ecosystem, such as FileDrive Labs, need to face this challenge together.
As a member of the Filecoin ecosystem, FileDrive Labs has been committed to developing useful tools that make it easier for users to store their data on the Filecoin network.

FileDrive Datasets integrates a group of tools to provide a storage service that is compatible with both cloud storage and object-based storage, with a better user experience to attract more users.
Ongoing projects behind it:
- Go-Graphsplit: https://github.com/filedrive-team/go-graphsplit
- DS-Cluster: https://github.com/filedrive-team/go-ds-cluster
- Filejoy: https://github.com/filedrive-team/filejoy

Article about FileDrive Datasets on Filecoin Blog:
- Large Datasets: FileDrive: https://filecoin.io/blog/posts/large-datasets-filedrive/



About FileDrive Labs

FileDrive Labs has always defined itself as a tool developer and infrastructure builder in the Filecoin ecosystem. Since 2019, we have focused on technical solutions and development based on the IPFS protocol and the Filecoin network, and we do our best to contribute to the community.
Over 80% of our team are qualified engineers, and half of them have more than 10 years of development experience in multiple industries, including communications, the Internet, and blockchain.
Since 2020, we have participated in the Slingshot Competition, become one of the top teams, and stored over 5 PiB of useful data from public datasets on the Filecoin network.
To contribute to the Filecoin community, we developed the open-source data prep tool Graphsplit, the FIL+ project dashboard filplus.info, and the storage provider discovery platform filfind.info.
In addition, we have held a weekly online event named FileDrive Meetup since March 2022, which aims to provide a platform for community members to follow the latest trends of the Filecoin network and our work and research.

Please check the following links for more details.
- GitHub: https://github.com/filedrive-team
- Twitter: https://twitter.com/FileDrive1
- Eventbrite: https://www.eventbrite.hk/o/filedrive-labs-42456337463
- YouTube Channel: https://www.youtube.com/channel/UCxcZC1dtBUlQvZY7DX13W1w
- Medium: https://medium.com/@FileDrive1

Is this project associated with other projects/ecosystem stakeholders?

No

If answered yes, what are the other projects/ecosystem stakeholders

No response

Describe the data being stored onto Filecoin

Smithsonian Open Access
- The Smithsonian’s mission is the "increase and diffusion of knowledge", and it has been collecting since 1846. The Smithsonian, through its efforts to digitize its multidisciplinary collections, has created millions of digital assets and related metadata describing the collection objects. On February 25th, 2020, the Smithsonian released over 2.8 million CC0 interdisciplinary 2-D and 3-D images, related metadata, and, additionally, research data from researchers across the Smithsonian. The 2.8 million "open access" collections are a subset of the Smithsonian’s 155 million objects, 2.1 million library volumes and 156,000 cubic feet of archival collections held in 19 museums, 9 research centers, libraries, archives and the National Zoo. Digitization of collections is ongoing.
- https://registry.opendata.aws/smithsonian-open-access/
- License: CC0
- Size: 621.2 TiB
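
For orientation, the figures in this application can be related with a rough calculation. This is only a sketch: the dataset size, the 5PiB total, and the 500TiB weekly rate come from the form above, but reading their ratio as an implied replica count is an assumption (the application does not state a copy count), and the variable names are hypothetical.

```python
# Rough arithmetic relating the dataset size to the requested DataCap.
# The three input figures are taken from the application form; treating
# the ratio as a replica count is an assumption for illustration only.
TIB_PER_PIB = 1024

dataset_tib = 621.2                 # "Size: 621.2 TiB"
requested_tib = 5 * TIB_PER_PIB     # "5PiB" total DataCap requested
weekly_tib = 500                    # "500TiB" weekly allocation requested

# How many full copies of the dataset the total request could cover.
implied_copies = requested_tib / dataset_tib

# How many weeks the total would last at the requested weekly rate.
weeks_at_weekly_rate = requested_tib / weekly_tib

print(f"Implied copies: {implied_copies:.2f}")
print(f"Weeks at weekly rate: {weeks_at_weekly_rate:.2f}")
```

Under these assumptions the request corresponds to roughly eight copies of the dataset, onboarded over about ten weeks at the requested rate.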

Where was the data currently stored in this dataset sourced from

My Own Storage Infra

If you answered "Other" in the previous question, enter the details here

No response

How do you plan to prepare the dataset

IPFS, lotus, graphsplit

If you answered "other/custom tool" in the previous question, enter the details here

No response

Please share a sample of the data

Original Source:
https://registry.opendata.aws/smithsonian-open-access/

Confirm that this is a public dataset that can be retrieved by anyone on the Network

  • I confirm

If you chose not to confirm, what was the reason

No response

What is the expected retrieval frequency for this data

Weekly

For how long do you plan to keep this dataset stored on Filecoin

2 to 3 years

In which geographies do you plan on making storage deals

Greater China, Asia other than Greater China, North America, Europe, Australia (continent)

How will you be distributing your data to storage providers

IPFS, Shipping hard drives, Lotus built-in data transfer

How do you plan to choose storage providers

Slack, Filmine

If you answered "Others" in the previous question, what is the tool or platform you plan to use

No response

If you already have a list of storage providers to work with, fill out their names and provider IDs below

Please check the Checker Reports of our previous LDN applications:
- https://github.com/filecoin-project/filecoin-plus-large-datasets/issues/1266

How do you plan to make deals to your storage providers

Lotus client

If you answered "Others/custom tool" in the previous question, enter the details here

No response

Can you confirm that you will follow the Fil+ guideline

Yes

@large-datacap-requests

Thanks for your request!

Heads up, you’re requesting more than the typical weekly onboarding rate of DataCap!

@large-datacap-requests

Thanks for your request!
Everything looks good. 👌

A Governance Team member will review the information provided and contact you back pretty soon.

@Sunnyiscoming
Collaborator

You mentioned this dataset in 8 applications. How much of this dataset has been stored so far, and in how many copies?
#1266
#1267
#1268

Because there is no consensus on whether the client should submit applications one by one, 5 applications have not been updated.
#1623
#1624
#1625
#1626
#1627

@laurarenpanda
Author

@Sunnyiscoming
We have yet to store this dataset with the DC from #1266, #1267, and #1268 (2451.1TiB data, 15 PiB DC with 6-11 copies).
So we moved this one into Landing Plan V2.
Since you suggested that we submit a separate application for each dataset, I submitted this LDN afterwards.

Proposal 832 is still under discussion and has not yet reached consensus among the community and Notaries.
So, I am still confused about what I should do at present.

@Sunnyiscoming self-assigned this on Mar 2, 2023
@Sunnyiscoming
Collaborator

Datacap Request Trigger

Total DataCap requested

5PiB

Expected weekly DataCap usage rate

500TiB

Client address

f1udumyw3yjzxuu5co4rateaq6czubrwbyy2t4jiq

@large-datacap-requests

DataCap Allocation requested

Multisig Notary address

f02049625

Client address

f1udumyw3yjzxuu5co4rateaq6czubrwbyy2t4jiq

DataCap allocation requested

250TiB

Id

e30a5bb4-9378-4b93-a10a-d992b77021bb

@Fatman13
Contributor

Fatman13 commented Mar 3, 2023

checker:manualTrigger

@filplus-checker-app

DataCap and CID Checker Report [1]

No application info found for this issue on https://filplus.d.interplanetary.one/clients.

Footnotes

  1. To manually trigger this report, add a comment with text checker:manualTrigger

@laurarenpanda
Author

@Fatman13
Please check the historical deal report:
#1266 (comment)

@Fatman13
Contributor

Fatman13 commented Mar 3, 2023

What was the reason for CID sharing again? I remember seeing you explaining it somewhere but couldn't find it.

@laurarenpanda
Author

> What was the reason for CID sharing again? I remember seeing you explaining it somewhere but couldn't find it.

Processing the same public dataset with the same preprocessing tool, such as Go-Graphsplit, can produce identical CIDs.
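
The reasoning here is that content addressing is deterministic: identical bytes split in an identical way hash to identical identifiers, no matter who runs the tool. Below is a minimal sketch of that idea. It is not how Go-Graphsplit or real CIDs are computed; plain SHA-256 over fixed-size chunks stands in for the actual pipeline, and the function name, sample bytes, and chunk sizes are hypothetical.

```python
import hashlib


def chunk_ids(data: bytes, chunk_size: int) -> list[str]:
    """Split data into fixed-size chunks and hash each chunk.

    Simplified stand-in for a data-prep pipeline: real tools build
    DAGs and CIDs, but the determinism argument is the same.
    """
    return [
        hashlib.sha256(data[i:i + chunk_size]).hexdigest()
        for i in range(0, len(data), chunk_size)
    ]


# Hypothetical sample standing in for a public dataset file.
dataset = b"smithsonian-open-access-sample" * 1000

# Two independent clients using the same tool settings on the same data.
client_a = chunk_ids(dataset, 4096)
client_b = chunk_ids(dataset, 4096)

# A third client using different settings on the same data.
client_c = chunk_ids(dataset, 8192)

print(client_a == client_b)  # same input and settings give the same IDs
print(client_a == client_c)  # different chunking gives different IDs
```

In this simplified model, identifiers coincide only when both the source bytes and the preparation parameters match, which is consistent with the explanation given for shared CIDs across applications that use the same public dataset.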

Contributor

Fatman13 commented Mar 3, 2023

Request Proposed

Your Datacap Allocation Request has been proposed by the Notary

Message sent to Filecoin Network

bafy2bzacecxngctrafbtevjmqfgpanrkzaxgkxsek3kong3t6hskx4uzie7ga

Address

f1udumyw3yjzxuu5co4rateaq6czubrwbyy2t4jiq

Datacap Allocated

250.00TiB

Signer Address

f1j3u7crhjzwb2cj5mq7vodlt4o66yoyci7lhcauy

Id

e30a5bb4-9378-4b93-a10a-d992b77021bb

You can check the status of the message here: https://filfox.info/en/message/bafy2bzacecxngctrafbtevjmqfgpanrkzaxgkxsek3kong3t6hskx4uzie7ga

@liyunzhi-666

Based on the disclosure records and comment history, I would like to support this round.


Request Approved

Your Datacap Allocation Request has been approved by the Notary

Message sent to Filecoin Network

bafy2bzacebb56cfyjxmba45m6dseb2zyo3zfk2rwnxl22wrwxvfoqj63jhceu

Address

f1udumyw3yjzxuu5co4rateaq6czubrwbyy2t4jiq

Datacap Allocated

250.00TiB

Signer Address

f1pszcrsciyixyuxxukkvtazcokexbn54amf7gvoq

Id

e30a5bb4-9378-4b93-a10a-d992b77021bb

You can check the status of the message here: https://filfox.info/en/message/bafy2bzacebb56cfyjxmba45m6dseb2zyo3zfk2rwnxl22wrwxvfoqj63jhceu

@data-programs added the "kyc verified" label (User has passed KYC check) on Dec 27, 2023

DataCap Allocation requested

Request number 6

Multisig Notary address

f02049625

Client address

f1udumyw3yjzxuu5co4rateaq6czubrwbyy2t4jiq

DataCap allocation requested

1.03TiB

Id

0d182a40-e0ad-4d99-9873-871a0d00d07d


github-actions bot commented Jan 7, 2024

This application has not seen any responses in the last 10 days. This issue will be marked with Stale label and will be closed in 4 days. Comment if you want to keep this application open.

--
Commented by Stale Bot.

@laurarenpanda
Author

Please keep this open.


This application has not seen any responses in the last 10 days. This issue will be marked with Stale label and will be closed in 4 days. Comment if you want to keep this application open.

--
Commented by Stale Bot.

@laurarenpanda
Author

Please keep this application open.


This application has not seen any responses in the last 10 days. This issue will be marked with Stale label and will be closed in 4 days. Comment if you want to keep this application open.

--
Commented by Stale Bot.

@laurarenpanda
Author

Please keep this application open.


This application has not seen any responses in the last 10 days. This issue will be marked with Stale label and will be closed in 4 days. Comment if you want to keep this application open.

--
Commented by Stale Bot.

@laurarenpanda
Author

Please keep this application open.


This application has not seen any responses in the last 10 days. This issue will be marked with Stale label and will be closed in 4 days. Comment if you want to keep this application open.

--
Commented by Stale Bot.


This application has not seen any responses in the last 14 days, so for now it is being closed. Please feel free to contact the Fil+ Gov team to re-open the application if it is still being processed. Thank you!

--
Commented by Stale Bot.

@laurarenpanda
Author

Please keep this application open.
