Skip to content
This repository has been archived by the owner on Jul 18, 2024. It is now read-only.

[DataCap Application] NDLABS - Hyper.ai <1/4> #1720

Closed
1 of 2 tasks
NDLABS-Leo opened this issue Mar 3, 2023 · 55 comments
Closed
1 of 2 tasks

[DataCap Application] NDLABS - Hyper.ai <1/4> #1720

NDLABS-Leo opened this issue Mar 3, 2023 · 55 comments
Assignees
Labels

Comments

@NDLABS-Leo
Copy link

Data Owner Name

NDLABS

Data Owner Country/Region

Singapore

Data Owner Industry

IT & Technology Services

Website

https://www.ndlabs.io/#/

Social Media

Twitter: @imNDLABS
Slack: @NDLABS-OFFICE

Total amount of DataCap being requested

5PiB

Weekly allocation of DataCap requested

1PiB

On-chain address for first allocation

f1d2op7ndmbevlthyonqntkxsg4a47qgliztvhj4i

Custom multisig

  • Use Custom Multisig

Identifier

f1d2op7ndmbevlthyonqntkxsg4a47qgliztvhj4i

Share a brief history of your project and organization

ND LABS has technical operation centers and nodes in Singapore, Hong Kong, the United States, and Dubai. Since Fil has launch of the mainnet in 2020, ND has begun to provide technical services to partners to help them complete the construction of storage services. At present, the accumulated storage power of ND exceeds 300P globally. The largest node has 100P storage power, and the node owns exceeds more than 1.4 million FIL. 
ND LABS is positioned as a decentralized storage service provider for WEB3. For a long time, ND not only focuses on building nodes for partners, but also explores how to provide better storage services for potential clients of web3. Since October 2021, ND has been deeply involved in the FilPlus project, vigorously promoting the Filplus project to partners who has effective data storage needs. We also providing them with a complete set of solutions and technical services for storing data in the FIL network. The Singapore and US nodes are the main storage nodes, which was provide real data storage for early customers.

Is this project associated with other projects/ecosystem stakeholders?

Yes

If answered yes, what are the other projects/ecosystem stakeholders

We gonna cooperate with more SPs from other regions

Describe the data being stored onto Filecoin

https://hyper.ai/datasets is an open data web, which collects hundreds of datasets with a total capacity of 1.8P for public use.
Details of Datasets including:
Audi Autonomous Driving Dataset - 2.26T
BDD100K - 1.81T
BDDK - 1.8T
LSUN 20 Object Categories - 1.69T
Youtube 8M - 1.52T
WebVision1.0+2.0 - 1.26T
ApolloSpace - 1.19T
TrackingNet - 1.04T
...
FYI, to release the concern from community members of "one LDN combined smaller sets of data". 
When we prepare the data, different datasets will be tagged variously, to classify into our database. If we need to index later, we can quickly and easily find the file corresponding to the demand. In addition, we are also developing a browser to solve the "merged data"

Where was the data currently stored in this dataset sourced from

Other

If you answered "Other" in the previous question, enter the details here

https://hyper.ai/datasets

How do you plan to prepare the dataset

IPFS, lotus, singularity, graphsplit

If you answered "other/custom tool" in the previous question, enter the details here

No response

Please share a sample of the data

https://hyper.ai/datasets/15610
https://hyper.ai/datasets/11926
https://hyper.ai/datasets/5570
https://hyper.ai/datasets/5678
https://hyper.ai/datasets/5531
https://hyper.ai/datasets/16979
https://hyper.ai/datasets/5197
https://hyper.ai/datasets/16773
https://hyper.ai/datasets/8754
https://hyper.ai/datasets/4889
https://hyper.ai/datasets/17858
https://hyper.ai/datasets/16184
https://hyper.ai/datasets/5191
https://hyper.ai/datasets/5358
https://hyper.ai/datasets/9430

Confirm that this is a public dataset that can be retrieved by anyone on the Network

  • I confirm

If you chose not to confirm, what was the reason

No response

What is the expected retrieval frequency for this data

Yearly

For how long do you plan to keep this dataset stored on Filecoin

1 to 1.5 years

In which geographies do you plan on making storage deals

Greater China, Asia other than Greater China, North America

How will you be distributing your data to storage providers

HTTP or FTP server, IPFS, Shipping hard drives

How do you plan to choose storage providers

Slack, Big data exchange, Partners

If you answered "Others" in the previous question, what is the tool or platform you plan to use

No response

If you already have a list of storage providers to work with, fill out their names and provider IDs below

No response

How do you plan to make deals to your storage providers

Boost client, Lotus client

If you answered "Others/custom tool" in the previous question, enter the details here

No response

Can you confirm that you will follow the Fil+ guideline

Yes

@large-datacap-requests
Copy link

Thanks for your request!
Everything looks good. 👌

A Governance Team member will review the information provided and contact you back pretty soon.

@Sunnyiscoming
Copy link
Collaborator

Datacap Request Trigger

Total DataCap requested

5PiB

Expected weekly DataCap usage rate

1PiB

Client address

f1d2op7ndmbevlthyonqntkxsg4a47qgliztvhj4i

@large-datacap-requests
Copy link

DataCap Allocation requested

Multisig Notary address

f02049625

Client address

f1d2op7ndmbevlthyonqntkxsg4a47qgliztvhj4i

DataCap allocation requested

256TiB

Id

62668cd9-9fc0-45aa-a4ed-579dcd7bd855

@kernelogic
Copy link

I have checked https://hyper.ai/datasets and looks like it's very comprehensive for dataset indexing. Willing to support.

@sxxfuture-official
Copy link

@NDLABS-OFFICE
Can you provide your future sealing plans?
Including SP and its region, the way of data transport.

Copy link

Request Proposed

Your Datacap Allocation Request has been proposed by the Notary

Message sent to Filecoin Network

bafy2bzacea6tearib5l5d2oshi2m5wr66aiyr43tq7kcssxjb3l4mdr4tkxys

Address

f1d2op7ndmbevlthyonqntkxsg4a47qgliztvhj4i

Datacap Allocated

256.00TiB

Signer Address

f1yjhnsoga2ccnepb7t3p3ov5fzom3syhsuinxexa

Id

62668cd9-9fc0-45aa-a4ed-579dcd7bd855

You can check the status of the message here: https://filfox.info/en/message/bafy2bzacea6tearib5l5d2oshi2m5wr66aiyr43tq7kcssxjb3l4mdr4tkxys

@NDLABS-Leo
Copy link
Author

@sxxfuture-official
ND has separate server rooms in Singapore, Hong Kong, and the US, and currently has at least another 40P+ of storage capacity, as well as multiple machines for building CAR files and web servers to provide download services. At all three of these separate server rooms (with approximately 100Mbps of bandwidth allocated to the Fil+ project) we can download, package, and send the above datasets.
Moreover, during the time ND has been active in the community, we have contacted many high-quality SPs in the industry, and we will distribute data to these high-quality SPs for storage. In addition, the storage middleware developed by ND will check the data for duplication to ensure that the data is not duplicated or misused. In addition, the nodes provided by ND and high-quality SPs in the industry will support data retrieval, and we will also provide data storage CIDs for network-wide retrieval if needed.
In addition, 1521 is our previous storage project, and the nodes can be seen above.

@sxxfuture-official
Copy link

By checking the public big data set link provided, the information is true and reliable, and ND is not a new account, so I will support the project in this round.

Copy link

Request Approved

Your Datacap Allocation Request has been approved by the Notary

Message sent to Filecoin Network

bafy2bzaceatdg6pmzjn6azijyyvwttss2uhx374gmt6csjfz5yqojhuxyb2jc

Address

f1d2op7ndmbevlthyonqntkxsg4a47qgliztvhj4i

Datacap Allocated

256.00TiB

Signer Address

f1foiomqlmoshpuxm6aie4xysffqezkjnokgwcecq

Id

62668cd9-9fc0-45aa-a4ed-579dcd7bd855

You can check the status of the message here: https://filfox.info/en/message/bafy2bzaceatdg6pmzjn6azijyyvwttss2uhx374gmt6csjfz5yqojhuxyb2jc

@large-datacap-requests
Copy link

Stats & Info for DataCap Allocation

Multisig Notary address

f02049625

Client address

f1d2op7ndmbevlthyonqntkxsg4a47qgliztvhj4i

Rule to calculate the allocation request amount

400% weekly > 2PiB, requesting 2PiB

DataCap allocation requested

1.25PiB

Total DataCap granted for client so far

1.862645149230957e+37YiB

Datacap to be granted to reach the total amount requested by the client (5PiB)

1.862645149230957e+37YiB

Stats

Number of deals Number of storage providers Previous DC Allocated Top provider Remaining DC
93356 22 2PiB 22.39 508.18TiB

@NDLABS-Leo
Copy link
Author

FYI, we are disclosing here for tracking by notaries.
We have located that the reason why ND and the SPs we cooperate with cannot be searched and sampled by Filplus RetrievalBot is that our code is not synchronized with the latest boost code, so the payloadCID (rootCID) cannot be retrieved. However, pieceCID can be retrieved, which can be seen in previous notaries' reviewing process.
ND and the SPs we cooperate with store 100% unseal files, from the beginning to the present. Our engineers are simultaneously modifying the code and updating the architecture, which is expected to be completed early this week. Looking forward to your understanding and signing. Thanks a lot.

@1ane-1
Copy link

1ane-1 commented Jun 26, 2023

I will support you but this week hope you can deal with it and i will check that.

Copy link

1ane-1 commented Jun 26, 2023

Request Proposed

Your Datacap Allocation Request has been proposed by the Notary

Message sent to Filecoin Network

bafy2bzacedvgkupvxjacxmz7jkspfeddp2j5irblknnb3uyzqqsj2dxqpme3q

Address

f1d2op7ndmbevlthyonqntkxsg4a47qgliztvhj4i

Datacap Allocated

1.25PiB

Signer Address

f1mdk7s2vntzm6hu35yuo6vjubtrpfnb2awhgvrri

Id

13fae895-1552-4b1d-8108-501b8c96a999

You can check the status of the message here: https://filfox.info/en/message/bafy2bzacedvgkupvxjacxmz7jkspfeddp2j5irblknnb3uyzqqsj2dxqpme3q

Copy link

psh0691 commented Jun 26, 2023

Request Approved

Your Datacap Allocation Request has been approved by the Notary

Message sent to Filecoin Network

bafy2bzacedqasx2trmnlhb4sj6hxwvhixbh474o2hcnsigsrergzavtbgkypk

Address

f1d2op7ndmbevlthyonqntkxsg4a47qgliztvhj4i

Datacap Allocated

1.25PiB

Signer Address

f1qdko4jg25vo35qmyvcrw4ak4fmuu3f5rif2kc7i

Id

13fae895-1552-4b1d-8108-501b8c96a999

You can check the status of the message here: https://filfox.info/en/message/bafy2bzacedqasx2trmnlhb4sj6hxwvhixbh474o2hcnsigsrergzavtbgkypk

@NDLABS-Leo
Copy link
Author

Since the project has been signed, the application form will be closed, but we will still synchronize the processing of data in the application form.

@NDLABS-Leo
Copy link
Author

image

38362483
baga6ea4seaqlfhcdf5gpxgetvnm74y3y6zfgn654a6x4flrillbrk4pugmysgjq
bafykbzacedieutroihh6t3zzstdd4d2nr42t46dru7mqn2nuru6abl3jq2tw4
52153
38363254
baga6ea4seaqc72xasi27uv7cp4psd2dsgfnur5xjdefut4c2ozaciey52tcsehy
bafykbzacebqizirqsewqnb7vxlnlybmfb4f52fecgiagowz6l6y3dpgecbxdg
52154
38399061
baga6ea4seaqmr53smv7nxhoktm3fc5fmas2hcrmfwoyyfpuyb2eefmyeqqhumoq
bafykbzaceb7nlwd7me2rzxnpuw3zs5sypr46gutxey3kj33vwupwpgeaoro52
52155
38399065
baga6ea4seaqkzp3bjtjnraomk6alfp7rkdfttpkyfb4a5rta542cw3cfse376pa
bafykbzacec4itkvpe22h6dhkz53c3ofdgj446uzri6xkfjnhw3evj4ms2oz4s
52160
38399062
baga6ea4seaqb6nu6roqyb53w6peibborkwzsp6gcjvizhh6w66ikwh2clfo7agq
bafykbzaceawaf5cqgj3pc24gxtlyahmhdrxdl7ndnfg73bvhyt6ffbljl7aj4
52156
38399067
baga6ea4seaqnoo5qmp7uozrdnxnr7t3dof57vg6avtvnzuxspxze5pnf4kve6pa
bafykbzacebzh4d3t7il4qlyeydchogqzlipiplecg4jeummotb72sox7kg6es
52161
38399066
baga6ea4seaqd7rylsyltj74iw4ziltnfn2ftm53feexbgyoit2ldryuvkh2oyji
bafykbzaced7r74puwoidmwqduirjb3ahytasb372lmmyr6h2cgjp3ricm5swi
52157
38399064
baga6ea4seaqabiyszmplwpue6m6tllbo4eykzswfekgs5qpixnnijplzppnv6pi
bafykbzacecfwdgib2sz5xtl3bwepthfg74wi2azq2ijpxs7nxy6qs3k2rxtho
52159
38399063
baga6ea4seaqitnxtpmirvxqec7qtk3ippiwnhagl5r6apwskygl3hycoqqi7aii
bafykbzacectklij7eyb4cksmdtwjficcsoatnurx3s76xs7wjepv4uzaruw7y
52158

As these CIDs can be retrieved by picec_cid, but cannot be retrieved in check bot, currently contacting sp to adapt the technology to support root cid retrieval

@Chris00618
Copy link

CID is still unretrievable
Screenshot(1)(1)

@NDLABS-Leo
Copy link
Author

FYI, we are disclosing here for tracking by notaries. We have located that the reason why ND and the SPs we cooperate with cannot be searched and sampled by Filplus RetrievalBot is that our code is not synchronized with the latest boost code, so the payloadCID (rootCID) cannot be retrieved. However, pieceCID can be retrieved, which can be seen in previous notaries' reviewing process. ND and the SPs we cooperate with store 100% unseal files, from the beginning to the present. Our engineers are simultaneously modifying the code and updating the architecture, which is expected to be completed early this week. Looking forward to your understanding and signing. Thanks a lot.

As I disclosed above, currently we can only retrieve it with the following command:
#Search command:
lotus client retrieve --provider 节点号 --pieceCid piece_cid payload_cid ~/

#Example
lotus client retrieve --provider 38362483 --pieceCid baga6ea4seaqlfhcdf5gpxgetvnm74y3y6zfgn654a6x4flrillbrk4pugmysgjq bafykbzacedieutroihh6t3zzstdd4d2nr42t46dru7mqn2nuru6abl3jq2tw4 ~/

@filplus-checker-app
Copy link

DataCap and CID Checker Report Summary1

Retrieval Statistics

⚠️ All retrieval success ratios are below 1%.

  • Overall Graphsync retrieval success rate: 0.19%
  • Overall HTTP retrieval success rate: 0.00%
  • Overall Bitswap retrieval success rate: 0.00%

Storage Provider Distribution

✔️ Storage provider distribution looks healthy.

Deal Data Replication

⚠️ 97.27% of deals are for data replicated across less than 4 storage providers.

Deal Data Shared with other Clients2

⚠️ CID sharing has been observed. (Top 3)

Full report

Click here to view the CID Checker report.
Click here to view the Retrieval report.

Footnotes

  1. To manually trigger this report, add a comment with text checker:manualTrigger

  2. To manually trigger this report with deals from other related addresses, add a comment with text checker:manualTrigger <other_address_1> <other_address_2> ...

@data-programs
Copy link
Collaborator

KYC

This user’s identity has been verified through filplus.storage

Sign up for free to subscribe to this conversation on GitHub. Already have an account? Sign in.
Labels
Projects
None yet
Development

No branches or pull requests