-
Notifications
You must be signed in to change notification settings - Fork 5
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
IPFS Garbage Collection v2 #8
base: main
Are you sure you want to change the base?
Changes from 1 commit
File filter
Filter by extension
Conversations
Jump to
Diff view
Diff view
There are no files selected for viewing
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,103 @@ | ||
# Project Pitch: IPFS Garbage Collection v2 | ||
###### Making GC fast, efficient, and targeted | ||
|
||
Authors: @gammazero | ||
|
||
## Purpose and Impact | ||
|
||
### Background and Intent | ||
When storage is reclaimed by removing unpinned blocks, this garbage collection requires stopping other IPFS activities for its duration, and can take significant time. For users such as [Pinata](https://pinata.cloud/), this can take “literally days” for a single node. For large pin sets, this is expensive in both memory and time. Garbage collection needs to operate differently, to significantly reduce the impact on IPFS. Another request from multiple users is to be able to remove specific content from an IPFS node. Currently, GC removes all unpinned blocks. It would be useful for GC to be able to remove all blocks associated with a specific DAG. | ||
|
||
After this project, these specific pain-points will be resolved: | ||
- GC takes too long for nodes with large pin sets | ||
- Need to GC content when it becomes unpinned | ||
- Need to be able to remove specific content | ||
|
||
This project is critical for continued growth of users such as Pinata, as the current GC implementation is a significant obstacle for them. The project is also important to users who need to remove specific content from their nodes, but do not want complete GC. | ||
|
||
### Assumptions and Hypotheses | ||
|
||
For this project to matter, it must significantly improve operations for highly impacted customers like Pinata. Faster GC will have a directly improve Pinata’s use of IPFS. Having the ability to remove content when it becomes unpinned will further enhance the improvement for Pinata’s use case, as GC can avoid looking at all blocks and can operate on only those that became unpinned. | ||
For users who do not see or care about the GC performance issues, this project will still matter to users who need to remove specific content from nodes, but do not want to remove all unpinned content. | ||
|
||
### User workflow example | ||
|
||
#### Bulk GC | ||
A user would use the new GC in the same way as the previous version, but with potentially significant reductions in duration and therefore less disruption to IPFS operation. This may eliminate the need to work around the disruption by doing things like taking nodes offline to perform GC. Ideally, GC lock time should be minimized to allow GC to be done effectively concurrently with normal IPFS operation. | ||
|
||
#### Removing Specific Content | ||
Certain users may only need to remove content that becomes unpinned. This is particularly true of pinning services such as Pinata, and will be their primary, if not only, method of garbage collection. To remove content that becomes unpinned, the command line workflow is: | ||
- Unpin the desired content: `ipfs pin rm <cid>` | ||
- GC the unpinned content: `ipfs repo gc --cid=<cid>` | ||
|
||
The above could also be done with a single command, if it adds value: `ipfs pin rm -gc <cid>` | ||
It may be valuable once local named pinning is available. | ||
|
||
In the case that a user wants to remove content that was never pinned, they execute step 2 from above. This tells GC to walk the IPLD DAG identified by the CID and remove any associated blocks that are not referenced. | ||
|
||
### Impact | ||
While impact varies greatly for different types of users, some of our most important users (e.g. Pinata) are heavily impacted and have a need for this feature. It will have a significant effect on their business operations. Therefore, this gets an impact rating of 3. | ||
🔥🔥🔥 | ||
|
||
### Leverage | ||
This fixes a few specific issues, but is not highly leverageable. The ability to remove specific content may allow implementation of a custom data retention policy. Therefore this gets a 1 for leverage. | ||
🎯 | ||
|
||
### Confidence | ||
Medium (level 5) confidence, according to [this scale](https://medium.com/@nimay/inside-product-introduction-to-feature-priority-using-ice-impact-confidence-ease-and-gist-5180434e5b15)_. | ||
|
||
## Project definition | ||
|
||
### Brief plan of attack | ||
momack2 marked this conversation as resolved.
Show resolved
Hide resolved
momack2 marked this conversation as resolved.
Show resolved
Hide resolved
|
||
|
||
1. Design how reference counting is implemented, exploring what interfaces will change, and what subsystems will be affected. Address handling storage limits, preventing removal of newly arrived blocks, need for transactions in datastore, etc. | ||
2. Implement reference counting of blocks in the blockstore. Counts are updated by pinner and MFS. | ||
3. Implement new garbage collector that operates on reference counts instead of pinned CIDs | ||
4. Implement targeted GC (GC that follows a DAG and deletes associated unreferenced blocks) | ||
5. Optional, if it adds value: Implement command to remove pin and then immediately GC the unpinned CID | ||
6. Distribute an experimental version of go-ipfs for evaluation and feedback. | ||
|
||
Garbage collection performance issues arise because GC attempts to remove blocks by loading a set of all pinned CIDs into memory and searching for each block’s CID in this set. The plan attack operates on the following premises: | ||
|
||
- Maintaining reference counts for blocks will make it quick to determine if a block can be removed, because it is not necessary to load all pinned CID into memory and search this. | ||
- Specific content can be removed by performing GC on the blocks associated with a IPLD DAG identified by CID. | ||
gammazero marked this conversation as resolved.
Show resolved
Hide resolved
|
||
|
||
### What does done look like? | ||
There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. Related to other comments about metrics, should done also include metrics about GC time and the time it takes to pin/unpin now given the reference counting? There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. I don't think it makes more sense to have time metrics for pinning, because the pinning process may also need to fetch content from the network as it is being pinned. I don't think that metrics for unpin become any more important, as this should be equivalent to walking a dag whose. |
||
- Block reference counts stored in datastore | ||
- Block reference counts updated by pinning, unpinning, adding to MFS, removing from MFS. | ||
- New GC implementation that operates on block reference counts instead of on set of all pinned CIDs | ||
- Ability for GC to remove content identified by CID | ||
- New `--cid` option: `ipfs repo gc --cid=<cid>` | ||
|
||
Removing pins will be slower because it requires walking DAG to decrement reference counts. Adding items to MFS and removing items from MFS will be slower due to needing to update reference counts. Hopefully any increase in execution time will be insignificant. | ||
There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. Seems like we should use piñata as an example here and estimate what the impact would be (since they add pins all the time, but don’t GC as often now) There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. I caught up with Matt to try and grab some numbers here: There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. @momack2 I will add some rough estimates by timing some things that I think will be similar in time to the new GC, and comparing with the old GC. @jnthnvctr - I added a link to this in the doc, and brought some of the content into the doc. Thank you! |
||
|
||
### What does success look like? | ||
Success, for Pinata, means that they can remove unpinned content with significant reduction in time that node is unavailable during GC. Best case is that this allows Pinata to perform GC without having to take a node offline, and with little or no impact to IPFS running on the node. | ||
|
||
Success also means that this functionality does not have a significant negative impact, such as consuming large amounts of storage space, or impairing the performance of operations outside of GC This must be true for all usersA user should be able to remove only content that becomes un | ||
gammazero marked this conversation as resolved.
Show resolved
Hide resolved
|
||
|
||
### Counterpoints and pre-mortem | ||
If high performance GC results in reduced performance elsewhere, users not affected by GC performance issues will see this as a negative impact. The following areas could be negatively affected by the new GC, but hopefully none will be significant for any user | ||
- Removing pins will be slower because it requires walking DAG to decrement reference counts. | ||
- Adding items to MFS and removing items from MFS will be slower due to needing to update reference counts. | ||
- Keeping reference counts will consume for storage space | ||
There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. Will we have metrics for all of these? I think it would be good to add the instrumentation before the improvements so we can compare before/after. There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. I think that eventually there should be metrics for this, but probably not for the initila implementation. Reference counts will be stored in a specific location and a user should be able to see how much space is used on disk in this location. |
||
|
||
The project could fail, or fail to be deliverable in the proposed time if reference counting incurs unforseen complexity in its implementation or changes required to other subsystems. The risk of the will be mitigated by early review of reference counting design and prototyping. | ||
gammazero marked this conversation as resolved.
Show resolved
Hide resolved
|
||
|
||
### Alternatives | ||
There does not currently appear to be alternate ways to solve these issues. There may be tools that are better suited to implementing portions of GC, such as a specific key-value store that is better suited to managing reference counts | ||
There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. I know we'll have folks from the Stewards team take a look at the design but are there others at PL or in the PL community that have expertise in "GC in practice"? Is there a channel we can post to see? For example, if there is anyone in our midst who worked on tuning GC at Sun/Oracle, they likely have valuable high-level guidance on doing efficient GC in practice. There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. I have reached out to some people already, and names can be discovered by looking at previous work here: ipfs/kubo#7752 I think reposting the design on the ipfs slack channel, and inviting people to early reviews of PRs will be good. There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. Do share the design doc in our internal #engineering channel. |
||
|
||
### Dependencies/prerequisites | ||
Desirable, but not required: Possible MFS bug fix for reported loss of MFS root | ||
|
||
### Future opportunities | ||
- Incremental GC: Remove some unpinned blocks as needed to maintain storage limits. | ||
- Choice of, or flexible, data retention configuration. | ||
|
||
## Required resources | ||
|
||
### Effort estimate | ||
1-2 people 4-6 weeks | ||
|
||
### Roles / skills needed | ||
Go + IPFS expertice |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I think more anecdotes/justification from the conversation in https://protocollabs.slack.com/archives/G01KZD3FETY/p1613779249428300 can be added. The transitive impact that this has on NFTs is important as we don't want to hamper that usecase.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
This link did not work for me, but I added it to related discussions below
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
This link did not work for me, but I added it to the "Related discussions" section anyway