-
Notifications
You must be signed in to change notification settings - Fork 1.2k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
gc: remove cache files that were already pushed to remote #2036
Comments
dvc gc
to remove cache files for specific targets
Hi @shaunirwin ! The problem with applying Thanks, |
I think I have possible use case for this request. TL;DR: there should be a way to Detailed use case I have two dirs under dvc control:
At first I worked with
Since I commited and pushed P.S.: I'm not sure that |
My use case: |
In terms of implementation, it would look like this:
That last "used =" is needed to transform str checksums returned by |
@efiop I'd like to take a swing at it :) |
@kaiogu Please let us know if you need any help 🙂 |
Hi, I was on long vacation, but am back now and will start this today. |
@kaiogu Sounds good, let us know if you'll have any questions :) Btw, we have a dev-general channel in our discord, please feel free to join 🙂 |
@efiop, I was thinking of calling it "safe" because it would only clear the cache if there was a backup safe in a remote. I'll let you know if I get stuck :) |
@kaiogu |
|
Another option is to include this into push somehow: dvc push --remove |
Or |
I would expect |
Any updates on this feature? |
@Viktor2k Not actively working on this directly yet 🙁 |
Me and my company would also be interested in this! It seems this feature would be quite useful to be able to use DVC for sharing locally downloaded datasets across multiple projects on remote servers. For this, we need a way to clean up the local cache from currently no longer needed data, though the data is still required to reproduce results in the future (hence they are still referenced in |
@efiop this comment:
and this one:
seem to suggest that you're thinking about removing remote files, not local ones. Isn't that the opposite of the requested feature? Am I misunderstanding something? |
So is there currently no supported command to clear the local cache (keeping files backed up in remote)? And as a workaround could you just delete the cache folder and run |
That all sounds right @mvkvc |
I came here looking for an equivalent of "git annex drop" for dvc. Is manually deleting the cache and the dvc-pulled file really the only option to free disk space? |
It seems the primary blocker here is UI. Let's make a decision so we could move forward. How about |
It would be useful to be able to clear up disk space on one's local machine by removing the cache files of specific targets.
The text was updated successfully, but these errors were encountered: