Skip to content
This repository has been archived by the owner on Mar 18, 2024. It is now read-only.

Finding examples of openly accessible images #3

Open
jlstevens opened this issue Apr 25, 2018 · 20 comments
Open

Finding examples of openly accessible images #3

jlstevens opened this issue Apr 25, 2018 · 20 comments
Milestone

Comments

@jlstevens
Copy link
Contributor

The original version of landsat_spectral_clustering.ipynb used redding.tif which was obtained originally from planet.com. As I wasn't sure whether this image could be made available, I updated the notebook to use a landsat example taken from a datashader example.

@ebo then informed me by e-mail that this image is not really suitable as it only has two bands. We would like some example images that can be made public that also have a decent number of bands. This is important as we will then be able to compute the various indices that we might then want to learn on.

One suggestion by @ebo was to use these images of a disappearing lake although I only see a link to download them in JPG format?

Lastly, a new notebook has been committed referencing 'Midwest_Mosaic.tif' which I don't think we have discussed yet. Is this something we could slice down and add to the repo as an example?

@jbednar
Copy link
Collaborator

jbednar commented Apr 25, 2018

I'm not sure what you mean about this image having only two bands; it has 11 bands if you count panchromatic:

http://datashader.org/topics/landsat.html

@jbednar
Copy link
Collaborator

jbednar commented Apr 25, 2018

Oh, I think I see the confusion: The information from the different bands is in different files. You need all the files, if you want all the bands. So I don't think we actually need a new example here, you just have to read all the different bands in, from different files.

@jlstevens jlstevens changed the title Finding openly accessible images with sufficient bands Finding examples of openly accessible images Apr 25, 2018
@jlstevens
Copy link
Contributor Author

As the current example does have sufficient bands (across the relevant files) I've renamed this issue to reflect that we would still like a variety of good, publicly accessible image examples regardless.

@ebo
Copy link
Collaborator

ebo commented Apr 25, 2018 via email

@ebo
Copy link
Collaborator

ebo commented Apr 25, 2018 via email

@jlstevens
Copy link
Contributor Author

TIFFTAG_IMAGEDESCRIPTION=Created with GIMP

I wonder if that is what is responsible. It seemed to be the quickest way to slice the image but I guess it might not have preserved the bands.

@jbednar
Copy link
Collaborator

jbednar commented Apr 25, 2018

Right. @jlstevens, please rename the example file to indicate which original image you started with; the filename indicates which bands it was. And then we could consider stacking the various images into a single merged image, but the separate images are how the files were provided from LANDSAT.

@ebo
Copy link
Collaborator

ebo commented Apr 25, 2018 via email

@jlstevens
Copy link
Contributor Author

The 90% loss of Walker Lake water volume
is not only timely, but people people care about the background story.
If it is used for a tutorial on end-to-end how you do this stuff, then
we can demonstrate what actually happens when you work with these images
for real.

I agree that if we can tell a compelling story with the data we have then we should.

@jbednar
Copy link
Collaborator

jbednar commented Apr 25, 2018

Sounds good. It will be great to have additional examples showing other topics, and it will be great to keep the overall number of data files that people have to download and that we have to document low. But it's not the number of bands that would invalidate this example (as in Jean-Luc's original title), just that some other example may subsume it for other reasons. When that happens, great!

@ebo
Copy link
Collaborator

ebo commented Apr 25, 2018 via email

@jbednar
Copy link
Collaborator

jbednar commented Apr 25, 2018

There are some high-resolution CC BY-SA-licensed examples at https://info.planet.com/download-free-high-resolution-skysat-image-samples/, though none of them cover the same region of the earth at different times that I can see.

@jbednar
Copy link
Collaborator

jbednar commented Apr 25, 2018

In case it helps, the original files from the current example are available from: https://github.com/bokeh/datashader/blob/master/examples/datasets.yml#L36
and shouldn't have any of the problems from the Gimp-processed version. But if there are better examples that can be used to tell more stories, then bring them on! :-)

@ebo
Copy link
Collaborator

ebo commented Apr 25, 2018 via email

@jbednar
Copy link
Collaborator

jbednar commented Apr 25, 2018

Sounds good! No, there wasn't a compelling story about the original image, though there was intended to be. :-) It was originally chosen to try to show the differences between the actual coastline of Southern Louisiana around New Orleans and what standard maps show as the outline of Louisiana, but the wrong image was selected, and so it ended up being the wrong bit of coastline, not telling that story at all. So there's no problem replacing it with something that has a better story.

@mrocklin
Copy link
Collaborator

One other problem I stumbled on to is the ordering of the data -- is
this (bands, height, width) or (height, width, bands) or (height, bands,
width)... I think it is rasterio that has a helper function which
reorders the bands from the stacked 2D images to 3D image ordering.
Thinking about this a little I realized that I rarely have control over
how the data come to me, and this would suggest that a general
reordering/mapping method may be in order.

Can I ask you to expand on this a bit further? I suspect that Numpy and XArray both have mechanisms to help you here. In particular you might want to look into parts of their respective APIs that include stack and transpose functions depending on what you want.

@ebo
Copy link
Collaborator

ebo commented Apr 25, 2018 via email

@ebo
Copy link
Collaborator

ebo commented Apr 25, 2018 via email

@ebo
Copy link
Collaborator

ebo commented Dec 22, 2018

BTW, the author of the Nature Geosciences article sent me the data to replicate several of the published graphs. I have a number of other things on my plate at the moment, but if we extend the walker lake to either include the Great Salt Lake, or as a second notebook. We should be able to replicate the work in https://www.fs.fed.us/rm/pubs_journals/2017/rmrs_2017_wurtsbaugh_w001.pdf. Also, I have permission to publicly release the data with the agreement to properly cite and credit the work. I would also want to have them review this before formally releasing it if possible to make sure I/we do not make a mistake that would offend.

@jbednar jbednar added this to the wishlist milestone Oct 12, 2020
@jbednar
Copy link
Collaborator

jbednar commented Oct 12, 2020

A recently announced source of freely available hi-res imagery:

https://twitter.com/planetlabs/status/1308768058098450437

Sign up for free to subscribe to this conversation on GitHub. Already have an account? Sign in.
Labels
None yet
Projects
None yet
Development

No branches or pull requests

4 participants