Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

QUESTION: Read from Stream #8

Open
stugorf opened this issue Feb 28, 2019 · 3 comments
Open

QUESTION: Read from Stream #8

stugorf opened this issue Feb 28, 2019 · 3 comments

Comments

@stugorf
Copy link

stugorf commented Feb 28, 2019

I want to read an Excel file from an AWS S3 bucket without downloading it. This is how I normally read a file from a stream. Can ExcelReaders.jl read from a stream?

file_content = S3.get_object(aws, get_config);
load(file_content, "Sheet1") |> DataFrame

Thank you.

@davidanthoff
Copy link
Member

I'm afraid not. I don't think the underlying Python xlrd library that we use supports it...

We are working on a new package that uses the libxls library to read legacy excel files, and I think that package actually supports parsing from in-memory buffers...

I'm pretty sure we won't add this ability to ExcelReaders.jl, so I'm just going go move this issue over to LibXls.jl, and maybe we can sort it out there eventually.

@davidanthoff davidanthoff transferred this issue from queryverse/ExcelReaders.jl Feb 28, 2019
@stugorf
Copy link
Author

stugorf commented Mar 1, 2019

Thank you. I'll keep an out out for a release of LibXls.jl

@stugorf
Copy link
Author

stugorf commented Mar 1, 2019

Being able to do something like this would be great when working with Excel from AWS S3:

Read from AWS S3 file stream
df = load(IOBuffer(file_content)) |> DataFrame;

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants