Reorganization of Code Generation #6

Keno · 2013-06-08T04:33:58Z

This reorganizes code generation and moves part of it to metaprogramming on load time, which apart from improving load times (at least on my machines) also has the added benefit of allowing us to easily switch to a different organizational structure if dynamic generation of types proves unsuitable for this task (since this package alone has more types than all of Base). Anyway, I tried to make the result as close as possible to the original and will send functional changes in a different pull request.

This also dependes on my previous two pull requests. If those are merged before this, I will rebase this branch as appropriate.

WARNING: I'm currently hunting a memory bug in Julia triggered by this. After that this should be good to merge, but I figured I'd put it up before.

amitmurthy · 2013-06-10T06:06:26Z

On my machine:

Julia startup time:

amitm@amitm-ThinkPad-T61:~/Work/julia/AWS.jl$ time julia -e "" 

real    0m8.059s
user    0m7.844s
sys 0m0.428s

With your patch

amitm@amitm-ThinkPad-T61:~/Work/julia/AWS.jl$ time julia -e "using AWS.EC2" 

real    0m38.923s
user    0m38.424s
sys 0m0.624s

Without your patch

amitm@amitm-ThinkPad-T61:~/Work/julia/AWS.jl$ time julia -e "using AWS.EC2" 

real    0m48.652s
user    0m48.256s
sys 0m0.536s

While the 25% gain is good, I think your patch will make it impossible for someone to refer to the source for types/member names/APIs.

What are your thoughts?

amitmurthy · 2013-06-10T06:45:21Z

Also, I don't understand where the speedup is coming from... since in both cases all types and methods are being defined at startup. Is it just the .jl file parsing time?

amitmurthy · 2013-06-10T08:54:42Z

A simpler way to reduce load time may be to split EC2 into separate packages by functionality. S3 also goes into its own package. and all of them have a dependency on a core AWS package which only has basic types and the actual request execution code. This way the user can only include the functionality required.

amitmurthy · 2013-06-10T10:15:51Z

On my 64-bit machine, both times (with/without patch) is identical at 10.2 seconds (of which 1.7 seconds is Julia startup time).

Also, sooner or later, we will have support for precompiled packages.

Keno · 2013-06-10T13:56:18Z

The main thought here is not startup time (though it's a nice benefit), but added flexibility and simplicity in code generation. While I see your argument about referring to the code for API, I think having 11000 lines of almost identical code is a terrible idea. Another thing I want to point out is that people really shouldn't be using the raw EC2 API. We need to figure out the right abstraction and provide that API and if people really really want to, they can use this API, but in that case they should refer to the EC2 API Documentation.

amitmurthy · 2013-06-10T16:02:29Z

Since all 11000 lines are generated, I don't see it as a big issue. Macros just hide the process.

Trust me, people will be using the raw EC2 API and tailoring it to their needs. Which is why I suggested people use test/ec2_utils.jl as a template and customize it for their needs. It is a little difficult to come up with a good abstraction at this stage. Maybe when we have a few more users. For example, lots of public datasets are available as ebs snapshots on ec2. We were discussing that folks would probably want to (programmatically), create a volume from the snapshot, make n copies of the volume, start n instances, mount volume on each instance, fire a parallel julia job, terminate the instances and release the copied volumes. This kind of stuff will be common enough that we can probably provide a few templates initially instead of a higher level api.

While the EC2 API documentation will be the only real guide for this package, it is very useful to see the types and API while writing code.

We can probably a) generate the types and apis only in a separate .jl file that is not included in the main package - it is just a reference file, never really loaded and available in a docs directory - will not have the XML parsing calls and Union(T, Nothing) just T - or b) generate the types as comments in the main file.

Anyways, send me a rebased pull request when you are ready and I'll test it out with the limited test script that I have.

amitmurthy · 2013-06-10T16:04:14Z

Have added you as a collaborator. You can create branches here itself.

ViralBShah · 2013-06-10T19:26:55Z

I believe there are two classes of users - those end users who want to offload compute to ec2 and those who want to do robust deployments on ec2. For the first class, we need a very simple interface that hides the complexity of ec2. The second class can use the current detailed interface.

Since the underlying interface is auto generated, perhaps we should be focussing on the user facing simple interface.

ViralBShah · 2013-06-10T19:28:49Z

Btw, I disagree about providing templates. I would prefer to have an evolving api over templates.

Keno · 2013-06-10T19:31:47Z

I agree with Viral about not having templates.

amitmurthy · 2013-06-11T02:55:35Z

OK. We can try building a higher level interface with test/ec2_utils.jl as a starting point. I would suggest however in keeping it very simple so that this interface can become an interface to multiple cloud platforms in the future - specifically OpenStack and Azure.

staticfloat · 2014-09-17T20:47:04Z

Is this issue relevant anymore?

Keno added 3 commits June 7, 2013 16:49

Add pretty printing for EC2 types

cff8cef

Fix EC2 Requests for Endpoints that are not TLDs

13d25c1

WIP: Reorganization

674618f

amitmurthy mentioned this pull request Sep 26, 2013

Improving package load times JuliaLang/julia#4373

Closed

amitmurthy closed this Sep 21, 2015

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Reorganization of Code Generation #6

Reorganization of Code Generation #6

Keno commented Jun 8, 2013

amitmurthy commented Jun 10, 2013

amitmurthy commented Jun 10, 2013

amitmurthy commented Jun 10, 2013

amitmurthy commented Jun 10, 2013

Keno commented Jun 10, 2013

amitmurthy commented Jun 10, 2013

amitmurthy commented Jun 10, 2013

ViralBShah commented Jun 10, 2013

ViralBShah commented Jun 10, 2013

Keno commented Jun 10, 2013

amitmurthy commented Jun 11, 2013

staticfloat commented Sep 17, 2014

Reorganization of Code Generation #6

Reorganization of Code Generation #6

Conversation

Keno commented Jun 8, 2013

amitmurthy commented Jun 10, 2013

amitmurthy commented Jun 10, 2013

amitmurthy commented Jun 10, 2013

amitmurthy commented Jun 10, 2013

Keno commented Jun 10, 2013

amitmurthy commented Jun 10, 2013

amitmurthy commented Jun 10, 2013

ViralBShah commented Jun 10, 2013

ViralBShah commented Jun 10, 2013

Keno commented Jun 10, 2013

amitmurthy commented Jun 11, 2013

staticfloat commented Sep 17, 2014