Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Implement Path type #5635

Merged
merged 2 commits into from
Mar 29, 2019
Merged

Conversation

straight-shoota
Copy link
Member

@straight-shoota straight-shoota commented Jan 24, 2018

This is a prototype of a Path type encapsulating filesystem paths to have a basis for further discussion in #5550

For now I have only refactored HTTP::StaticFileHandler to use the Path type to see how it can be used.

@@ -537,16 +537,16 @@ describe "File" do
end

it "expand_path for commoms unix path give a full path" do
File.expand_path("/tmp/").should eq("/tmp")
File.expand_path("/tmp/").should eq("/tmp/")
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Surely all the specs on file that deal with pure paths should be moved to path_spec. We needn't spec the Path wrapper and Path.

Copy link
Member Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I wanted to leave it in until it gets completely replaced. This way it's easy to see that the behaviour is mostly the same except for a few details.

spec/std/path_spec.cr Outdated Show resolved Hide resolved
spec/std/path_spec.cr Outdated Show resolved Hide resolved
spec/std/path_spec.cr Outdated Show resolved Hide resolved
spec/std/path_spec.cr Outdated Show resolved Hide resolved
src/path.cr Outdated Show resolved Hide resolved
src/path.cr Outdated Show resolved Hide resolved
src/path.cr Outdated Show resolved Hide resolved
src/path.cr Outdated Show resolved Hide resolved
end
end

# Returns the root path component of this path or `nil` if it is not rooted.
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

What's the usecase for this?

Copy link
Member Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I'm not entirely sure myself. But both Java and Python API's have this, so I'd guess it's probably useful for something... internally it's used by #anchor.

@straight-shoota straight-shoota force-pushed the jm/feature/path-type branch 3 times, most recently from 5297198 to 3a9e01b Compare January 24, 2018 18:34
@straight-shoota
Copy link
Member Author

Something is really strange with the compiler specs using Path...

The support/syntax helper includes Crystal into the top level namespace, so all over the specs Path is used to reference Crystal::Path. Now, the top level type ::Path overwrites that and of the specs no longer compile because types mismatch.

With 5ff295d I tried to replace Path with qualified Crystal::Path to use the correct type.
There are however some strange issues with some specs, for example spec/compiler/macro/macro_methods_spec.cr:1209:

assert_macro "x", %({{x.inputs}}), [ProcNotation.new([Crystal::Path.new("SomeType")] of ASTNode, Crystal::Path.new("SomeResult"))] of ASTNode, "[SomeType]"

This line produces a syntax error: expecting identifier 'class', not 'new'.

Previously it worked with Path.new. I can't really see what's the issue here... ist there any special syntax related to that example?

@asterite
Copy link
Member

@straight-shoota

[] of X, Y -> Z

That's an array of procs.

Problem is, if you write

foo x, [] of X, Y

then compiler will think X, Y... refers to the type of the upcoming block, which never comes.

Solution: use parentheses:

foo x, ([] of X), Y

@straight-shoota
Copy link
Member Author

But why is this an issue? The change just replaces Path with Crystal::Path...

@Sija
Copy link
Contributor

Sija commented Jan 24, 2018

I'd suggest changing name to Pathname as in Ruby since it clearly tells you the purpose of this Class, Path is IMO way too generic (hence collision with Crystal::Path).

@RX14
Copy link
Contributor

RX14 commented Jan 25, 2018

Pathname is just ugly... And its longer for what is hopefully a very well used class.

@asterite
Copy link
Member

We just need not to include Crystal in our specs.

The language is ugly regarding types, there are no nice "imports" and no way to prevent types leaking all over the global namespace. We just have to accept that and move on.

@bew
Copy link
Contributor

bew commented Jan 25, 2018

@asterite I think it's ok to include Crystal only for compiler specs, I don't think it's useful anywhere else.
For this we'd need to compile specs one category at a time (std, then compiler, then integration, etc..). I think this has also other advantages other than better controlling global namespace pollution, like less memory needed to build&run specs.

@asterite
Copy link
Member

@bew problem is, Path will be required by prelude (it's used in File, which is required by prelude), so including Crystal will create a conflict with Path.

@straight-shoota
Copy link
Member Author

Can't we just put the specs in Crystal namespace? That way Path would point to Crystal::Path (ASTNode) and Path (file system path) could by accessed as ::Path.

@RX14
Copy link
Contributor

RX14 commented Jan 25, 2018

I still don't understand why the parsing changes based on the addition of a qualifying name space.

You can have imports using private alias at the top level but you're always going to have a prefix.

I'm fine with putting the specs in a module.

@asterite
Copy link
Member

Now I don't think the parsing changes or has anything to do with what I said above. I'll checkout this branch and investigate.

@asterite
Copy link
Member

Found it. If you do:

foo([] of T, Y.new)

it parses "well" (Y.new) is a separate argument. The compiler will try to check if what comes after a comma can be a type, but because .new follows it's not.

However, with:

foo([] of T, X::Y.new)

the compiler fails to see this, because of the ::. We are just missing that rule in the parser.

Copy link
Contributor

@RX14 RX14 left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I think this is pretty good on the API design side. I have lots of comments on the implementation but i'd rather @asterite, @ysbaddaden etc weighed in on the API first.

src/http/server/handlers/static_file_handler.cr Outdated Show resolved Hide resolved
src/path.cr Show resolved Hide resolved
src/path.cr Outdated Show resolved Hide resolved
@RX14
Copy link
Contributor

RX14 commented Apr 12, 2018

Would be good to get some more design opinions on this PR from the core team. I'm not fond of putting so much behaviour on the Type enum...

@asterite
Copy link
Member

The more types we add to the std that's loaded by default (and Path will be one such type), the longer it takes to compile an empty file. So until we resolve that (and I'm more and more inclined to change the language for good) I can't comment on this. Well, my initial reaction is that I needed needed a Path type. I do need to split a path, or get the extension, or the base name, but all of that already exists as File class methods. Maybe they could be moved to a Path type, but meh.

@RX14
Copy link
Contributor

RX14 commented Apr 12, 2018

but all of that already exists as File class methods

The obvious response to that is that path operations tend to be composed quite often, and providing a path type makes composing these operations a lot easier because you don't have to specify File. every time.

I'm not particularly sympathetic to the compile time argument - I think that Path is one of the few things missing from the corelib, and after that is there really anything left to add?

Things like deque and set belong in the corelib less than path, but they're required by other parts of the corelib.

@asterite
Copy link
Member

because you don't have to specify File. every time

Having to type less is never a good argument in my opinion (or at least I also thought like that, but not anymore)

I think that Path is one of the few things missing from the corelib, and after that is there really anything left to add?

  • A precise GC
  • Multithread support (whatever that is), with operations to simplify coding (work groups, sync operations, etc.)
  • printf is not yet good/complete in my opinion
  • I also always wanted to have scanf (I did a quick proof of concept in the past and it worked, though I think it won't work anymore with how I reimagine the language)
  • And possibly other things I might be forgetting

@RX14
Copy link
Contributor

RX14 commented Apr 12, 2018

Having to type less is never a good argument in my opinion

No, having to type less is never a good reason to sacrifice readability or understandability (i.e. adding too much magic), but here we do neither. The only thing we sacrifice for having to type less is adding a new class to the stdlib. Considering the current File primitives are demonstrably woefully inadequate on windows, and we'd have to basically rewrite them all anyway (see file_spec.cr in #5623), why not migrate to using Path at the same time.

A precise GC

will be written in C, rust (unlikely), not crystal. It's a GC'ed language itself - it's not suited for writing a GC.

Multithread support (whatever that is), with operations to simplify coding (work groups, sync operations, etc.)

fair enough, although I feel a lot of the higher-level concurrency abstractions can be in the stdlib not the corelib

printf is not yet good/complete in my opinion

That's more code, but not really a whole new class. If the code is only string ops and maths, then it will not be instantiated more than once, and will contribute much less to compile times.

I also always wanted to have scanf (I did a quick proof of concept in the past and it worked, though I think it won't work anymore with how I reimagine the language)

I don't think this is a good idea, writing lightweight parsers by hand is super easy in crystal, usually more robust, and generate better errors.

@asterite
Copy link
Member

will be written in C, rust (unlikely), not crystal. It's a GC'ed language itself - it's not suited for writing a GC.

Go's GC is written in Go ;-)

@RX14
Copy link
Contributor

RX14 commented Apr 12, 2018

Go also doesn't have numerous places where hidden allocations occur. Even ones you wouldn't think about like .as.

@straight-shoota
Copy link
Member Author

straight-shoota commented Apr 12, 2018

Can we please keep this thread clear of discussions about GC implementations and other Stdlib features? The gist is: this adds an additional type to corelib. We're also pretty sure that there will be no or only very few additions to that in the future. Having the class makes many things easier to write, but on the other hand adds an additional type to the inference algorithm for every program.
That's a trade-off to consider.

@RX14 I agree that having the behaviour defined on the enum is not great. It can easily be moved to private methods on Path, maybe with just a little bit more to type. Anyway, that's just an implementation detail.

@RX14 RX14 removed this from the 0.28.0 milestone Dec 4, 2018
@RX14
Copy link
Contributor

RX14 commented Dec 4, 2018

There should be no breaking changes here so this should be fine for 0.27.1

@straight-shoota
Copy link
Member Author

There is technically a minor breaking change in File.expand_path which now preserves trailing slashes. If that's a hold back we could just revert the refactoring of that specific method - or all File methods for that matter. The focus should be on getting Path in the stdlib.

@asterite asterite mentioned this pull request Dec 7, 2018
@RX14
Copy link
Contributor

RX14 commented Dec 15, 2018

@bcardiff this is still blocking a lot of Windows specs, i'd love it reviewed and merged in the next 2 weeks so I can resume working on Windows over Christmas.

@sudo-nice
Copy link

Any chance this PR will make it into 0.27.1?

@straight-shoota
Copy link
Member Author

No, there are breaking changes to File class methods. It's already milestoned for 0.28.0 and needs a review from @bcardiff
I've rebased and squashed on current master.

@sudo-nice
Copy link

@straight-shoota Thank you.

src/path.cr Outdated Show resolved Hide resolved
src/path.cr Outdated Show resolved Hide resolved
@straight-shoota
Copy link
Member Author

Thanks, @r00ster91. Fixed typos and rebased on master.

This introduces a new Path type representing a filesystem path.
ord = @name.compare(other.@name, case_insensitive: windows?)
return ord if ord != 0

@kind <=> other.@kind
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I find this strange, but we can probably fix it later. Why a POSIX path is less than a WINDOWS path?

(and if that makes sense... shouldn't it be the other way around? 😛)

Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Also the PartialComparable is no longer needed due to latest changes in Comparable. But that can be changed later.

Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Hm, you are right! I can fix that in a PR after this one was merged (also removing PartialComparable).

Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

removing deprecating 🙈

Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Yes, though it isn't used anywhere else according to GitHub: https://github.com/search?q=language%3Acrystal+PartialComparable&type=Code

Copy link
Member Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Regarding the original question: Two paths shouldn't be treated as equal if they are of different kinds, because that means different semantics. They shouldn't compare equal either, so this seems like a legitimate solution.
Could alternatively return nil of course when comparing different path types.

Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Returning nil sounds good to me

Copy link
Member

@asterite asterite left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Let's do this! Thank you everyone for the incredible commitment and patience on this one! 🙇

@bcardiff bcardiff merged commit 653a2a7 into crystal-lang:master Mar 29, 2019
@straight-shoota
Copy link
Member Author

Yeah, can't believe i's been over a year... 🤣

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

Successfully merging this pull request may close these issues.