Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[SPARK-6122][Core] Upgrade Tachyon client version to 0.6.1. #4867

Closed
wants to merge 3 commits into from

Conversation

calvinjia
Copy link
Contributor

Changes the Tachyon client version from 0.5 to 0.6 in spark core and distribution script.

New dependencies in Tachyon 0.6.0 include

commons-codec:commons-codec:jar:1.5:compile
io.netty:netty-all:jar:4.0.23.Final:compile

These are already in spark core.

@AmplabJenkins
Copy link

Can one of the admins verify this patch?

@haoyuan
Copy link
Contributor

haoyuan commented Mar 3, 2015

@aarondav @pwendell

@calvinjia calvinjia closed this Mar 3, 2015
@zzcclp
Copy link
Contributor

zzcclp commented Mar 4, 2015

why to close this RP?

@calvinjia
Copy link
Contributor Author

@zzcclp Tachyon client has a new api so I will update the relevant spark code and test before reopening this.

@calvinjia calvinjia reopened this Mar 4, 2015
@AmplabJenkins
Copy link

Can one of the admins verify this patch?

@calvinjia
Copy link
Contributor Author

@aarondav @pwendell
I've updated the client interface and tested it with basic count/wordcount and off heap storage.

@haoyuan
Copy link
Contributor

haoyuan commented Mar 6, 2015

Ok to test.

@haoyuan
Copy link
Contributor

haoyuan commented Mar 6, 2015

jenkins, test this please

@haoyuan
Copy link
Contributor

haoyuan commented Mar 6, 2015

Jenkins, test this please

@SparkQA
Copy link

SparkQA commented Mar 6, 2015

Test build #28323 has started for PR 4867 at commit 11907b3.

  • This patch merges cleanly.

@SparkQA
Copy link

SparkQA commented Mar 6, 2015

Test build #28323 has finished for PR 4867 at commit 11907b3.

  • This patch passes all tests.
  • This patch merges cleanly.
  • This patch adds no public classes.

@AmplabJenkins
Copy link

Test PASSed.
Refer to this link for build results (access rights to CI server needed):
https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/28323/
Test PASSed.

@calvinjia
Copy link
Contributor Author

@haoyuan @pwendell @aarondav

I've updated the version to the recently released 0.6.1. The dependency info is the same and the api has not changed.

@calvinjia calvinjia changed the title [SPARK-6122][Core] Upgrade Tachyon client version to 0.6.0. [SPARK-6122][Core] Upgrade Tachyon client version to 0.6.1. Mar 18, 2015
@aarondav
Copy link
Contributor

LGTM.

@pwendell Since this touches Spark's dependencies, would you like to sign off?

@pwendell
Copy link
Contributor

LGTM

@aarondav
Copy link
Contributor

Merged into master, thanks!

@asfgit asfgit closed this in a41b9c6 Mar 22, 2015
@JoshRosen
Copy link
Contributor

The master SBT build is currently broken for a few Hadoop profiles due to dependency issues. Do you think that this patch may have been responsible? I noticed that it wasn't tested by Jenkins prior to being merged (the last test was 18 days ago with an earlier version of the patch). See https://amplab.cs.berkeley.edu/jenkins/job/Spark-Master-SBT/1940/

@calvinjia
Copy link
Contributor Author

@JoshRosen Just from a quick glance at the output log, it seems to be a style issue (line > 100 characters). I don't think this patch should have caused the issues, since the errors have been the same as the ones since build #1937.

I think the checkstyle issues were introduced here?
ab4f516

@srowen
Copy link
Member

srowen commented Mar 23, 2015

@calvinjia yup this was fall-out from my mistake yesterday at https://github.com/apache/spark/pull/5128/files It's already patched thanks to Reynold, so that should be fine now.

@calvinjia
Copy link
Contributor Author

@srowen Oh I see the build still failed after the fix to that patch. Strange that there would be issues between now and the last test run for this patch since there should not have been any dependency changes.

@JoshRosen
Copy link
Contributor

Patrick reverted this patch in 6cd7058 and it appears that reverting it has fixed the builds: https://amplab.cs.berkeley.edu/jenkins/job/Spark-Master-SBT/1947/

@aarondav
Copy link
Contributor

The build only broke when compiled against other versions of Hadoop, so
it's likely that some of the dependency changes overwrote the deps of other
Hadoop versions.

On Mon, Mar 23, 2015 at 4:34 PM, Josh Rosen [email protected]
wrote:

Patrick reverted this patch in 6cd7058
6cd7058
and it appears that reverting it has fixed the builds:
https://amplab.cs.berkeley.edu/jenkins/job/Spark-Master-SBT/1947/


Reply to this email directly or view it on GitHub
#4867 (comment).

@syepes
Copy link

syepes commented Apr 2, 2015

is there any change this PR will get merged?

@srowen
Copy link
Member

srowen commented Apr 2, 2015

@syepes I think the problem would have to be fixed no? you can try applying the patch an examining what changes about dependencies in the builds that broke. Narrowing it down that much may help everyone understand how to fix it.

@calvinjia
Copy link
Contributor Author

@srowen @syepes I'm planning to re-open this when Tachyon 0.6.2 is available. There were two problems encountered in the tests:

  1. HTTPClient version mismatch: Selenium (used for UI tests) requires version 4.3.x, and Tachyon included 4.2.5 through a transitive dependency of its shaded thrift jar. To address this, Tachyon 0.6.2 will promote the transitive dependencies of the shaded jar so they can be excluded in spark.
  2. Jackson-Mapper-ASL version mismatch: In lower versions of hadoop-client (ie. 1.0.4), version 1.0.1 is included. The parquet library used in spark sql requires version 1.8+. Its unclear to me why upgrading tachyon-client would cause this dependency to break. The solution was to exclude jackson-mapper-asl from hadoop-client. If there is a better way to handle this issue, please let me know.

asfgit pushed a commit that referenced this pull request Apr 24, 2015
This is a reopening of #4867.
A short summary of the issues resolved from the previous PR:

1. HTTPClient version mismatch: Selenium (used for UI tests) requires version 4.3.x, and Tachyon included 4.2.5 through a transitive dependency of its shaded thrift jar. To address this, Tachyon 0.6.3 will promote the transitive dependencies of the shaded jar so they can be excluded in spark.

2. Jackson-Mapper-ASL version mismatch: In lower versions of hadoop-client (ie. 1.0.4), version 1.0.1 is included. The parquet library used in spark sql requires version 1.8+. Its unclear to me why upgrading tachyon-client would cause this dependency to break. The solution was to exclude jackson-mapper-asl from hadoop-client.

It seems that the dependency management in spark-parent will not work on transitive dependencies, one way to make sure jackson-mapper-asl is included with the correct version is to add it as a top level dependency. The best solution would be to exclude the dependency in the modules which require a higher version, but that did not fix the unit tests. Any suggestions on the best way to solve this would be appreciated!

Author: Calvin Jia <[email protected]>

Closes #5354 from calvinjia/upgrade_tachyon_0.6.3 and squashes the following commits:

0eefe4d [Calvin Jia] Handle httpclient version in maven dependency management. Remove httpclient version setting from profiles.
7c00dfa [Calvin Jia] Set httpclient version to 4.3.2 for selenium. Specify version of httpclient for sql/hive (previously 4.2.5 transitive dependency of libthrift).
9263097 [Calvin Jia] Merge master to test latest changes
dbfc1bd [Calvin Jia] Use Tachyon 0.6.4 for cleaner dependencies.
e2ff80a [Calvin Jia] Exclude the jetty and curator promoted dependencies from tachyon-client.
a3a29da [Calvin Jia] Update tachyon-client exclusions.
0ae6c97 [Calvin Jia] Change tachyon version to 0.6.3
a204df9 [Calvin Jia] Update make distribution tachyon version.
a93c94f [Calvin Jia] Exclude jackson-mapper-asl from hadoop client since it has a lower version than spark's expected version.
a8a923c [Calvin Jia] Exclude httpcomponents from Tachyon
910fabd [Calvin Jia] Update to master
eed9230 [Calvin Jia] Update tachyon version to 0.6.1.
11907b3 [Calvin Jia] Use TachyonURI for tachyon paths instead of strings.
71bf441 [Calvin Jia] Upgrade Tachyon client version to 0.6.0.
jeanlyn pushed a commit to jeanlyn/spark that referenced this pull request May 14, 2015
This is a reopening of apache#4867.
A short summary of the issues resolved from the previous PR:

1. HTTPClient version mismatch: Selenium (used for UI tests) requires version 4.3.x, and Tachyon included 4.2.5 through a transitive dependency of its shaded thrift jar. To address this, Tachyon 0.6.3 will promote the transitive dependencies of the shaded jar so they can be excluded in spark.

2. Jackson-Mapper-ASL version mismatch: In lower versions of hadoop-client (ie. 1.0.4), version 1.0.1 is included. The parquet library used in spark sql requires version 1.8+. Its unclear to me why upgrading tachyon-client would cause this dependency to break. The solution was to exclude jackson-mapper-asl from hadoop-client.

It seems that the dependency management in spark-parent will not work on transitive dependencies, one way to make sure jackson-mapper-asl is included with the correct version is to add it as a top level dependency. The best solution would be to exclude the dependency in the modules which require a higher version, but that did not fix the unit tests. Any suggestions on the best way to solve this would be appreciated!

Author: Calvin Jia <[email protected]>

Closes apache#5354 from calvinjia/upgrade_tachyon_0.6.3 and squashes the following commits:

0eefe4d [Calvin Jia] Handle httpclient version in maven dependency management. Remove httpclient version setting from profiles.
7c00dfa [Calvin Jia] Set httpclient version to 4.3.2 for selenium. Specify version of httpclient for sql/hive (previously 4.2.5 transitive dependency of libthrift).
9263097 [Calvin Jia] Merge master to test latest changes
dbfc1bd [Calvin Jia] Use Tachyon 0.6.4 for cleaner dependencies.
e2ff80a [Calvin Jia] Exclude the jetty and curator promoted dependencies from tachyon-client.
a3a29da [Calvin Jia] Update tachyon-client exclusions.
0ae6c97 [Calvin Jia] Change tachyon version to 0.6.3
a204df9 [Calvin Jia] Update make distribution tachyon version.
a93c94f [Calvin Jia] Exclude jackson-mapper-asl from hadoop client since it has a lower version than spark's expected version.
a8a923c [Calvin Jia] Exclude httpcomponents from Tachyon
910fabd [Calvin Jia] Update to master
eed9230 [Calvin Jia] Update tachyon version to 0.6.1.
11907b3 [Calvin Jia] Use TachyonURI for tachyon paths instead of strings.
71bf441 [Calvin Jia] Upgrade Tachyon client version to 0.6.0.
nemccarthy pushed a commit to nemccarthy/spark that referenced this pull request Jun 19, 2015
This is a reopening of apache#4867.
A short summary of the issues resolved from the previous PR:

1. HTTPClient version mismatch: Selenium (used for UI tests) requires version 4.3.x, and Tachyon included 4.2.5 through a transitive dependency of its shaded thrift jar. To address this, Tachyon 0.6.3 will promote the transitive dependencies of the shaded jar so they can be excluded in spark.

2. Jackson-Mapper-ASL version mismatch: In lower versions of hadoop-client (ie. 1.0.4), version 1.0.1 is included. The parquet library used in spark sql requires version 1.8+. Its unclear to me why upgrading tachyon-client would cause this dependency to break. The solution was to exclude jackson-mapper-asl from hadoop-client.

It seems that the dependency management in spark-parent will not work on transitive dependencies, one way to make sure jackson-mapper-asl is included with the correct version is to add it as a top level dependency. The best solution would be to exclude the dependency in the modules which require a higher version, but that did not fix the unit tests. Any suggestions on the best way to solve this would be appreciated!

Author: Calvin Jia <[email protected]>

Closes apache#5354 from calvinjia/upgrade_tachyon_0.6.3 and squashes the following commits:

0eefe4d [Calvin Jia] Handle httpclient version in maven dependency management. Remove httpclient version setting from profiles.
7c00dfa [Calvin Jia] Set httpclient version to 4.3.2 for selenium. Specify version of httpclient for sql/hive (previously 4.2.5 transitive dependency of libthrift).
9263097 [Calvin Jia] Merge master to test latest changes
dbfc1bd [Calvin Jia] Use Tachyon 0.6.4 for cleaner dependencies.
e2ff80a [Calvin Jia] Exclude the jetty and curator promoted dependencies from tachyon-client.
a3a29da [Calvin Jia] Update tachyon-client exclusions.
0ae6c97 [Calvin Jia] Change tachyon version to 0.6.3
a204df9 [Calvin Jia] Update make distribution tachyon version.
a93c94f [Calvin Jia] Exclude jackson-mapper-asl from hadoop client since it has a lower version than spark's expected version.
a8a923c [Calvin Jia] Exclude httpcomponents from Tachyon
910fabd [Calvin Jia] Update to master
eed9230 [Calvin Jia] Update tachyon version to 0.6.1.
11907b3 [Calvin Jia] Use TachyonURI for tachyon paths instead of strings.
71bf441 [Calvin Jia] Upgrade Tachyon client version to 0.6.0.
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

10 participants