[SPARK-5592][SQL] java.net.URISyntaxException when insert data to a partitioned table #4368

scwf · 2015-02-04T12:53:09Z

The following SQL throws a URISyntaxException:

create table sc as select * 
from (select '2011-01-11', '2011-01-11+14:18:26' from src tablesample (1 rows)
union all 
select '2011-01-11', '2011-01-11+15:18:26' from src tablesample (1 rows)
union all 
select '2011-01-11', '2011-01-11+16:18:26' from src tablesample (1 rows) ) s;
create table sc_part (key string) partitioned by (ts string) stored as rcfile;
set hive.exec.dynamic.partition=true;
set hive.exec.dynamic.partition.mode=nonstrict;
insert overwrite table sc_part partition(ts) select * from sc;

java.net.URISyntaxException: Relative path in absolute URI: ts=2011-01-11+15:18:26
at org.apache.hadoop.fs.Path.initialize(Path.java:206)
at org.apache.hadoop.fs.Path.<init>(Path.java:172)
at org.apache.hadoop.fs.Path.<init>(Path.java:94)
at org.apache.spark.sql.hive.SparkHiveDynamicPartitionWriterContainer.org$apache$spark$sql$hive$SparkHiveDynamicPartitionWriterContainer$$newWriter$1(hiveWriterContainers.scala:230)
at org.apache.spark.sql.hive.SparkHiveDynamicPartitionWriterContainer$$anonfun$getLocalFileWriter$1.apply(hiveWriterContainers.scala:243)
at org.apache.spark.sql.hive.SparkHiveDynamicPartitionWriterContainer$$anonfun$getLocalFileWriter$1.apply(hiveWriterContainers.scala:243)
at scala.collection.mutable.MapLike$class.getOrElseUpdate(MapLike.scala:189)
at scala.collection.mutable.AbstractMap.getOrElseUpdate(Map.scala:91)
at org.apache.spark.sql.hive.SparkHiveDynamicPartitionWriterContainer.getLocalFileWriter(hiveWriterContainers.scala:243)
at org.apache.spark.sql.hive.execution.InsertIntoHiveTable$$anonfun$org$apache$spark$sql$hive$execution$InsertIntoHiveTable$$writeToFile$1$1.apply(InsertIntoHiveTable.scala:113)
at org.apache.spark.sql.hive.execution.InsertIntoHiveTable$$anonfun$org$apache$spark$sql$hive$execution$InsertIntoHiveTable$$writeToFile$1$1.apply(InsertIntoHiveTable.scala:105)
at scala.collection.Iterator$class.foreach(Iterator.scala:727)
at scala.collection.AbstractIterator.foreach(Iterator.scala:1157)
at org.apache.spark.sql.hive.execution.InsertIntoHiveTable.org$apache$spark$sql$hive$execution$InsertIntoHiveTable$$writeToFile$1(InsertIntoHiveTable.scala:105)
at org.apache.spark.sql.hive.execution.InsertIntoHiveTable$$anonfun$saveAsHiveFile$3.apply(InsertIntoHiveTable.scala:87)
at org.apache.spark.sql.hive.execution.InsertIntoHiveTable$$anonfun$saveAsHiveFile$3.apply(InsertIntoHiveTable.scala:87)
at org.apache.spark.scheduler.ResultTask.runTask(ResultTask.scala:61)
at org.apache.spark.scheduler.Task.run(Task.scala:64)
at org.apache.spark.executor.Executor$TaskRunner.run(Executor.scala:194)
at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1110)
at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:603)
at java.lang.Thread.run(Thread.java:722)
Caused by: java.net.URISyntaxException: Relative path in absolute URI: ts=2011-01-11+15:18:26
at java.net.URI.checkPath(URI.java:1804)
at java.net.URI.<init>(URI.java:752)
at org.apache.hadoop.fs.Path.initialize(Path.java:203)

SparkQA · 2015-02-04T12:57:53Z

Test build #26751 has started for PR 4368 at commit ea81daf.

This patch merges cleanly.

scwf · 2015-02-04T13:14:56Z

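The root cause is the colon (:) in the partition field value, e.g. 2011-01-11+14:18:26. Hadoop's Path treats the text before the first colon as a URI scheme, so the rest of the directory name becomes a relative path in an absolute URI.

TODO: add a test for this.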
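To make the root cause concrete, here is a minimal sketch that reproduces the same exception message using only java.net.URI. The class PartitionUriDemo and the method tryBuild are hypothetical names for illustration, and the scheme-splitting logic is a simplified assumption about how a path string with a colon before any slash is interpreted; it is not Hadoop's actual code.

```java
import java.net.URI;
import java.net.URISyntaxException;

// Hypothetical demo class; not part of Spark, Hive, or Hadoop.
final class PartitionUriDemo {

    // Simplified assumption: if a colon appears before any slash, the text
    // before it is taken as the URI scheme and the rest as the path.
    static String tryBuild(String pathString) {
        int colon = pathString.indexOf(':');
        int slash = pathString.indexOf('/');
        String scheme = null;
        String path = pathString;
        if (colon != -1 && (slash == -1 || colon < slash)) {
            scheme = pathString.substring(0, colon);
            path = pathString.substring(colon + 1);
        }
        try {
            // java.net.URI rejects a non-empty relative path when a scheme is present.
            new URI(scheme, null, path, null, null);
            return "ok";
        } catch (URISyntaxException e) {
            return e.getMessage();
        }
    }

    public static void main(String[] args) {
        // Partition directory name with a colon in the value: rejected.
        System.out.println(tryBuild("ts=2011-01-11+15:18:26"));
        // Same column without a colon in the value: accepted.
        System.out.println(tryBuild("ts=2011-01-11"));
    }
}
```

With the colon present, "ts=2011-01-11+15" ends up as the scheme and "18:26" as a relative path, which is the combination that URI.checkPath in the stack trace above refuses.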

scwf · 2015-02-04T13:15:12Z

/cc @liancheng

SparkQA · 2015-02-04T14:09:05Z

Test build #26751 has finished for PR 4368 at commit ea81daf.

This patch passes all tests.
This patch merges cleanly.
This patch adds no public classes.

AmplabJenkins · 2015-02-04T14:09:08Z

Test PASSed.
Refer to this link for build results (access rights to CI server needed):
https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/26751/
Test PASSed.

SparkQA · 2015-02-05T02:02:57Z

Test build #26809 has started for PR 4368 at commit f8f8bb1.

This patch merges cleanly.

SparkQA · 2015-02-05T03:16:44Z

Test build #26809 has finished for PR 4368 at commit f8f8bb1.

This patch passes all tests.
This patch merges cleanly.
This patch adds the following public classes (experimental):
- class UserDefinedFunction(object):
- abstract class NumericType extends NativeType with PrimitiveType

AmplabJenkins · 2015-02-05T03:16:47Z

Test PASSed.
Refer to this link for build results (access rights to CI server needed):
https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/26809/
Test PASSed.

marmbrus · 2015-02-06T21:12:40Z

sql/hive/src/main/scala/org/apache/spark/sql/hive/hiveWriterContainers.scala

-      }
-      .mkString
+        s"/$col=${
+          if (string == null || string.isEmpty) {


Can you move this computation out of the string interpolation into a variable? It's kinda odd to have a multi-line string that's not really multi-line.

SparkQA · 2015-02-07T01:27:34Z

Test build #26980 has started for PR 4368 at commit aa55ef4.

This patch merges cleanly.

SparkQA · 2015-02-07T02:43:01Z

Test build #26980 has finished for PR 4368 at commit aa55ef4.

This patch passes all tests.
This patch merges cleanly.
This patch adds no public classes.

AmplabJenkins · 2015-02-07T02:43:04Z

Test PASSed.
Refer to this link for build results (access rights to CI server needed):
https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/26980/
Test PASSed.

scwf · 2015-02-10T01:10:33Z

Updated.

liancheng · 2015-02-10T19:43:59Z

Is FileUtils.escapePathName also the standard way to handle this case in Hive?

liancheng · 2015-02-10T19:47:21Z

Yeah, Hive also uses FileUtils.escapePathName to handle partition directories.

LGTM, thanks! Merging into master and branch-1.3.
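As a rough illustration of the escaping idea discussed above, the sketch below percent-encodes unsafe characters in a partition value before building the directory name. It does not call Hive's FileUtils.escapePathName; the class PartitionPathEscape, the methods escape and partitionDir, and the UNSAFE character set are all hypothetical, and the set is deliberately smaller than whatever Hive actually escapes.

```java
// Hypothetical sketch; not Hive's FileUtils and not the code merged in this PR.
final class PartitionPathEscape {

    // Assumed, deliberately small set of characters that are unsafe in a
    // partition directory name. Hive's real set is larger.
    private static final String UNSAFE = ":/=%#?\\";

    // Replace each unsafe or control character with '%' plus two uppercase hex digits.
    static String escape(String value) {
        StringBuilder sb = new StringBuilder();
        for (int i = 0; i < value.length(); i++) {
            char c = value.charAt(i);
            if (c < 0x20 || UNSAFE.indexOf(c) >= 0) {
                sb.append('%').append(String.format("%02X", (int) c));
            } else {
                sb.append(c);
            }
        }
        return sb.toString();
    }

    // Build a "col=value" directory name with the value escaped.
    static String partitionDir(String col, String value) {
        return col + "=" + escape(value);
    }

    public static void main(String[] args) {
        System.out.println(partitionDir("ts", "2011-01-11+15:18:26"));
    }
}
```

Once the colons are encoded, the directory name no longer contains anything that can be mistaken for a URI scheme separator, so it can be used as a relative path component.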

Author: wangfei <[email protected]>
Author: Fei Wang <[email protected]>

Closes #4368 from scwf/SPARK-5592 and squashes the following commits:

aa55ef4 [Fei Wang] comments addressed
f8f8bb1 [wangfei] added test case
f24624f [wangfei] Merge branch 'master' of https://github.com/apache/spark into SPARK-5592
9998177 [wangfei] added test case
ea81daf [wangfei] fix URISyntaxException

(cherry picked from commit 59272da)
Signed-off-by: Cheng Lian <[email protected]>

fix URISyntaxException

ea81daf

scwf changed the title ~~]fix URISyntaxException~~ [SPARK-5592][SQL] java.net.URISyntaxException when insert data to a partitioned table Feb 4, 2015

scwf changed the title ~~[SPARK-5592][SQL] java.net.URISyntaxException when insert data to a partitioned table~~ [SPARK-5592][SQL][WIP] java.net.URISyntaxException when insert data to a partitioned table Feb 4, 2015

scwf added 3 commits February 5, 2015 08:55

added test case

9998177

Merge branch 'master' of https://github.com/apache/spark into SPARK-5592

f24624f

added test case

f8f8bb1

scwf changed the title ~~[SPARK-5592][SQL][WIP] java.net.URISyntaxException when insert data to a partitioned table~~ [SPARK-5592][SQL] java.net.URISyntaxException when insert data to a partitioned table Feb 5, 2015

marmbrus reviewed Feb 6, 2015

comments addressed

aa55ef4

asfgit closed this in 59272da Feb 10, 2015
