Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[SPARK-18917][SQL] Remove schema check in appending data #16622

Closed
wants to merge 1 commit into from

Conversation

rxin
Copy link
Contributor

@rxin rxin commented Jan 17, 2017

What changes were proposed in this pull request?

In append mode, we check whether the schema of the write is compatible with the schema of the existing data. It can be a significant performance issue in cloud environment to find the existing schema for files. This patch removes the check.

Note that for catalog tables, we always do the check, as discussed in #16339 (comment)

How was this patch tested?

N/A

Closes #16339.

@SparkQA
Copy link

SparkQA commented Jan 17, 2017

Test build #71524 has finished for PR 16622 at commit 25272e9.

  • This patch passes all tests.
  • This patch merges cleanly.
  • This patch adds no public classes.

@gatorsmile
Copy link
Member

LGTM

@rxin
Copy link
Contributor Author

rxin commented Jan 17, 2017

Merging in master.

@asfgit asfgit closed this in 83dff87 Jan 17, 2017
uzadude pushed a commit to uzadude/spark that referenced this pull request Jan 27, 2017
## What changes were proposed in this pull request?
In append mode, we check whether the schema of the write is compatible with the schema of the existing data. It can be a significant performance issue in cloud environment to find the existing schema for files. This patch removes the check.

Note that for catalog tables, we always do the check, as discussed in apache#16339 (comment)

## How was this patch tested?
N/A

Closes apache#16339.

Author: Reynold Xin <[email protected]>

Closes apache#16622 from rxin/SPARK-18917.
cmonkey pushed a commit to cmonkey/spark that referenced this pull request Feb 15, 2017
## What changes were proposed in this pull request?
In append mode, we check whether the schema of the write is compatible with the schema of the existing data. It can be a significant performance issue in cloud environment to find the existing schema for files. This patch removes the check.

Note that for catalog tables, we always do the check, as discussed in apache#16339 (comment)

## How was this patch tested?
N/A

Closes apache#16339.

Author: Reynold Xin <[email protected]>

Closes apache#16622 from rxin/SPARK-18917.
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants