Update #1

Merged 822 commits on Feb 16, 2016
Commits
5f56980
[SPARK-11137][STREAMING] Make StreamingContext.stop() exception-safe
jayadevanmurali Jan 23, 2016
423783a
[SPARK-12904][SQL] Strength reduction for integral and decimal litera…
rxin Jan 23, 2016
cfdcef7
[STREAMING][MINOR] Scaladoc + logs
jaceklaskowski Jan 23, 2016
f400460
[SPARK-12971] Fix Hive tests which fail in Hadoop-2.3 SBT build
JoshRosen Jan 24, 2016
a834001
[SPARK-10498][TOOLS][BUILD] Add requirements.txt file for dev python …
holdenk Jan 24, 2016
e789b1d
[SPARK-12120][PYSPARK] Improve exception message when failing to init…
zjffdu Jan 24, 2016
3327fd2
[SPARK-12624][PYSPARK] Checks row length when converting Java arrays …
liancheng Jan 25, 2016
3adebfc
[SPARK-12901][SQL] Refactor options for JSON and CSV datasource (not …
HyukjinKwon Jan 25, 2016
d8e4805
[SPARK-12932][JAVA API] improved error message for java type inferenc…
andygrove Jan 25, 2016
4ee8191
[SPARK-12755][CORE] Stop the event logger before the DAG scheduler
Jan 25, 2016
dd2325d
[SPARK-11965][ML][DOC] Update user guide for RFormula feature interac…
yanboliang Jan 25, 2016
ef8fb36
Closes #10879
mengxr Jan 25, 2016
c037d25
[SPARK-12149][WEB UI] Executor UI improvement suggestions - Color UI
ajbozarth Jan 25, 2016
7d877c3
[SPARK-12902] [SQL] visualization for generated operators
Jan 25, 2016
00026fa
[SPARK-12901][SQL][HOT-FIX] Fix scala 2.11 compilation.
yhuai Jan 25, 2016
9348431
[SPARK-12975][SQL] Throwing Exception when Bucketing Columns are part…
gatorsmile Jan 25, 2016
dcae355
[SPARK-12905][ML][PYSPARK] PCAModel return eigenvalues for PySpark
yanboliang Jan 25, 2016
6f0f1d9
[SPARK-12934][SQL] Count-min sketch serialization
liancheng Jan 25, 2016
be375fc
[SPARK-12879] [SQL] improve the unsafe row writing framework
cloud-fan Jan 26, 2016
109061f
[SPARK-12936][SQL] Initial bloom filter implementation
cloud-fan Jan 26, 2016
fdcc351
[SPARK-12934] use try-with-resources for streams
tedyu Jan 26, 2016
b66afde
[SPARK-11922][PYSPARK][ML] Python api for ml.feature.quantile discret…
holdenk Jan 26, 2016
ae47ba7
[SPARK-12834] Change ser/de of JavaArray and JavaList
yinxusen Jan 26, 2016
27c910f
[SPARK-10086][MLLIB][STREAMING][PYSPARK] ignore StreamingKMeans test …
mengxr Jan 26, 2016
d54cfed
[SQL][MINOR] A few minor tweaks to CSV reader.
rxin Jan 26, 2016
6743de3
[SPARK-12937][SQL] bloom filter serialization
cloud-fan Jan 26, 2016
5936bf9
[SPARK-12961][CORE] Prevent snappy-java memory leak
viirya Jan 26, 2016
649e9d0
[SPARK-3369][CORE][STREAMING] Java mapPartitions Iterator->Iterable i…
srowen Jan 26, 2016
ae0309a
[SPARK-10911] Executors should System.exit on clean shutdown.
Jan 26, 2016
08c781c
[SPARK-12682][SQL] Add support for (optionally) not storing tables in…
sameeragarwal Jan 26, 2016
cbd507d
[SPARK-7799][STREAMING][DOCUMENT] Add the linking and deploying instr…
zsxwing Jan 26, 2016
8beab68
[SPARK-11923][ML] Python API for ml.feature.ChiSqSelector
yinxusen Jan 26, 2016
fbf7623
[SPARK-12952] EMLDAOptimizer initialize() should return EMLDAOptimize…
yinxusen Jan 26, 2016
ee74498
[SPARK-8725][PROJECT-INFRA] Test modules in topologically-sorted orde…
JoshRosen Jan 26, 2016
83507fe
[SQL] Minor Scaladoc format fix
liancheng Jan 26, 2016
19fdb21
[SPARK-12993][PYSPARK] Remove usage of ADD_FILES in pyspark
zjffdu Jan 26, 2016
eb91729
[SPARK-10509][PYSPARK] Reduce excessive param boiler plate code
holdenk Jan 26, 2016
22662b2
[SPARK-12614][CORE] Don't throw non fatal exception from ask
zsxwing Jan 27, 2016
1dac964
[SPARK-11622][MLLIB] Make LibSVMRelation extends HadoopFsRelation and…
zjffdu Jan 27, 2016
5551273
[SPARK-12854][SQL] Implement complex types support in ColumnarBatch
nongli Jan 27, 2016
b72611f
[SPARK-7780][MLLIB] intercept in logisticregressionwith lbfgs should …
holdenk Jan 27, 2016
e7f9199
[SPARK-12903][SPARKR] Add covar_samp and covar_pop for SparkR
yanboliang Jan 27, 2016
ce38a35
[SPARK-12935][SQL] DataFrame API for Count-Min Sketch
liancheng Jan 27, 2016
58f5d8c
[SPARK-12728][SQL] Integrates SQL generation with native view
liancheng Jan 27, 2016
bae3c9a
[SPARK-12967][NETTY] Avoid NettyRpc error message during sparkContext…
nishkamravi2 Jan 27, 2016
4db255c
[SPARK-12780] Inconsistency returning value of ML python models' prop…
yinxusen Jan 27, 2016
90b0e56
[SPARK-12983][CORE][DOC] Correct metrics.properties.template
BenFradet Jan 27, 2016
093291c
[SPARK-1680][DOCS] Explain environment variables for running on YARN …
weineran Jan 27, 2016
41f0c85
[SPARK-13023][PROJECT INFRA] Fix handling of root module in modules_t…
JoshRosen Jan 27, 2016
edd4737
[SPARK-10847][SQL][PYSPARK] Pyspark - DataFrame - Optional Metadata w…
jasoncl Jan 27, 2016
87abcf7
[SPARK-12895][SPARK-12896] Migrate TaskMetrics to accumulators
Jan 27, 2016
32f7411
[SPARK-13021][CORE] Fail fast when custom RDDs violate RDD.partition'…
JoshRosen Jan 27, 2016
680afab
[SPARK-12938][SQL] DataFrame API for Bloom filter
cloud-fan Jan 27, 2016
ef96cd3
[SPARK-12865][SPARK-12866][SQL] Migrate SparkSQLParser/ExtendedHiveQl…
hvanhovell Jan 27, 2016
d702f0c
[HOTFIX] Fix Scala 2.11 compilation
Jan 27, 2016
4a09123
[SPARK-13045] [SQL] Remove ColumnVector.Struct in favor of ColumnarBa…
nongli Jan 27, 2016
c220443
Provide same info as in spark-submit --help
jimlohse Jan 28, 2016
415d0a8
[SPARK-12818][SQL] Specialized integral and string types for Count-mi…
liancheng Jan 28, 2016
6768039
[SPARK-12926][SQL] SQLContext to display warning message when non-sql…
tejasapatil Jan 28, 2016
cc18a71
[SPARK-13031] [SQL] cleanup codegen and improve test coverage
Jan 28, 2016
df78a93
[SPARK-9835][ML] Implement IterativelyReweightedLeastSquares solver
yanboliang Jan 28, 2016
abae889
[SPARK-12401][SQL] Add integration tests for postgres enum types
maropu Jan 28, 2016
3a40c0e
[SPARK-12749][SQL] add json option to parse floating-point types as D…
blbradley Jan 28, 2016
4637fc0
[SPARK-11955][SQL] Mark optional fields in merging schema for safely …
viirya Jan 29, 2016
b9dfdcc
Revert "[SPARK-13031] [SQL] cleanup codegen and improve test coverage"
davies Jan 29, 2016
66449b8
[SPARK-12968][SQL] Implement command to set current database
viirya Jan 29, 2016
721ced2
[SPARK-13067] [SQL] workaround for a weird scala reflection problem
cloud-fan Jan 29, 2016
8d3cc3d
[SPARK-13050][BUILD] Scalatest tags fail build with the addition of t…
ajbozarth Jan 29, 2016
55561e7
[SPARK-13031][SQL] cleanup codegen and improve test coverage
Jan 29, 2016
e51b6ea
[SPARK-13032][ML][PYSPARK] PySpark support model export/import and ta…
yanboliang Jan 29, 2016
e4c1162
[SPARK-10873] Support column sort and search for History Server.
Jan 29, 2016
c5f745e
[SPARK-13072] [SQL] simplify and improve murmur3 hash expression codegen
cloud-fan Jan 29, 2016
5f686cc
[SPARK-12656] [SQL] Implement Intersect with Left-semi Join
gatorsmile Jan 29, 2016
2b027e9
[SPARK-12818] Polishes spark-sketch module
liancheng Jan 29, 2016
e38b0ba
[SPARK-13055] SQLHistoryListener throws ClassCastException
Jan 29, 2016
2cbc412
[SPARK-13076][SQL] Rename ClientInterface -> HiveClient
rxin Jan 30, 2016
e6ceac4
[SPARK-13096][TEST] Fix flaky verifyPeakExecutionMemorySet
Jan 30, 2016
70e69fc
[SPARK-13088] Fix DAG viz in latest version of chrome
Jan 30, 2016
12252d1
[SPARK-13071] Coalescing HadoopRDD overwrites existing input metrics
Jan 30, 2016
e6a02c6
[SPARK-12914] [SQL] generate aggregation with grouping keys
Jan 30, 2016
dab246f
[SPARK-13098] [SQL] remove GenericInternalRowWithSchema
cloud-fan Jan 30, 2016
289373b
[SPARK-6363][BUILD] Make Scala 2.11 the default Scala version
JoshRosen Jan 30, 2016
de28371
[SPARK-13100][SQL] improving the performance of stringToDate method i…
Jan 30, 2016
a1303de
[SPARK-13070][SQL] Better error message when Parquet schema merging f…
liancheng Jan 31, 2016
0e6d92d
[SPARK-12689][SQL] Migrate DDL parsing to the newly absorbed parser
viirya Jan 31, 2016
5a8b978
[SPARK-13049] Add First/last with ignore nulls to functions.scala
hvanhovell Jan 31, 2016
c1da4d4
[SPARK-13093] [SQL] improve null check in nullSafeCodeGen for unary, …
cloud-fan Feb 1, 2016
6075573
[SPARK-6847][CORE][STREAMING] Fix stack overflow issue when updateSta…
zsxwing Feb 1, 2016
33c8a49
[SPARK-12989][SQL] Delaying Alias Cleanup after ExtractWindowExpressions
gatorsmile Feb 1, 2016
8f26eb5
[SPARK-12705][SPARK-10777][SQL] Analyzer Rule ResolveSortReferences
gatorsmile Feb 1, 2016
da9146c
[DOCS] Fix the jar location of datanucleus in sql-programming-guid.md
maropu Feb 1, 2016
711ce04
[ML][MINOR] Invalid MulticlassClassification reference in ml-guide
Lewuathe Feb 1, 2016
51b03b7
[SPARK-12463][SPARK-12464][SPARK-12465][SPARK-10647][MESOS] Fix zooke…
tnachen Feb 1, 2016
a41b68b
[SPARK-12265][MESOS] Spark calls System.exit inside driver instead of…
nraychaudhuri Feb 1, 2016
c9b89a0
[SPARK-12979][MESOS] Don’t resolve paths on the local file system in …
dragos Feb 1, 2016
064b029
[SPARK-13043][SQL] Implement remaining catalyst types in ColumnarBatch.
nongli Feb 1, 2016
a2973fe
Fix for [SPARK-12854][SQL] Implement complex types support in Columna…
jaceklaskowski Feb 1, 2016
be7a2fc
[SPARK-13078][SQL] API and test cases for internal catalog
rxin Feb 1, 2016
715a19d
[SPARK-12637][CORE] Print stage info of finished stages properly
srowen Feb 2, 2016
0df3cfb
[SPARK-12790][CORE] Remove HistoryServer old multiple files format
felixcheung Feb 2, 2016
0fff5c6
[SPARK-13130][SQL] Make codegen variable names easier to read
rxin Feb 2, 2016
b8666fd
Closes #10662. Closes #10661
rxin Feb 2, 2016
22ba213
[SPARK-13087][SQL] Fix group by function for sort based aggregation
marmbrus Feb 2, 2016
12a20c1
[SPARK-10820][SQL] Support for the continuous execution of structured…
marmbrus Feb 2, 2016
29d9218
[SPARK-13094][SQL] Add encoders for seq/array of primitives
marmbrus Feb 2, 2016
b938301
[SPARK-13114][SQL] Add a test for tokens more than the fields in schema
HyukjinKwon Feb 2, 2016
cba1d6b
[SPARK-12631][PYSPARK][DOC] PySpark clustering parameter desc to cons…
BryanCutler Feb 2, 2016
358300c
[SPARK-13056][SQL] map column would throw NPE if value is null
adrian-wang Feb 2, 2016
b1835d7
[SPARK-12711][ML] ML StopWordsRemover does not protect itself from co…
grzegorz-chilkiewicz Feb 2, 2016
7f6e3ec
[SPARK-13138][SQL] Add "logical" package prefix for ddl.scala
rxin Feb 2, 2016
be5dd88
[SPARK-12913] [SQL] Improve performance of stat functions
Feb 2, 2016
d0df2ca
[SPARK-13121][STREAMING] java mapWithState mishandles scala Option
sparkyengine Feb 2, 2016
b377b03
[DOCS] Update StructType.scala
swkimme Feb 2, 2016
6de6a97
[SPARK-13150] [SQL] disable two flaky tests
Feb 3, 2016
672032d
[SPARK-13020][SQL][TEST] fix random generator for map type
cloud-fan Feb 3, 2016
21112e8
[SPARK-12992] [SQL] Update parquet reader to support more types when …
nongli Feb 3, 2016
ff71261
[SPARK-13122] Fix race condition in MemoryStore.unrollSafely()
Feb 3, 2016
99a6e3c
[SPARK-12951] [SQL] support spilling in generated aggregate
Feb 3, 2016
0557146
[SPARK-12732][ML] bug fix in linear regression train
iyounus Feb 3, 2016
335f10e
[SPARK-7997][CORE] Add rpcEnv.awaitTermination() back to SparkEnv
zsxwing Feb 3, 2016
e86f8f6
[SPARK-13147] [SQL] improve readability of generated code
Feb 3, 2016
138c300
[SPARK-12957][SQL] Initial support for constraint propagation in Spar…
sameeragarwal Feb 3, 2016
e9eb248
[SPARK-12739][STREAMING] Details of batch in Streaming tab uses two D…
mariobriggs Feb 3, 2016
c4feec2
[SPARK-12798] [SQL] generated BroadcastHashJoin
Feb 3, 2016
9dd2741
[SPARK-13157] [SQL] Support any kind of input for SQL commands.
hvanhovell Feb 3, 2016
3221edd
[SPARK-3611][WEB UI] Show number of cores for each executor in applic…
ajbozarth Feb 3, 2016
915a753
[SPARK-13166][SQL] Remove DataStreamReader/Writer
rxin Feb 4, 2016
de09145
[SPARK-13131] [SQL] Use best and average time in benchmark
Feb 4, 2016
a8e2ba7
[SPARK-13152][CORE] Fix task metrics deprecation warning
holdenk Feb 4, 2016
a648311
[SPARK-13079][SQL] Extend and implement InMemoryCatalog
Feb 4, 2016
0f81318
[SPARK-12828][SQL] add natural join support
adrian-wang Feb 4, 2016
c2c956b
[ML][DOC] fix wrong api link in ml onevsrest
hhbyyh Feb 4, 2016
d390871
[SPARK-13113] [CORE] Remove unnecessary bit operation when decoding p…
viirya Feb 4, 2016
dee801a
[SPARK-12828][SQL] Natural join follow-up
rxin Feb 4, 2016
2eaeafe
[SPARK-12330][MESOS] Fix mesos coarse mode cleanup
drcrallen Feb 4, 2016
62a7c28
[SPARK-13164][CORE] Replace deprecated synchronized buffer in core
holdenk Feb 4, 2016
4120bcb
[SPARK-13162] Standalone mode does not respect initial executors
Feb 4, 2016
15205da
[SPARK-13053][TEST] Unignore tests in InternalAccumulatorSuite
Feb 4, 2016
085f510
MAINTENANCE: Automated closing of pull requests.
Feb 4, 2016
33212cb
[SPARK-13168][SQL] Collapse adjacent repartition operators
JoshRosen Feb 4, 2016
c756bda
[SPARK-12330][MESOS][HOTFIX] Rename timeout config
Feb 4, 2016
bd38dd6
[SPARK-13079][SQL] InMemoryCatalog follow-ups
Feb 4, 2016
8e2f296
[SPARK-13195][STREAMING] Fix NoSuchElementException when a state is n…
zsxwing Feb 4, 2016
7a4b37f
[HOTFIX] Fix style violation caused by c756bda
Feb 4, 2016
6dbfc40
[SPARK-13052] waitingApps metric doesn't show the number of apps curr…
RaafatAkkad Feb 5, 2016
e3c75c6
[SPARK-12850][SQL] Support Bucket Pruning (Predicate Pushdown for Buc…
gatorsmile Feb 5, 2016
352102e
[SPARK-13208][CORE] Replace use of Pairs with Tuple2s
jodersky Feb 5, 2016
82d84ff
[SPARK-13187][SQL] Add boolean/long/double options in DataFrameReader…
rxin Feb 5, 2016
7b73f17
[SPARK-13166][SQL] Rename DataStreamReaderWriterSuite to DataFrameRea…
zsxwing Feb 5, 2016
1ed354a
[SPARK-12939][SQL] migrate encoder resolution logic to Analyzer
cloud-fan Feb 5, 2016
66e1383
[SPARK-13214][DOCS] update dynamicAllocation documentation
Feb 5, 2016
0bb5b73
[SPARK-13002][MESOS] Send initial request of executors for dyn alloca…
Feb 5, 2016
875f507
[SPARK-13215] [SQL] remove fallback in codegen
Feb 5, 2016
6883a51
[SPARK-13171][CORE] Replace future calls with Future
jodersky Feb 6, 2016
4f28291
[HOTFIX] fix float part of avgRate
davies Feb 6, 2016
81da3be
[SPARK-5865][API DOC] Add doc warnings for methods that return local …
Wenpei Feb 6, 2016
bc8890b
[SPARK-13132][MLLIB] cache standardization param value in LogisticReg…
idigary Feb 7, 2016
140ddef
[SPARK-10963][STREAMING][KAFKA] make KafkaCluster public
koeninger Feb 7, 2016
edf4a0e
[SPARK-12986][DOC] Fix pydoc warnings in mllib/regression.py
nampham2 Feb 8, 2016
06f0df6
[SPARK-8964] [SQL] Use Exchange to perform shuffle in Limit
JoshRosen Feb 8, 2016
8e4d15f
[SPARK-13101][SQL] nullability of array type element should not fail …
cloud-fan Feb 8, 2016
37bc203
[SPARK-13210][SQL] catch OOM when allocate memory and expand array
Feb 8, 2016
ff0af0d
[SPARK-13095] [SQL] improve performance for broadcast join with dimen…
Feb 8, 2016
eeaf45b
[SPARK-10620][SPARK-13054] Minor addendum to #10835
Feb 9, 2016
3708d13
[SPARK-12992] [SQL] Support vectorized decoding in UnsafeRowParquetRe…
nongli Feb 9, 2016
f9307d8
[SPARK-13176][CORE] Use native file linking instead of external proce…
jodersky Feb 9, 2016
159198e
[SPARK-13165][STREAMING] Replace deprecated synchronizedBuffer in str…
holdenk Feb 9, 2016
ce83fe9
[SPARK-13201][SPARK-13200] Deprecation warning cleanups: KMeans & MFD…
holdenk Feb 9, 2016
c882ec5
[SPARK-13040][DOCS] Update JDBC deprecated SPARK_CLASSPATH documentation
tiangolo Feb 9, 2016
d9ba4d2
[SPARK-13177][EXAMPLES] Update ActorWordCount example to not directly…
agsachin Feb 9, 2016
e30121a
[SPARK-13086][SHELL] Use the Scala REPL settings, to enable things li…
dragos Feb 9, 2016
68ed363
[SPARK-13170][STREAMING] Investigate replacing SynchronizedQueue as i…
srowen Feb 9, 2016
34d0b70
[SPARK-12807][YARN] Spark External Shuffle not working in Hadoop clus…
steveloughran Feb 9, 2016
2dbb916
[SPARK-13189] Cleanup build references to Scala 2.10
lresende Feb 9, 2016
7fe4fe6
[SPARK-12888] [SQL] [FOLLOW-UP] benchmark the new hash expression
cloud-fan Feb 9, 2016
fae830d
[SPARK-13245][CORE] Call shuffleMetrics methods only in one thread fo…
zsxwing Feb 10, 2016
0e5ebac
[SPARK-12950] [SQL] Improve lookup of BytesToBytesMap in aggregate
Feb 10, 2016
9267bc6
[SPARK-10524][ML] Use the soft prediction to order categories' bins
viirya Feb 10, 2016
6f710f9
[SPARK-12476][SQL] Implement JdbcRelation#unhandledFilters for removi…
maropu Feb 10, 2016
b385ce3
[SPARK-13149][SQL] Add FileStreamSource
zsxwing Feb 10, 2016
9269036
[SPARK-11565] Replace deprecated DigestUtils.shaHex call
gliptak Feb 10, 2016
2ba9b6a
[SPARK-11518][DEPLOY, WINDOWS] Handle spaces in Windows command scripts
tritab Feb 10, 2016
e834e42
[SPARK-13203] Add scalastyle rule banning use of mutable.Synchronized…
tedyu Feb 10, 2016
c0b71e0
[SPARK-9307][CORE][SPARK] Logging: Make it either stable or private
srowen Feb 10, 2016
80cb963
[SPARK-5095][MESOS] Support launching multiple mesos executors in coa…
Feb 10, 2016
5cf2059
[SPARK-13254][SQL] Fix planning of TakeOrderedAndProject operator
JoshRosen Feb 10, 2016
39cc620
[SPARK-13163][WEB UI] Column width on new History Server DataTables n…
ajbozarth Feb 10, 2016
4b80026
[SPARK-13126] fix the right margin of history page.
Feb 10, 2016
ce3bdae
[HOTFIX] Fix Scala 2.10 build break in TakeOrderedAndProjectSuite.
JoshRosen Feb 10, 2016
5947fa8
[SPARK-13057][SQL] Add benchmark codes and the performance results fo…
maropu Feb 10, 2016
29c5473
[SPARK-12414][CORE] Remove closure serializer
srowen Feb 10, 2016
0902e20
[SPARK-13146][SQL] Management API for continuous queries
tdas Feb 11, 2016
719973b
[SPARK-13274] Fix Aggregator Links on GroupedDataset Scala API
raelawang Feb 11, 2016
663cc40
[SPARK-12725][SQL] Resolving Name Conflicts in SQL Generation and Nam…
gatorsmile Feb 11, 2016
0f09f02
[SPARK-13205][SQL] SQL Generation Support for Self Join
gatorsmile Feb 11, 2016
b5761d1
[SPARK-12706] [SQL] grouping() and grouping_id()
Feb 11, 2016
8f744fe
[SPARK-13234] [SQL] remove duplicated SQL metrics
Feb 11, 2016
1842c55
[SPARK-13276] Catch bad characters at the end of a Table Identifier/E…
hvanhovell Feb 11, 2016
e88bff1
[SPARK-13235][SQL] Removed an Extra Distinct from the Plan when Using…
gatorsmile Feb 11, 2016
18bcbbd
[SPARK-13270][SQL] Remove extra new lines in whole stage codegen and …
nongli Feb 11, 2016
c2f21d8
[SPARK-13264][DOC] Removed multi-byte characters in spark-env.sh.temp…
sasakitoa Feb 11, 2016
f9ae99f
[SPARK-13074][CORE] Add JavaSparkContext. getPersistentRDDs method
jyssky Feb 11, 2016
13c17cb
[SPARK-13124][WEB UI] Fixed CSS and JS issues caused by addition of J…
ajbozarth Feb 11, 2016
219a74a
[STREAMING][TEST] Fix flaky streaming.FailureSuite
tdas Feb 11, 2016
e31c807
[SPARK-13277][SQL] ANTLR ignores other rule using the USING keyword
viirya Feb 11, 2016
0d50a22
[SPARK-12982][SQL] Add table name validation in temp table registration
jayadevanmurali Feb 11, 2016
50fa6fd
[SPARK-13279] Remove O(n^2) operation from scheduler.
Feb 11, 2016
c86009c
Revert "[SPARK-13279] Remove O(n^2) operation from scheduler."
rxin Feb 11, 2016
efb65e0
[SPARK-13265][ML] Refactoring of basic ML import/export for other fil…
yu-iskw Feb 11, 2016
574571c
[SPARK-11515][ML] QuantileDiscretizer should take random seed
yu-iskw Feb 11, 2016
c8f667d
[SPARK-13037][ML][PYSPARK] PySpark ml.recommendation support export/i…
vectorijk Feb 11, 2016
2426eb3
[MINOR][ML][PYSPARK] Cleanup test cases of clustering.py
yanboliang Feb 11, 2016
30e0095
[SPARK-13035][ML][PYSPARK] PySpark ml.clustering support export/import
yanboliang Feb 11, 2016
b354673
[SPARK-13047][PYSPARK][ML] Pyspark Params.hasParam should not throw a…
sethah Feb 12, 2016
a525704
[SPARK-12765][ML][COUNTVECTORIZER] fix CountVectorizer.transform's lo…
sloth2012 Feb 12, 2016
b10af5e
[SPARK-12915][SQL] add SQL metrics of numOutputRows for whole stage c…
Feb 12, 2016
8121a4b
[SPARK-13277][BUILD] Follow-up ANTLR warnings are treated as build er…
hvanhovell Feb 12, 2016
5f1c359
[SPARK-12746][ML] ArrayType(_, true) should also accept ArrayType(_, …
Earthson Feb 12, 2016
d3e2e20
[SPARK-13153][PYSPARK] ML persistence failed when handle no default v…
Wenpei Feb 12, 2016
a2c7dcf
[SPARK-7889][WEBUI] HistoryServer updates UI for incomplete apps
steveloughran Feb 12, 2016
894921d
[SPARK-6166] Limit number of in flight outbound requests
Feb 12, 2016
a183dda
[SPARK-12974][ML][PYSPARK] Add Python API for spark.ml bisecting k-means
yanboliang Feb 12, 2016
64515e5
[SPARK-13154][PYTHON] Add linting for pydocs
holdenk Feb 12, 2016
5b805df
[SPARK-12705] [SQL] push missing attributes for Sort
Feb 12, 2016
c4d5ad8
[SPARK-13282][SQL] LogicalPlan toSql should just return a String
rxin Feb 12, 2016
ac7d6af
[SPARK-13260][SQL] count(*) does not work with CSV data source
HyukjinKwon Feb 12, 2016
90de6b2
[SPARK-12962] [SQL] [PySpark] PySpark support covar_samp and covar_pop
yanboliang Feb 12, 2016
42d6568
[SPARK-12630][PYSPARK] [DOC] PySpark classification parameter desc to…
vijaykiran Feb 12, 2016
38bc601
[SPARK-5095] Fix style in mesos coarse grained scheduler code
Feb 12, 2016
62b1c07
[SPARK-5095] remove flaky test
Feb 12, 2016
2228f07
[SPARK-13293][SQL] generate Expand
Feb 13, 2016
374c4b2
[SPARK-13142][WEB UI] Problem accessing Web UI /logPage/ on Microsoft…
Feb 13, 2016
e3441e3
[SPARK-12363][MLLIB] Remove setRun and fix PowerIterationClustering f…
viirya Feb 13, 2016
610196f
Closes #11185
rxin Feb 14, 2016
388cd9e
[SPARK-13172][CORE][SQL] Stop using RichException.getStackTrace it is…
srowen Feb 14, 2016
354d4c2
[SPARK-13296][SQL] Move UserDefinedFunction into sql.expressions.
rxin Feb 14, 2016
331293c
[SPARK-13300][DOCUMENTATION] Added pygments.rb dependancy
amitdev Feb 14, 2016
22e9723
[SPARK-13278][CORE] Launcher fails to start with JDK 9 EA
cl4es Feb 14, 2016
7cb4d74
[SPARK-13185][SQL] Reuse Calendar object in DateTimeUtils.StringToDat…
carsonwang Feb 15, 2016
a8bbc4f
[SPARK-12503][SPARK-12505] Limit pushdown in UNION ALL and OUTER JOIN
JoshRosen Feb 15, 2016
56d4939
[SPARK-12995][GRAPHX] Remove deprecate APIs from Pregel
maropu Feb 15, 2016
adb5483
[SPARK-13312][MLLIB] Update java train-validation-split example in ml…
JeremyNixon Feb 15, 2016
cbeb006
[SPARK-13097][ML] Binarizer allowing Double AND Vector input types
seddonm1 Feb 16, 2016
e4675c2
[SPARK-13018][DOCS] Replace example code in mllib-pmml-model-export.m…
keypointt Feb 16, 2016
fee739f
[SPARK-13221] [SQL] Fixing GroupingSets when Aggregate Functions Cont…
gatorsmile Feb 16, 2016
827ed1c
Correct SparseVector.parse documentation
mgyucht Feb 16, 2016
00c72d2
[SPARK-12247][ML][DOC] Documentation for spark.ml's ALS and collabora…
BenFradet Feb 16, 2016
19dc69d
[SPARK-12976][SQL] Add LazilyGenerateOrdering and use it for RangePar…
ueshin Feb 16, 2016
c7d00a2
[SPARK-13280][STREAMING] Use a better logger name for FileBasedWriteA…
Feb 16, 2016
5f37aad
[SPARK-13308] ManagedBuffers passed to OneToOneStreamManager need to …
JoshRosen Feb 16, 2016
1 change: 0 additions & 1 deletion .gitignore
@@ -60,7 +60,6 @@ dev/create-release/*final
spark-*-bin-*.tgz
unit-tests.log
/lib/
-ec2/lib/
rat-results.txt
scalastyle.txt
scalastyle-output.xml
27 changes: 21 additions & 6 deletions .rat-excludes
@@ -25,6 +25,16 @@ graphlib-dot.min.js
sorttable.js
vis.min.js
vis.min.css
+dataTables.bootstrap.css
+dataTables.bootstrap.min.js
+dataTables.rowsGroup.js
+jquery.blockUI.min.js
+jquery.cookies.2.2.0.min.js
+jquery.dataTables.1.10.4.min.css
+jquery.dataTables.1.10.4.min.js
+jquery.mustache.js
+jsonFormatter.min.css
+jsonFormatter.min.js
.*avsc
.*txt
.*json
@@ -63,12 +73,12 @@ logs
.*dependency-reduced-pom.xml
known_translations
json_expectation
-local-1422981759269/*
-local-1422981780767/*
-local-1425081759269/*
-local-1426533911241/*
-local-1426633911242/*
-local-1430917381534/*
+local-1422981759269
+local-1422981780767
+local-1425081759269
+local-1426533911241
+local-1426633911242
+local-1430917381534
local-1430917381535_1
local-1430917381535_2
DESCRIPTION
@@ -82,4 +92,9 @@ INDEX
gen-java.*
.*avpr
org.apache.spark.sql.sources.DataSourceRegister
+org.apache.spark.scheduler.SparkHistoryListenerFactory
.*parquet
+LZ4BlockInputStream.java
+spark-deps-.*
+.*csv
+.*tsv
29 changes: 17 additions & 12 deletions LICENSE
@@ -1,4 +1,3 @@
-
Apache License
Version 2.0, January 2004
http://www.apache.org/licenses/
@@ -237,7 +236,7 @@ The following components are provided under a BSD-style license. See project lin
The text of each license is also included at licenses/LICENSE-[project].txt.

(BSD 3 Clause) netlib core (com.github.fommil.netlib:core:1.1.2 - https://github.com/fommil/netlib-java/core)
-(BSD 3 Clause) JPMML-Model (org.jpmml:pmml-model:1.1.15 - https://github.com/jpmml/jpmml-model)
+(BSD 3 Clause) JPMML-Model (org.jpmml:pmml-model:1.2.7 - https://github.com/jpmml/jpmml-model)
(BSD 3-clause style license) jblas (org.jblas:jblas:1.2.4 - http://jblas.org/)
(BSD License) AntLR Parser Generator (antlr:antlr:2.7.7 - http://www.antlr.org/)
(BSD licence) ANTLR ST4 4.0.4 (org.antlr:ST4:4.0.4 - http://www.stringtemplate.org)
@@ -250,22 +249,22 @@ The text of each license is also included at licenses/LICENSE-[project].txt.
(Interpreter classes (all .scala files in repl/src/main/scala
except for Main.Scala, SparkHelper.scala and ExecutorClassLoader.scala),
and for SerializableMapWrapper in JavaUtils.scala)
-(BSD-like) Scala Actors library (org.scala-lang:scala-actors:2.10.5 - http://www.scala-lang.org/)
-(BSD-like) Scala Compiler (org.scala-lang:scala-compiler:2.10.5 - http://www.scala-lang.org/)
-(BSD-like) Scala Compiler (org.scala-lang:scala-reflect:2.10.5 - http://www.scala-lang.org/)
-(BSD-like) Scala Library (org.scala-lang:scala-library:2.10.5 - http://www.scala-lang.org/)
-(BSD-like) Scalap (org.scala-lang:scalap:2.10.5 - http://www.scala-lang.org/)
-(BSD-style) scalacheck (org.scalacheck:scalacheck_2.10:1.10.0 - http://www.scalacheck.org)
-(BSD-style) spire (org.spire-math:spire_2.10:0.7.1 - http://spire-math.org)
-(BSD-style) spire-macros (org.spire-math:spire-macros_2.10:0.7.1 - http://spire-math.org)
+(BSD-like) Scala Actors library (org.scala-lang:scala-actors:2.11.7 - http://www.scala-lang.org/)
+(BSD-like) Scala Compiler (org.scala-lang:scala-compiler:2.11.7 - http://www.scala-lang.org/)
+(BSD-like) Scala Compiler (org.scala-lang:scala-reflect:2.11.7 - http://www.scala-lang.org/)
+(BSD-like) Scala Library (org.scala-lang:scala-library:2.11.7 - http://www.scala-lang.org/)
+(BSD-like) Scalap (org.scala-lang:scalap:2.11.7 - http://www.scala-lang.org/)
+(BSD-style) scalacheck (org.scalacheck:scalacheck_2.11:1.10.0 - http://www.scalacheck.org)
+(BSD-style) spire (org.spire-math:spire_2.11:0.7.1 - http://spire-math.org)
+(BSD-style) spire-macros (org.spire-math:spire-macros_2.11:0.7.1 - http://spire-math.org)
(New BSD License) Kryo (com.esotericsoftware.kryo:kryo:2.21 - http://code.google.com/p/kryo/)
(New BSD License) MinLog (com.esotericsoftware.minlog:minlog:1.2 - http://code.google.com/p/minlog/)
(New BSD License) ReflectASM (com.esotericsoftware.reflectasm:reflectasm:1.07 - http://code.google.com/p/reflectasm/)
(New BSD license) Protocol Buffer Java API (com.google.protobuf:protobuf-java:2.5.0 - http://code.google.com/p/protobuf)
(New BSD license) Protocol Buffer Java API (org.spark-project.protobuf:protobuf-java:2.4.1-shaded - http://code.google.com/p/protobuf)
(The BSD License) Fortran to Java ARPACK (net.sourceforge.f2j:arpack_combined_all:0.1 - http://f2j.sourceforge.net)
(The BSD License) xmlenc Library (xmlenc:xmlenc:0.52 - http://xmlenc.sourceforge.net)
-(The New BSD License) Py4J (net.sf.py4j:py4j:0.9 - http://py4j.sourceforge.net/)
+(The New BSD License) Py4J (net.sf.py4j:py4j:0.9.1 - http://py4j.sourceforge.net/)
(Two-clause BSD-style license) JUnit-Interface (com.novocode:junit-interface:0.10 - http://github.com/szeiger/junit-interface/)
(BSD licence) sbt and sbt-launch-lib.bash
(BSD 3 Clause) d3.min.js (https://github.com/mbostock/d3/blob/master/LICENSE)
@@ -284,11 +283,17 @@ The text of each license is also included at licenses/LICENSE-[project].txt.
(MIT License) SLF4J API Module (org.slf4j:slf4j-api:1.7.5 - http://www.slf4j.org)
(MIT License) SLF4J LOG4J-12 Binding (org.slf4j:slf4j-log4j12:1.7.5 - http://www.slf4j.org)
(MIT License) pyrolite (org.spark-project:pyrolite:2.0.1 - http://pythonhosted.org/Pyro4/)
-(MIT License) scopt (com.github.scopt:scopt_2.10:3.2.0 - https://github.com/scopt/scopt)
+(MIT License) scopt (com.github.scopt:scopt_2.11:3.2.0 - https://github.com/scopt/scopt)
(The MIT License) Mockito (org.mockito:mockito-core:1.9.5 - http://www.mockito.org)
(MIT License) jquery (https://jquery.org/license/)
(MIT License) AnchorJS (https://github.com/bryanbraun/anchorjs)
(MIT License) graphlib-dot (https://github.com/cpettitt/graphlib-dot)
(MIT License) dagre-d3 (https://github.com/cpettitt/dagre-d3)
(MIT License) sorttable (https://github.com/stuartlangridge/sorttable)
(MIT License) boto (https://github.com/boto/boto/blob/develop/LICENSE)
+(MIT License) datatables (http://datatables.net/license)
+(MIT License) mustache (https://github.com/mustache/mustache/blob/master/LICENSE)
+(MIT License) cookies (http://code.google.com/p/cookies/wiki/License)
+(MIT License) blockUI (http://jquery.malsup.com/block/)
+(MIT License) RowsGroup (http://datatables.net/license/mit)
+(MIT License) jsonFormatter (http://www.jqueryscript.net/other/jQuery-Plugin-For-Pretty-JSON-Formatting-jsonFormatter.html)
61 changes: 60 additions & 1 deletion NOTICE
@@ -606,4 +606,63 @@ Vis.js uses and redistributes the following third-party libraries:

- keycharm
https://github.com/AlexDM0/keycharm
-The MIT License
+The MIT License
+
+===============================================================================
+
+The CSS style for the navigation sidebar of the documentation was originally
+submitted by Óscar Nájera for the scikit-learn project. The scikit-learn project
+is distributed under the 3-Clause BSD license.
+===============================================================================
+
+For CSV functionality:
+
+/*
+* Copyright 2014 Databricks
+*
+* Licensed under the Apache License, Version 2.0 (the "License");
+* you may not use this file except in compliance with the License.
+* You may obtain a copy of the License at
+*
+* http://www.apache.org/licenses/LICENSE-2.0
+*
+* Unless required by applicable law or agreed to in writing, software
+* distributed under the License is distributed on an "AS IS" BASIS,
+* WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
+* See the License for the specific language governing permissions and
+* limitations under the License.
+*/
+
+/*
+* Copyright 2015 Ayasdi Inc
+*
+* Licensed under the Apache License, Version 2.0 (the "License");
+* you may not use this file except in compliance with the License.
+* You may obtain a copy of the License at
+*
+* http://www.apache.org/licenses/LICENSE-2.0
+*
+* Unless required by applicable law or agreed to in writing, software
+* distributed under the License is distributed on an "AS IS" BASIS,
+* WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
+* See the License for the specific language governing permissions and
+* limitations under the License.
+*/
+
+
+===============================================================================
+For dev/sparktestsupport/toposort.py:
+
+Copyright 2014 True Blade Systems, Inc.
+
+Licensed under the Apache License, Version 2.0 (the "License");
+you may not use this file except in compliance with the License.
+You may obtain a copy of the License at
+
+http://www.apache.org/licenses/LICENSE-2.0
+
+Unless required by applicable law or agreed to in writing, software
+distributed under the License is distributed on an "AS IS" BASIS,
+WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
+See the License for the specific language governing permissions and
+limitations under the License.
10 changes: 10 additions & 0 deletions R/README.md
@@ -1,6 +1,16 @@
# R on Spark

SparkR is an R package that provides a light-weight frontend to use Spark from R.
+### Installing sparkR
+
+Libraries of sparkR need to be created in `$SPARK_HOME/R/lib`. This can be done by running the script `$SPARK_HOME/R/install-dev.sh`.
+By default the above script uses the system-wide installation of R. However, this can be changed to any user-installed location of R by setting the environment variable `R_HOME` to the full path of the base directory where R is installed, before running the install-dev.sh script.
+Example:
+```
+# where /home/username/R is where R is installed and /home/username/R/bin contains the files R and Rscript
+export R_HOME=/home/username/R
+./install-dev.sh
+```

### SparkR development

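Note on the R/README.md change above: one way to sanity-check an `R_HOME` override before running the install script might look like the sketch below. The function name `check_r_home` is hypothetical and is not part of SparkR or of this PR. It only assumes what the README example states, namely that `$R_HOME/bin` should contain the files `R` and `Rscript`.

```shell
# Hypothetical pre-flight check (illustration only, not part of SparkR).
# Returns 0 if <dir>/bin contains executable R and Rscript files,
# otherwise prints the first missing file to stderr and returns 1.
check_r_home() {
  for _exe in R Rscript; do
    if [ ! -x "$1/bin/$_exe" ]; then
      echo "missing or not executable: $1/bin/$_exe" >&2
      return 1
    fi
  done
  return 0
}
```

A caller could run `check_r_home "$R_HOME" && ./install-dev.sh` so that a mistyped path fails early rather than partway through the install.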
11 changes: 9 additions & 2 deletions R/install-dev.sh
@@ -35,12 +35,19 @@ LIB_DIR="$FWDIR/lib"
mkdir -p $LIB_DIR

pushd $FWDIR > /dev/null
+if [ ! -z "$R_HOME" ]
+then
+R_SCRIPT_PATH="$R_HOME/bin"
+else
+R_SCRIPT_PATH="$(dirname $(which R))"
+fi
+echo "USING R_HOME = $R_HOME"

# Generate Rd files if devtools is installed
-Rscript -e ' if("devtools" %in% rownames(installed.packages())) { library(devtools); devtools::document(pkg="./pkg", roclets=c("rd")) }'
+"$R_SCRIPT_PATH/"Rscript -e ' if("devtools" %in% rownames(installed.packages())) { library(devtools); devtools::document(pkg="./pkg", roclets=c("rd")) }'

# Install SparkR to $LIB_DIR
-R CMD INSTALL --library=$LIB_DIR $FWDIR/pkg/
+"$R_SCRIPT_PATH/"R CMD INSTALL --library=$LIB_DIR $FWDIR/pkg/

# Zip the SparkR package so that it can be distributed to worker nodes on YARN
cd $LIB_DIR
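Note on the R/install-dev.sh hunk above: the `R_HOME` fallback it adds can be read as the standalone sketch below. `resolve_r_script_path` is a hypothetical name, not a function in install-dev.sh. Unlike the script, the sketch uses `command -v` rather than `which` and fails explicitly when no R is found. Both differences are assumptions made for illustration, not what the PR does.

```shell
# Hypothetical helper (illustration only, not part of install-dev.sh).
# Prints the directory expected to hold the R and Rscript executables.
resolve_r_script_path() {
  if [ -n "${R_HOME:-}" ]; then
    # R_HOME is set: use that installation's bin directory
    printf '%s\n' "$R_HOME/bin"
  elif command -v R > /dev/null 2>&1; then
    # Otherwise fall back to whichever R is first on PATH
    dirname "$(command -v R)"
  else
    echo "R not found: set R_HOME or add R to PATH" >&2
    return 1
  fi
}
```

With this shape, a script could assign `R_SCRIPT_PATH="$(resolve_r_script_path)" || exit 1` and then call `"$R_SCRIPT_PATH/Rscript"` and `"$R_SCRIPT_PATH/R"` as the hunk does. Note that in the hunk itself, the `echo "USING R_HOME = $R_HOME"` line prints an empty value when the fallback branch is taken.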
2 changes: 1 addition & 1 deletion R/pkg/DESCRIPTION
@@ -1,7 +1,7 @@
Package: SparkR
Type: Package
Title: R frontend for Spark
-Version: 1.6.0
+Version: 2.0.0
Date: 2013-09-09
Author: The Apache Software Foundation
Maintainer: Shivaram Venkataraman <[email protected]>
19 changes: 17 additions & 2 deletions R/pkg/NAMESPACE
@@ -35,10 +35,14 @@ exportMethods("arrange",
"count",
"cov",
"corr",
+"covar_samp",
+"covar_pop",
"crosstab",
"describe",
"dim",
"distinct",
+"drop",
+"dropDuplicates",
"dropna",
"dtypes",
"except",
@@ -92,7 +96,10 @@ exportMethods("arrange",
"with",
"withColumn",
"withColumnRenamed",
-"write.df")
+"write.df",
+"write.json",
+"write.parquet",
+"write.text")

exportClasses("Column")

@@ -127,15 +134,18 @@ exportMethods("%in%",
"count",
"countDistinct",
"crc32",
+"hash",
"cume_dist",
"date_add",
"date_format",
"date_sub",
"datediff",
"dayofmonth",
"dayofyear",
+"decode",
"dense_rank",
"desc",
+"encode",
"endsWith",
"exp",
"explode",
@@ -225,6 +235,7 @@ exportMethods("%in%",
"stddev",
"stddev_pop",
"stddev_samp",
+"struct",
"sqrt",
"startsWith",
"substr",
@@ -267,8 +278,12 @@ export("as.DataFrame",
"loadDF",
"parquetFile",
"read.df",
+"read.json",
+"read.parquet",
+"read.text",
"sql",
-"table",
+"str",
+"tableToDF",
"tableNames",
"tables",
"uncacheTable")