Apache Spark Developers List

This forum is an archive for the mailing list dev@spark.apache.org (more options) Messages posted here will be sent to this mailing list.
1234567 ... 160
Topics (5569)
Replies Last Post Views
Running lint-java during PR builds? by Marcelo Vanzin
12
by Hyukjin Kwon
[SQL] Two ScalarSubquery expressions?! Could we have ScalarSubqueryExec instead? by Jacek Laskowski
0
by Jacek Laskowski
[VOTE] Spark 2.3.1 (RC2) by Marcelo Vanzin
8
by Marcelo Vanzin
Design proposal for streaming APIs in data source V2 by Joseph Torres
0
by Joseph Torres
Integrating ML/DL frameworks with Spark by rxin
15
by Xiangrui Meng-2
ML Pipelines in R by Hossein
1
by Hossein
Repeated FileSourceScanExec.metrics from ColumnarBatchScan.metrics by Jacek Laskowski
0
by Jacek Laskowski
Sort-merge join improvement by pzecevic
4
by pzecevic
[VOTE] Spark 2.3.1 (RC1) by Marcelo Vanzin
18
by Marcelo Vanzin
eager execution and debuggability by rxin
15
by Ryan Blue
[DISCUSS] PySpark Window UDF by Li Jin
0
by Li Jin
Preventing predicate pushdown by Tomasz Gawęda
2
by Tomasz Gawęda
parser error? by rxin
3
by Marco Gaido
InMemoryTableScanExec.inputRDD and buffers (RDD[CachedBatch]) by Jacek Laskowski
0
by Jacek Laskowski
Build timeout -- continuous-integration/appveyor/pr — AppVeyor build failed by ifilonenko
3
by Hyukjin Kwon
Time for 2.3.1? by Marcelo Vanzin
7
by Shivaram Venkatarama...
Possible SPIP to improve matrix and vector column type support by Leif Walsh
3
by Leif Walsh
Spark UI Source Code by Anshi-Shrivastava
1
by Anshi-Shrivastava
Documenting the various DataFrame/SQL join types by Nicholas Chammas
2
by Nicholas Chammas
Identifying specific persisted DataFrames via getPersistentRDDs() by Nicholas Chammas
4
by Nicholas Chammas
[DISCUSS] Spark SQL internal data: InternalRow or UnsafeRow? by Ryan Blue
4
by Ryan Blue
Design for continuous processing shuffle by Joseph Torres
1
by Yuanjian Li
Re: [Structured streaming, V2] commit on ContinuousReader by Joseph Torres
0
by Joseph Torres
Re: org.apache.spark.shuffle.FetchFailedException: Too large frame: by Ryan Blue
1
by Ryan Blue
SparkR test failures in PR builder by Joseph Bradley
3
by Xiao Li
Custom datasource as a wrapper for existing ones? by jwozniak
6
by Jörn Franke
Sorting on a streaming dataframe by Hemant Bhanawat
9
by Hemant Bhanawat
Datasource API V2 and checkpointing by Thakrar, Jayesh
10
by Thakrar, Jayesh
[build system] jenkins master unreachable, build system currently down by shane knapp
4
by Joseph Bradley
PySpark.sql.filter not performing as it should by 880f0464
0
by 880f0464
re: sharing data via kafka broker using spark streaming/ AnalysisException on collect() by peterliu
0
by peterliu
[Kubernetes] structured-streaming driver restarts / roadmap by lucas-vsco
2
by ozb
[MLLib] Logistic Regression and standadization by Filipp Zhinkin
8
by acopich
Correlated subqueries in the DataFrame API by Nicholas Chammas
3
by Nicholas Chammas
unsubscribe by Deepesh Maheshwari
0
by Deepesh Maheshwari
1234567 ... 160