Apache Spark Developers List

This forum is an archive for the mailing list dev@spark.apache.org. Messages posted here will be sent to this mailing list.
Topics (5680)
Topic | Replies | Last post by
Spark data quality bug when reading parquet files from hive metastore by Long, Andrew-2 | 3 | Long, Andrew-2
Spark JIRA tags clarification and management by Hyukjin Kwon | 9 | rxin
time for Apache Spark 3.0? by rxin | 30 | Matei Zaharia
python test infrastructure by Imran Rashid-4 | 3 | Imran Rashid-4
Pool Information Details cannot be accessed from HistoryServer UI by sandeep_katta | 0 | sandeep_katta
How to parallelize JDBC Read in Spark by Chetan Khatri | 0 | Chetan Khatri
no logging in pyspark code? by Imran Rashid-4 | 3 | Hyukjin Kwon
code freeze and branch cut for Apache Spark 2.4 by rxin | 56 | Hyukjin Kwon
Kubernetes Big-Data-SIG notes, September 5 by Erik Erlandson-2 | 0 | Erik Erlandson-2
Select top (100) percent equivalent in spark by Chetan Khatri | 6 | Liang-Chi Hsieh
[DISCUSS] SPIP: APIs for Table Metadata Operations by Ryan Blue | 11 | RussS
[ML] Setting Non-Transform Params for a Pipeline & PipelineModel by Aleksander Eskilson | 0 | Aleksander Eskilson
Nightly Builds in the docs (in spark-nightly/spark-master-bin/latest? Can't seem to find it) by Jacek Laskowski | 5 | shane knapp
Jenkins automatic disabling service - who and why? by Hyukjin Kwon | 5 | shane knapp
mllib + SQL by Hemant Bhanawat | 5 | Hemant Bhanawat
[discuss] replacing SPIP template with Heilmeier's Catechism? by rxin | 7 | Ryan Blue
TimSort bug by rxin | 3 | rxin
SPIP: Executor Plugin (SPARK-24918) by Imran Rashid-4 | 14 | rxin
Upgrade SBT to the latest by Darcy Shen | 2 | Ted Yu
[DISCUSS] move away from python doctests by Imran Rashid-4 | 6 | Hyukjin Kwon
Spark Streaming : Multiple sources found for csv : Error by Srabasti Banerjee | 4 | Srabasti Banerjee
Update to Kryo 4 for Spark 2.4? by Sean Owen-3 | 0 | Sean Owen-3
Joining DataFrames derived from the same source yields confusing/incorrect results by Nicholas Chammas | 1 | Tomasz Gawęda
Persisting driver logs in yarn client mode (SPARK-25118) by Ankur Gupta | 6 | Henry Robinson
SparkContext singleton get w/o create? by Andrew Melo | 9 | Andrew Melo
Why is View logical operator not a UnaryNode explicitly? by Jacek Laskowski | 0 | Jacek Laskowski
Reading 20 GB of log files from Directory - Out of Memory Error by Chetan Khatri | 1 | Chetan Khatri
multiple group by action by 崔苗 | 0 | 崔苗
Porting or explicitly linking project style in Apache Spark based on https://github.com/databricks/scala-style-guide by Hyukjin Kwon | 5 | Hyukjin Kwon
[MLlib][Test] Smoke and Metamorphic Testing of MLlib by Steffen Herbold | 5 | Matei Zaharia
Spark github sync works now by Xiao Li | 0 | Xiao Li
Spark DataFrame UNPIVOT feature by Ivan Gozali | 3 | zero323
[Performance] Spark DataFrame is slow with wide data. Polynomial complexity on the number of columns is observed. Why? by makatun | 12 | makatun
[DISCUSS] USING syntax for Datasource V2 by Hyukjin Kwon | 4 | RussS
[discuss][minor] impending python 3.x jenkins upgrade... 3.5.x? 3.6.x? by shane knapp | 5 | shane knapp