Apache Spark Developers List

This forum is an archive for the mailing list dev@spark.apache.org (more options) Messages posted here will be sent to this mailing list.
1234 ... 144
Topics (5008)
Replies Last Post Views
Fwd: A question about rdd transformation by Lionel Luffy
0
by Lionel Luffy
Handling nulls in vector columns is non-trivial by franklyn
3
by franklyn
[How-To][SQL] Create a dataframe inside the TableScan.buildScan method of a relation by OBones
0
by OBones
Why does Spark SQL use custom spark.sql.execution.id local property not SparkContext.setJobGroup? by Jacek Laskowski
0
by Jacek Laskowski
[build system] when it rains... berkeley lost power. again. use new url to visit jenkins by shane knapp
3
by shane knapp
[VOTE] Apache Spark 2.2.0 (RC5) by Michael Armbrust
4
by Imran Rashid-3
[build system] patching post-mortem: back to normal! by shane knapp
0
by shane knapp
[VOTE] Apache Spark 2.2.0 (RC4) by Michael Armbrust
44
by Michael Armbrust
[build system] rolling back R to working version by shane knapp
2
by Felix Cheung
appendix by sunerhan1992@sina.co...
2
by sunerhan1992@sina.co...
appendix by sunerhan1992@sina.co...
0
by sunerhan1992@sina.co...
Total memory tracking: request for comments by Jose Soltren
0
by Jose Soltren
[build system] [fixed] system update broke symlink for pypy-2.5.1, PRB builds failing by shane knapp
0
by shane knapp
Output Committers for S3 by Matthew Schauer
18
by Steve Loughran
[build system] immediate emergency updates and reboot to deal w/stack clash vulnerability by shane knapp
6
by shane knapp
dataframe mappartitions problem by sunerhan1992@sina.co...
2
by sunerhan1992@sina.co...
the meaning of partition column and bucket column please? by 萝卜丝炒饭
0
by 萝卜丝炒饭
the scheme in stream reader by 萝卜丝炒饭
2
by 萝卜丝炒饭
Unsubscribe by praba karan
0
by praba karan
cannot call explain or show on dataframe in structured streaming addBatch dataframe by assaf.mendelson
1
by Michael Armbrust
Question: why is Externalizable used? by Sean Owen
1
by rxin
Unsubscribe by vijendra rana
0
by vijendra rana
Memory issue in pyspark for 1.6 mb file by Naga Guduru
1
by Pralabh Kumar
Crowdsourced triage Scapegoat compiler plugin warnings by Josh Rosen-2
2
by Sean Owen
Custom Partitioning in Catalyst by RussS
3
by rxin
structured streaming documentation does not match behavior by assaf.mendelson
2
by Shixiong(Ryan) Zhu
How does MapWithStateRDD distribute the data by Soumitra Johri
2
by coolcoolkid
featureSubsetStrategy parameter for GradientBoostedTreesModel by Pralabh Kumar
1
by Pralabh Kumar
the dependence length of RDD, can its size be greater than 1 pleaae? by 萝卜丝炒饭
3
by 萝卜丝炒饭
Nested "struct" fonction call creates a compilation error in Spark SQL by Olivier Girardot-2
3
by Michael Armbrust
Performance regression for partitioned parquet data by Bertrand Bossy
3
by Bertrand Bossy
Re: [apache/spark] [TEST][SPARKR][CORE] Fix broken SparkSubmitSuite (#18283) by Sean Owen
0
by Sean Owen
SPARK-19547 by Rastogi, Pankaj
3
by Sree V
Can I use ChannelTrafficShapingHandler to control the network read/write speed in shuffle? by hustnn
2
by hustnn
[build system] jenkins currently down due to campus-wide power failure by shane knapp
2
by Anthony D. Joseph
1234 ... 144