Apache Spark Developers List

This forum is an archive for the mailing list dev@spark.apache.org. Messages posted here will be sent to this mailing list.
Topics (5162)
Topic, Replies, Last Post
[SS] Collapsing EventTimeWatermark logical operators? by Jacek Laskowski
0
by Jacek Laskowski
[Structured Streaming] OOM on ConsoleSink with large inputs by Gerard Maas
0
by Gerard Maas
[SS] watermark, eventTime and "StreamExecution: Streaming query made progress" by Jacek Laskowski
1
by Michael Armbrust
[build system] jenkins back up and building by shane knapp
0
by shane knapp
Any committer interested in speaking at Solutions.Hamburg by Christofer Dutz
0
by Christofer Dutz
Use Apache ORC in Apache Spark 2.3 by Dong Joon Hyun
8
by Sean Owen
Welcoming Hyukjin Kwon and Sameer Agarwal as committers by Matei Zaharia
26
by Joseph Bradley
Spark 2.1.x client with 2.2.0 cluster by Ted Yu
1
by Saisai Shao
How can I test the expense of serialization and deserialization in spark? by 163
0
by 163
The uniqueSource in StreamExecution: where is it changed, please? by 萝卜丝炒饭
2
by 萝卜丝炒饭
Question, Flaky tests: pyspark.sql.tests.ArrowTests tests in Jenkins worker 5(?) by Hyukjin Kwon
4
by Hyukjin Kwon
[VOTE] [SPIP] SPARK-18085: Better History Server scalability by Marcelo Vanzin
10
by rxin
Re: Repartitioning Hive tables - Container killed by YARN for exceeding memory limits by Chetan Khatri
4
by Chetan Khatri
jenkins is going down NOW -- POWER OUTAGE DUE TO FIRE by shane knapp
2
by shane knapp
Some PRs not automatically linked to JIRAs by Bryan Cutler
5
by Hyukjin Kwon
Question about manually running dev/github_jira_sync.py by Hyukjin Kwon
0
by Hyukjin Kwon
Heap Settings for History Server by Neelesh Salian
0
by Neelesh Salian
Tests failing with run-tests.py SyntaxError by Sean Owen
7
by shane knapp
Failing to write a data-frame containing a UDT to parquet format by Erik Erlandson-2
0
by Erik Erlandson-2
Cosine similarity between documents (rows) - spark by Darsh
0
by Darsh
How to print the content of each RDD in topics? And how to convert data in topics into data frames? by prudhviteddu
0
by prudhviteddu
(no subject) by Hao Chen
0
by Hao Chen
Interested in contributing to spark eco by Shashi Dongur
1
by rxin
Support Dynamic Partition Inserts params with SET command in Spark 2.0.1 by Chetan Khatri
3
by Chetan Khatri
Questions about Stateful Operations in SS by Zhang, Lubo
4
by Zhang, Lubo
Question on HashJoin trait by Chang Chen
0
by Chang Chen
Using UDFs in Java without registration by Justin Uang
4
by Justin Uang
Performance tracking by Kanielc
0
by Kanielc
Speeding up Catalyst engine by Maciej Bryński
2
by Maciej Bryński
Question on Spark code by tao zhan
5
by tao zhan
Output Committers for S3 by Matthew Schauer
19
by tafranky
Fwd: spark git commit: [SPARK-21472][SQL] Introduce ArrowColumnVector as a reader for Arrow vectors. by Jacek Laskowski
2
by Takuya UESHIN
Task partition ID in Spark event logs by Michael Mior
0
by Michael Mior
What does spark.python.worker.memory affect? by Cyanny LIANG
0
by Cyanny LIANG
How to tune the performance of Tpch query5 within Spark by 163
7
by 163