IPv6 Standalone Mode

classic Classic list List threaded Threaded
1 message Options
Reply | Threaded
Open this post in threaded view

IPv6 Standalone Mode

My team and I are attempting to run Spark Standalone on IPv6-first
infrastructure. This requires that all RPC listeners bind IPv6 sockets e.g.
`:::7077` instead of ``. Initial experimentation has found
that Spark 2.4.4 doesn't currently handle this scenario. Various host/bind
addresses can be configured e.g. `spark.driver.host` `SPARK_LOCAL_IP` e.t.c.
however these must be set to host names and will fail to start otherwise. To
get *something* running, I have implemented some 'hacks' (link to diff
https://gist.github.com/SteadBytes/b206335c8e2429486a13421b7cfcdc88) which
force the desired IPv6 bindings which, in combination with using FQDNs for
advertised addresses does work .

However this is clearly not a feasible solution to the issue and I'm looking
for some guidance from those more experienced with the codebase on how to
proceed. Is there a better approach without modifying Spark itself that I
have simply missed? If not, is this support something that would be desired
and how might it best be implemented? I am prepared to contribute the work
and upstream changes that we may need to make for this.

Ben Steadman
Software Engineer at Unipart Digital

Sent from: http://apache-spark-developers-list.1001551.n3.nabble.com/

To unsubscribe e-mail: [hidden email]