This part of development you should serialize the python RDD to the JVM. This process uses the main development of Spark to call the jar function.
from pyspark.serializers import PickleSerializer, AutoBatchedSerializer
rdd = sc.parallelize(range(10000))
reserialized_rdd = rdd._reserialize(Aut...