Tutorial by Examples

When asking Apache Spark related questions please include following information Apache Spark version used by the client and Spark deployment if applicable. For API related questions major (1.6, 2.0, 2.1 etc.) is typically sufficient, for questions concerning possible bugs always use full version ...
Example Data Please try to provide a minimal example input data in a format that can be directly used by the answers without tedious and time consuming parsing for example input file or local collection with all code required to create distributed data structures. When applicable always include ty...
Debugging questions. When question is related to debugging specific exception always provide relevant traceback. While it is advisable to remove duplicated outputs (from different executors or attempts) don't cut tracebacks to a single line or exception class only. Performance questions. Dependin...
Search Stack Overflow for duplicate questions. There common class of problems which have been already extensively documented. Read How do I ask a good question?. Read What topics can I ask about here? Apache Spark Community resources

Page 1 of 1