Skip to content

joseph-nagel/pyspark-playground

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

9 Commits
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

PySpark playground

A playground for getting familiarized with PySpark is established. PySpark is the Python API for Apache Spark, an open-source framework for distributed big data processing. The contained examples widely follow the official documentation.

Getting started

  • PySpark can be easily installed via pip:

    pip install pyspark
    
  • An interactive shell session can be started by:

    pyspark
    

Notebooks

About

Getting started with PySpark

Topics

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published