We cover the integration of existing USGS HPC environment with Big Data software like Apache Hadoop and Spark using open-source software Magpie from Lawrence Livermore National Laboratory. The diverse examples on USGS supercomputer use both interactive PySpark shell and sbatch script submission with SLURM.