R on HPC Systems

Christopher Cameron, Adam Brazier, Linda Woodard (original author)
Cornell Center for Advanced Computing

Revisions: 10/2021, 5/2018, 7/2014 (original)

This topic focuses on how to run R in HPC environments, particularly on TACC's Stampede3 and Frontera supercomputers. It covers R configuration, interactive use on compute nodes via the command line and R Studio, and batch compute jobs.

Objectives

After you complete this topic, you should be able to:

  • Distinguish between RStats and RPackages
  • Explain how to download and install additional packages
  • Describe the benefits of running R on supercomputers
  • Explain the purpose of the TACC Visualization Portal
  • Distinguish between interactive and batch modes on Stampede2 and Frontera
  • Submit a job with Slurm
Prerequisites

This topic assumes the reader has no prior experience with R. The exercises and examples assume some familiarity with statistical analysis and working through the exercises on Stampede2 or Frontera requires a basic knowledge of Linux and the ability to access these systems via SSH.

Carrying out activities on Stampede2 or Frontera will require an appropriate TACC allocation. As an alternative, some activities could be carried out on a local installation of R and others on another HPC resource with Slurm and R installed.

 
©   Cornell University  |  Center for Advanced Computing  |  Copyright Statement  |  Inclusivity Statement