Course

Data Wrangling Basics

Nov 14, 2025 - Nov 14, 2025
1 credit

Enroll

Full course description

Term: Fall 2025

Date: November 14th, 2025

Time: 12:00 to 1:00 p.m.

Location: Online Only

Instructor: Matthew Brown

Presented By: Advanced Research Computing (ARC)

 

Description:

Prerequisites:
Have an ARC account enabling login to ARC systems
Basic familiarity with Unix Shell commands and navigating files and directories
Be connected to a VT network (e.g., on campus or connected to VPN)


Data storage and transfer are critical components of projects using ARC resources. This workshop will provide (1) an overview of ARC storage systems, describe their characteristics and intended uses, (2) demonstrate tools available for file transfers including command line tools, and Open OnDemand and Globus, and the best uses for each, (3) describe standard linux tools for packaging and compressing datasets and demonstrate the performance impact that this can have. We will practice transferring data with scp, rsync, and Globus. Through these examples, we will highlight best practices for different cases.

Sign up for this course today!

Enroll