Dr. Anton Babkin, Assistant Research Professor, Dept. of Agricultural and Resource Economics, University of Connecticut
Seminar Title: “Scaling Up Empirical Research to Bigger Data with Python”
Abstract:
This workshop is intended for researchers who have experience analyzing data that fits comfortably in memory and are interested in scaling up to larger-than-memory datasets. The following topics will be covered: measuring performance and memory usage; sampling and the split-apply-combine strategy; data type optimization; efficient storage with Parquet; simple parallelization; and an introduction to Dask. All workshop materials will be publicly available in a GitHub repository, including instructions on setting up a programming environment for those who want to follow along.
Wednesday, Jan 27, 2021 (Session 1)
Wednesday, Feb 3, 2021 (Session 2)
2:30pm – 3:30pm (EST)
Link to Join:
https://uconn-cmr.webex.com/uconn-cmr/j.php?MTID=me1476925402023dfa674617b826f7718
Meeting number (access code): 120 279 9820
Meeting password: PUp8PUpuJ64
For more information, contact: Tatiana Andreyeva at tatiana.andreyeva@uconn.edu