< Back to list

Unleashing the Problem-Solving Potential of Next-Generation Data Scientists

bookSection

DOI:N/A
Authors: Carroll John M. / Zhu Lizhen / Wang James Z.

Extracted Abstract:

Data science, an emerging multidisciplinary field, resides at the intersec- tion of computational sciences, statistical modeling, and domain-specific sciences. The current norm for data science education predominantly focuses on graduate programs, which presume a preexisting knowledge base in one or more relevant sciences. However, this framework often overlooks those who don’t plan to pursue graduate studies, thereby limiting their exposure to this rapidly expanding field. Penn State addressed this gap by establishing one of the first undergraduate degree programs in data sciences, a collaboration between the College of Information Sciences and Technology, the Department of Computer Science and Engineering, and the Department of Statistics. One key component of this program is a project- focused, writing-intensive course designed for upper-class undergraduates. This course guides students through the entire data science project pipeline, from prob- lem identification to solution presentation. It allows students to apply foundational data science principles to real-world problems, advancing their understanding through practical application. This chapter details the objectives, rationale, and course design, alongside reflections from our teaching experience. The insights provided could be helpful to instructors developing similar data science programs or courses at an undergraduate level, broadening the influence of this important field. Keywords Undergraduate data science education · Project-based learning · Interdisciplinary curriculum · Interpersonal skills · Intrapersonal skills

Level 1: Include/Exclude

  • Papers must discuss situated information visualization* (by Willet et al.) in the application domain of CH.
    *A situated data representation is a data representation whose physical presentation is located close to the data’s physical referent(s).
    *A situated visualization is a situated data representation for which the presentation is purely visual – and is typically displayed on a screen.
  • Representation must include abstract data (e.g., metadata).
  • Papers focused solely on digital reconstruction without information visualization aspects are excluded.
  • Posters and workshop papers are excluded to focus on mature research contributions.
Show all meta-data