Andrea Figueroa

PhD. Student in Human Centered Design & Engineering @UW

 About me

I'm currently a Research Assistant and PhD Student in Human Centered Design & Engineering (HCDE) and a member of the Human-Centered Data Science Lab at University of Washington, Seattle. I received my bachelor's and master's degrees in Informatics Engineering from Universidad Técnica Federico Santa María (UTFSM) in Chile.

My research interest is focused on Human Centered Data Science and Data Visualization. I'm interested in building tools to help researchers with qualitative and automated analysis of large datasets, supporting discussions to improve inter-rater agreement by creating powerful data visualizations, inter-rater reliability metrics, and algorithms to uncover important information in the process.

 Education

University of Washington

2019-2024

Phd. in Human Centered Design and Engineering

Universidad Técnica Federico Santa María

2016-2018

Msc. in Computer Engineering

Universidad Técnica Federico Santa María

2010-2016

Bsc. in Informatics Engineering

 Teaching

Instructor

UW

  • HCDE 511: Information Visualization - Winter 2021
  • HCDE 511: Information Visualization - Spring 2021

Instructor

UTFSM

  • Computer Programming - 1st Semester 2018 & 2019
  • Data Visualization - 2nd Semester 2018

Teaching Assistant

UW

  • Information Visualization (DATA 511)
    Prof: Nathan Mannheimer - Fall 2021
  • Information Visualization (DATA 511)
    Prof: Nathan Mannheimer - Summer 2021
  • Information Visualization (HCDE 411)
    Prof: Brock Craft - Fall 2020
  • Information Visualizacion (HCDE 511)
    Prof: Cecilia Aragon - Winter 2020

Teaching Assistant

UTFSM

  • Data Visualization - Prof: Cecilia Aragon - 1 semester
  • Artificial Intelligence - Prof: María-Cristina Riff - 4 semesters
  • Databases - Prof: Cecilia Reyes - 8 semesters
  • LabComp - Student-based Computer Lab - 6 semesters

 Research

  • Generalized Cohen’s Kappa: a new Metric for Inter-rater Reliability Assessment for Collaborative Coding
    Advisor: Cecilia Aragon; January 2020 - Ongoing

    Collaborative coding of large datasets of short texts, such as tweets or comments, has been a valuable tool for qualitative research for years. When it comes to assessing the inter-rater reliability of multiple coders, existing metrics have not evolved to fit current data and present a variety of restrictions. We propose a generalized Cohen’s Kappa based on Monte Carlo simulations that can be applied to large datasets with multiple coders and non mutually exclusive categories. Preliminar results show that this new metric can replace the widely used Cohen’s kappa metric, obtaining similar results in small and restricted settings and allowing flexibility in larger settings with complex data.


  • Text-prizm 2.0 - Advisor: Cecilia Aragon; June 2018 - Ongoing

    Qualitative coding is a labor-intensive process of manually reading and interpreting large amounts of data, many researchers use tools like Microsoft Excel or Google Spreadsheets for this task as most of the qualitative coding tools are not adapted to large online communication datasets, these tools can be effective but lack many useful features to make coding easier and more effective. Text-prizm is a web application for collaborative coding of large volumes of short text messages, its interface is built using a human-centered approach and aims to facilitate the qualitative coding and analysis of large online communication datasets.


  • Cultural Differences in Data Privacy Perspectives on Social Media - Advisors: Cecilia Aragon, Claudia López; Sept 2018 - Dec 2019

    The Cambridge Analytica scandal has triggered a discussion about data privacy in social media. Motivated by this context, we aim to answer this research question: Does the public online debate reveal different perspectives on data privacy across countries/cultures? A large-scale Twitter dataset around this issue with both English and Spanish tweets has been collected and we aim to analyze the data through both qualitative coding and automated analysis.


 Skills

  • Coding: C/C++, Python, Ruby, Javascript, PHP, R
  • Web Development: Ruby On Rails, JS/Jquery, HTML/CSS
  • Data Visualization: Tableau, D3, Plotly, Altair, Matplotlib
  • Data Manipulation: Tableau Prep, Pandas, NumPy, NLTK
  • Back-end: GNU/Linux, Nginx, MariaDB, PostgreSQL, Git, Scripting/Bash
  • Design: LaTeX, Adobe Photoshop, Gimp

 Publications

Peer-reviewed conference papers and posters


Workshop papers

  • "The Lineage of Human-Centered Data Science."
    Andrea Figueroa. Interrogating Data Science Workshop, CSCW 2020: ACM Conference on Computer Supported Cooperative Work (2020).

  • "Creating a Labeled Dataset of Cross-Language Data Privacy Reactions."
    Andrea Figueroa, Felipe González, Claudia López, and Cecilia Aragon. Mapping Out Human-Centered Data Science: Methods, Approaches, and Best Practices Workshop, GROUP 2020: ACM International Conference on Supporting Group Work, Sanibel Island, FL (2020).


In progress

  • "Generalized Cohen’s Kappa: a new Metric for Inter-rater Reliability Assessment for Collaborative Coding."
    Andrea Figueroa, Sourojit Ghosh, and Cecilia Aragon.

Get in touch