Hi! I'm Diana Xu

data analyst & software engineer

I developed scalable web applications and analyzed real-world big data in both startup and industry environments.

contact me | my resume

Who I am

About me

Hi, my name is Diana Xu, and I recently earned my master’s degree in computer science from Cornell. I bring hands-on experience in developing web apps, analyzing real-world data, and building data pipelines. I’m driven to solve complex problems with reliable, effective solutions, and I continuously seek opportunities to learn and grow. I’m looking for full-time roles in software engineering, particularly in frontend, full-stack, web development, or data analysis, but I’m also open to other related positions!

  • Languages

    JavaScript, Python, C++, React Js, Next Js, Node Js, Flask, RESTful API, HTML, CSS, Tailwind CSS, SQL, etc.

  • Education

    2024 Cornell MEng in Computer Science, 2023 UMich BS in Computer Science and Data Science

  • Projects

    Including med-tracking portal, research data analysis tool, book-award ML prediction, etc.

What I'm interested in

My Specialities

Web app

Developed more than 5 web applications, from school projects to internship tasks

Algorithm

Solid knowledge of and practical experience with object-oriented programming, data structures, and algorithms

Data analysis

Experienced in cleaning and analyzing more than 10 million raw data records of different types

UI design

Designed watch, phone, and web platforms with Figma

What I did

My projects

Hugmed

Med-tracking Portal | Startup Project | Jan 2024 - Present

React Js, Node Js, JavaScript, SQL, PostgreSQL, RESTful API, Supabase, Vercel, AWS, HTML, CSS
  • Sign-In / Sign-Up: Secure sign-in and sign-up flows with a required HIPAA consent form
  • Email Verification: Identity confirmation via secure SMTP link
  • Medication Dashboard: Displays medications, logs, and statistics
  • Medication Management: Add, delete, or update medication details
  • Weekly Reports: Visualizes adherence with downloadable PDFs
  • Help Center: Searchable FAQs with instructional videos
  • Settings Page: Update profile and manage data-sharing preferences with caregivers

Portfolio

My Personal Portfolio Website | Personal Project | Jan 2025 - Present

Live demo for Portfolio | GitHub repository for Portfolio
React Js, Next Js, JavaScript, Vercel, HTML, Tailwind CSS

This portfolio website is my way of showcasing my skills while diving into new challenges and learning along the way! I discovered how powerful and flexible Tailwind CSS is, using it to create fun animations and unique styles effortlessly. I also focused on making the site accessible and responsive, with features like a navigation bar that magically adapts to any screen size. This project has been such an exciting journey, blending creativity and problem-solving to build something intuitive, inclusive, and truly me!

  • Responsive design ensuring seamless functionality across mobile and desktop devices
  • Image carousel for showcasing projects with smooth transitions
  • Interactive hover animations enhancing user engagement with modules
  • Adaptive hamburger menu and side navigation bar for compact screens
  • Intuitive navigation with tags directing users to specific page sections

NMA Tool - Sumitovant Biopharma

Automated Pipeline for Data Insights | Intern Project | May 2022 - Aug 2022

Python, Flask, R, HTML, CSS, XML

Driven by my passion for big data and a desire to simplify complex processes, my two-term internship at Sumitovant Biopharma was an incredible opportunity to transform how data analysis is done. I developed an in-house tool in Python and R (connected via rpy2) to streamline network meta-analysis (NMA) on clinical data, ranking treatments across multiple studies. The tool tackled challenges like inconsistent treatment names, using predictive similarity scoring with 95% accuracy, and automated data transformation. It also applies the appropriate analysis model for each data type (binary, proportional, numerical, etc.) and delivers results within seconds, making data analysts’ work faster and more efficient.

  • Extract and process data from hundreds of ClinicalTrials.gov XML files
  • Predict and standardize treatment names to align with the Cortellis drug list
  • Automate the generation of comprehensive modeling results in one click
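
To illustrate the name-standardization idea described above, here is a minimal sketch in Python. It is not the tool's actual method: it swaps in the standard library's `difflib.SequenceMatcher` as a stand-in similarity score, and the reference list, the `standardize` helper, and the 0.8 threshold are all hypothetical values chosen for the example.

```python
from difflib import SequenceMatcher
from typing import List, Optional


def similarity(a: str, b: str) -> float:
    """Return a 0..1 similarity ratio, ignoring case and surrounding spaces."""
    return SequenceMatcher(None, a.strip().lower(), b.strip().lower()).ratio()


def standardize(raw_name: str, reference_names: List[str],
                threshold: float = 0.8) -> Optional[str]:
    """Map a raw treatment name to its closest reference name.

    Returns None when no reference name reaches the (assumed) threshold,
    so that unmatched names can be reviewed by hand.
    """
    if not reference_names:
        return None
    best = max(reference_names, key=lambda ref: similarity(raw_name, ref))
    return best if similarity(raw_name, best) >= threshold else None


# Hypothetical reference list, for illustration only.
reference = ["Placebo", "Metformin", "Insulin glargine"]

print(standardize("Metformn", reference))  # misspelled name maps to "Metformin"
print(standardize("Aspirin", reference))   # no close match, so None
```

Returning `None` below the threshold, instead of forcing a best guess, keeps uncertain matches visible for manual review.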

Newsearch

News Search Engine | School Project | Aug 2023 - Dec 2023

Live demo for Newsearch
React Js, Next Js, JavaScript, HTML, CSS, Firebase

This news search platform, built with React and Next.js, brings the latest New York Times articles right to users’ fingertips through the NY Times API! Beyond just browsing, it lets users save their favorite articles and offers a seamless, personalized experience. When users are logged in, the platform (powered by Firebase for web analytics and Firestore as the non-relational database) remembers their search history, which they can edit anytime. It’s designed to make revisiting topics or rediscovering past interests effortless and enjoyable.

  • Search functionality to retrieve New York Times articles by keywords
  • Dedicated user portal for saving and managing favorite articles
  • Redirect links to access the original news source

Insta485

Instagram-like Web System | School Project | Aug 2022 - Dec 2022

GitHub repository for Insta485
React Js, Flask, Python, JSON, SQLite, AWS

This is an Instagram clone project for the EECS 485 course at UMich, the first course in which I learned how to build a web application. Creating it was an amazing learning experience that developed both my front-end and back-end skills. React was used to render dynamic client-side pages for the home feed and for individual posts. Python and Flask were used to build the REST API, Flask cookies were used to store usernames and authenticate users, and SQL was used for database access and storage. The web app was finally deployed on an AWS EC2 instance.

  • Creating accounts, following, liking, commenting, uploading photos, and more
  • Double-click on an unliked image to like it
  • Immediately show new likes and comments
  • Infinite scroll by displaying the next 10 posts when scrolling to the bottom of the page
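
The infinite scroll above depends on the server returning posts one page at a time. The following is a minimal sketch of that paging logic in plain Python, not the project's actual code: the `get_posts_page` helper, the `/api/posts/` URL, and the `size` and `page` parameter names are assumptions made for illustration.

```python
from typing import Dict, List, Union


def get_posts_page(post_ids: List[int], size: int = 10,
                   page: int = 0) -> Dict[str, Union[List[int], str]]:
    """Return one page of post ids plus the URL of the next page.

    post_ids is assumed to be sorted newest first. The "next" field is an
    empty string on the last page, which tells the client to stop fetching.
    """
    if size <= 0 or page < 0:
        raise ValueError("size must be positive and page must be non-negative")
    start = page * size
    results = post_ids[start:start + size]
    has_more = start + size < len(post_ids)
    # Hypothetical route; a real API would define its own URL scheme.
    next_url = f"/api/posts/?size={size}&page={page + 1}" if has_more else ""
    return {"results": results, "next": next_url}


# 25 hypothetical post ids, newest first.
all_posts = list(range(25, 0, -1))
first = get_posts_page(all_posts)
last = get_posts_page(all_posts, page=2)
print(len(first["results"]), first["next"])
print(len(last["results"]), repr(last["next"]))
```

On the client side, a scroll handler would request `first["next"]` when the user reaches the bottom of the page and append the new results to the feed.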

MDP - ProQuest

Book-award ML Prediction | Research Project | Jan 2022 - Dec 2022

Dask, NumPy, Pandas, Scikit-learn, Python, Jupyter Notebook

As part of the UMich Multidisciplinary Design Program sponsored by ProQuest, we developed a machine learning model to predict the likelihood of books winning awards, designed for deployment on Rialto, an academic marketplace for data-driven librarian decisions. Tackling challenges like only 0.03% of books being award winners in a 10M+ dataset and 80% missing data in numerous fields, we invested significant effort in merging datasets, handling missing values, and transforming raw data into usable insights. We analyzed 6 data sources with a total of 75 features, from which we selected or engineered 6 features for our final model. Through rigorous preprocessing, time-series analysis, and Random Forest Regression, we achieved a 31% match rate between the predicted and actual top 100 books, highlighting the strength of our data preparation and modeling pipeline.
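
One way to read the top-100 match rate is as the overlap between the books the model ranks highest and the books that actually rank highest. The sketch below assumes that definition, which may differ from the metric the project used; the `match_rate` helper and the toy scores are hypothetical.

```python
from typing import Dict, List, Set


def top_k(scores: Dict[str, float], k: int) -> List[str]:
    """Return the k keys with the highest scores, highest first."""
    return sorted(scores, key=scores.get, reverse=True)[:k]


def match_rate(predicted: Dict[str, float], actual_top: Set[str], k: int) -> float:
    """Fraction of the predicted top-k items that appear in the actual top set."""
    predicted_top = set(top_k(predicted, k))
    return len(predicted_top & actual_top) / k


# Toy data for illustration only: predicted award likelihood per book.
predicted_scores = {"book_a": 0.90, "book_b": 0.80, "book_c": 0.10, "book_d": 0.05}
actual_top_books = {"book_a", "book_c"}

print(match_rate(predicted_scores, actual_top_books, k=2))  # 0.5
```

With winners making up such a small share of the data, a ranking-overlap measure like this says more than plain accuracy, which a model could maximize by predicting that no book ever wins.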

Connect with me

Get in touch

Feel free to contact me! You can email me at:

diana.xu.cornell@gmail.com