# Major Qualifying Project

The Major Qualifying Project (MQP) is a professional-level design or research experience completed by every WPI undergraduate. An integral element of WPI's project-based education, the immersive capstone project enables students to synthesize their learning by tackling and solving real-world problems in their fields of study.

CONTACT
Location: Unity Hall
Office Location: 4th Floor, Room 457 (Temporarily Moved Due to Stratton Hall Renovation)
Phone: 508-831-5241
Fax: 508-831-5824

## Actuarial Mathematics

Title: Copula Modeling:  An Application to Workers’ Compensation Claims
Students: Alison Lambert, Donovan Robillard, Lexi Ferrini

Copulas are multivariate probability distributions used in the modeling of multiple random variables. In insurance, they are used to create models that preserve the relationship between a claim’s loss amount and any associated expenses, especially in large loss scenarios. The goal of this project was to develop a copula model for losses and their associated expenses and determine whether their use produces different results than current modeling methods. Through data analysis and simulation, the team identified that a copula model could be applied to claims in Workers’ Compensation. It was found that the copula model did not produce significantly different results than those produced using the sponsor’s traditional methods, validating their current models.

Title: Trends in Health Reserves
Students: Danielle Rubin and Bianca Amarasekera

This paper evaluates reserving trends in the health insurance industry. Our analysis used 2016 to 2019 data from Annual Health Statements and includes 1,749 distinct health companies and subsidiaries in Medical, Medicare, Medicaid, and the Total of lines of business. We created and examined various distributions that describe the accuracy and conservativeness of the industry. Also, we analyzed how past reserving trends relate to future trends and identified various attributes that are correlated with reserve estimates. Lastly, using our analysis and 2016 to 2018 data, we created a model to forecast future reserve adequacy. We tested the model by predicting 2019 reserve accuracy and comparing it to the actual 2019 data.

## Analysis

Title: Stability Properties of a Crack Inverse Problem in Half Space
Student: Andrew Murdza (MA)

This project focuses on the analysis of a partial differential equation model relevant in the field of geophysics where sensors can capture seismic and displacement data. The question studied involves whether the geometry of faults, total slip between plates, and accumulated mechanical stress can be determined given data that results in an overdetermined system.

## Computational Biomechanics

Title: A Marker-Point Model for Simulation of Elastic Surface Deformation
Student: Kamryn P. Spinelli (MA)

This project developed a higher-accuracy method to simulate the deformation of rotationally-symmetric surfaces such as plant cells. Such a model increases computational efficiency and could be used to better understand how plant cells grow or how deformation can encourage diffusion on the cellular or nuclear level. The project incorporates elements of numerical analysis and differential geometry.

## Discrete Mathematics

Title: Tiling of Prime and Composite Kirchhoff Graphs
Student: Jessica Wang (MA)

A Kirchhoff graph is a connected vector graph (a graph whose edges are vectors) whose cycles are orthogonal to its vertex cuts. For a given matrix, the cycles of a Kirchhoff graph for the matrix form a basis for its null space, while the vertex cuts of the graph lie in its row space.  If the Kirchhoff graph is vector 2-connected, then all edge vectors will appear with the same multiplicity.  This property is used to develop an exhaustive search algorithm to find all the Kirchhoff graphs for the given matrix up to a certain multiplicity.  The algorithm in turn is used to study the structure of families of Kirchhoff graphs, particularly prime, composite and fundamental Kirchhoff graphs.  For a given matrix, there are no non-trivial Kirchhoff graphs smaller than a critical multiplicity, and a countable infinite number of prime Kirchhoff graphs whose multiplicity is some multiple of this critical multiplicity.

Title: The Calkin-Wilf Tree: Extensions and Applications
Student: Ben Gobler (MA)
Advisors: Brigitte Servatius and Herman Servatius

Continued fractions are of current interest in mathematics. In a recent publication, Jack E. Graver describes a method for computing terms in the Calkin-Wilf sequence, a list of the positive rationals introduced by Neil Calkin and Herbert S. Wilf in 2000. This paper explores an original method which uses continued fractions to evaluate and locate terms in the Calkin-Wilf sequence, as well as its natural extension to include all of the rational numbers. A generalization of the Calkin-Wilf tree leads to a characterization of rational numbers by continued fractions with integer coefficients. Finally, the meaning of infinite continued fractions and irrational numbers is studied using the structure of the Calkin-Wilf tree. We characterize the irrational numbers which have periodic continued fractions by developing a matrix representation of the setup, and we explain why irrational square root numbers have periodic continued fractions with palindromic coefficients.

## Financial Mathematics

Title: Deep Learning for Reflected Backwards Stochastic Differential Equations
Students: Frederick "Forrest" Miller

In this work, we in investigate the theory and numerics of reflected backwards stochastic differential equations (RBSDEs). We review important concepts from stochastic calculus, as well as key theoretical properties of (R)BSDEs. We provide an overview of feedforward neural networks and their applications to functional approximation for numerical implementations. We also discuss the key application of RBSDEs to the field of mathematical finance, in particular indifference pricing of put options. Lastly, we present preliminary theoretical and numerical results of Risk Indifference pricing of American from both the Buyer’s and Seller’s perspectives.

Title: Computations in Option Pricing Engines
Students: Vital Mendonca Filho (MA), Pavee Phongsopa (MA), Nicholas Wotton (MA)
Advisors: Yanhua Li (CS), Qingshuo Song (MA), Gu Wang (MA),

This project explores how machine learning techniques can be utilized in financial models such as option pricing methods.

## Financial Statistics

Title: Analyzing the Dynamic Relationship Between Intraday Trading Activity and Volatility Using High-Frequency Data
Students: Emily Baker (MA), Ryan Candy (MA), Isadora Coughlin (MA)
Advisors: Fangfang Wang (MA), Jian Zou (MA)

Measuring equity volatility is an important metric and understanding, describing, and predicting volatility using trading activity provides insight into navigating the stock market. Taking inspiration from existing research analyzing volatility in the stock market, we explore the dynamic relationship between trading volume, trading frequency, and volatility on an intraday basis across ten stocks in the consumer discretionary sector of the S&P 100 for the fourth quarter of 2013. Using three different volatility measures we implement variations of the heterogeneous autoregressive model and vector autoregressive model to investigate the lead-and-lag relationship between volatility and trading activity. Our quantitative analysis provides strong empirical evidence that current trading frequency and trading volume can be used to predict 30-minute measures of volatility and that the prior day rolling average and lagged trading measures are useful predictors in modeling the volatility measures.

## Machine Learning

Title: Dynamical Systems Approaches for Deep Learning
Student: Neil Kale

This study explores iterative neural networks (INNs), which reimagine neural network designs as iterated functions, and the recently introduced Sequential2D framework for neural networks that frames INN functions as left matrix multiplications for enhanced computational efficiency. We investigate the effects of sparse and low-rank matrix approximations on model performance, particularly focusing on sparsity and weight distribution using the MNIST Random Anomaly Task. Our results highlight the delicate balance between parallelization advantages and the need for equitable weight distribution. The comparison of sparse, low-rank, and dense matrices reveals low-rank matrices' role in boosting computational speed without drastically affecting model accuracy. Overall, this research advances our understanding of INNs and Sequential2D, underlining the significance of matrix representation methods in fine-tuning neural network architectures for improved performance and efficiency.

Title: Player Performance Prediction Automation for Draftkings
Students: Kayleigh Cambell (MA), Diego Gonzalez Villalobos (ECE), Benjamin Huang (ECE), Gabriel Katz (MA), Skylar O'Connell (CS)
Advisors: Donald Brown (ECE), Randy Paffenroth (MA)

In this Major Qualifying Project, we worked with DraftKings, an online daily fantasy sports company, to build a model that would predict the number of points, rebounds, and assists a NBA basketball player would score in a game, as well as the closing lines for NBA games. Our data sources were numerical and categorical data from DraftKings as well as data gathered from third-party sources. Then, we utilized algorithms such as linear regression, Lasso regression, random forest, and neural networks to predict the number of points, rebounds, and assists a NBA basketball player would score in a game, as well as the game lines for NBA games. We were able to reduce the standard deviation of our prediction error substantially for both game lines and for the number of points, rebounds, and assists a NBA basketball player would score in a game.

## Mathematical Biology

Title: Parameter Estimation of Cancer Cell Dynamics
Student: Lynne Moore (MA)
Advisors: Andrea Arnold and Sarah Olson
Co-Advisor: Mike Lee (UMass Medical School)

To screen different drugs as potential cancer therapies, high throughput experiments have been designed to study the dynamics of cell growth and death. Due to experimental costs and limitations, the cell data can only be recorded at a minimal number of time points. In this project, through parameter estimation techniques, we studied the optimal timing of data in order understand time points most relevant to capture cell dynamics.

## Probability and Statistics

Title: An M-Estimator for the Survivor Average Treatment Effect
Student: Laura Dufresne (MA and SSPS)

Randomized Controlled trials are considered the gold standard in program effectiveness and other causal research in the social and biomedical sciences because randomizing treatment assignment ensures that subjects in different treatment conditions are otherwise statistically equivalent. However, if some subjects drop out of a randomized study before completion, the remaining subjects may no longer be equivalent across treatment groups. In this study, we developed a novel estimator for the effect of an intervention on the subset of students who would remain in a study regardless of their treatment assignment and applied the method to estimated the effects of an educational technology application from an experiment that took place during the COVID pandemic, which led to missing outcome data for nearly half of participating students.

View additional MQP’s in our project database.

## Scientific Computation

Title: 3D Image Reconstruction of a Fossil Using Neutron Tomography
Students: Agustine Benjamin (PH), Scarlett Clarke (MA)