# Major Qualifying Project

The Major Qualifying Project (MQP) is a professional-level design or research experience completed by every WPI undergraduate. An integral element of WPI's project-based education, the immersive capstone project enables students to synthesize their learning by tackling and solving real-world problems in their fields of study.

**MQPs completed by Actuarial Mathematics (MAC) and Mathematical Sciences (MA) students:**

## Actuarial Mathematics

**Title:** Copula Modeling: An Application to Workers’ Compensation Claims**Students:** Alison Lambert, Donovan Robillard, Lexi Ferrini**Advisors:** Jon Abraham, Barry Posterro**Sponsor:** Hanover Insurance Group**Academic Year:** 2021-2022

Copulas are multivariate probability distributions used in the modeling of multiple random variables. In insurance, they are used to create models that preserve the relationship between a claim’s loss amount and any associated expenses, especially in large loss scenarios. The goal of this project was to develop a copula model for losses and their associated expenses and determine whether their use produces different results than current modeling methods. Through data analysis and simulation, the team identified that a copula model could be applied to claims in Workers’ Compensation. It was found that the copula model did not produce significantly different results than those produced using the sponsor’s traditional methods, validating their current models.

**Title:** Trends in Health Reserves**Students:** Danielle Rubin and Bianca Amarasekera**Advisors:** Jon Abraham, Barry Posterro**Sponsor:** Milliman**Academic Year:** 2020-2021

This paper evaluates reserving trends in the health insurance industry. Our analysis used 2016 to 2019 data from Annual Health Statements and includes 1,749 distinct health companies and subsidiaries in Medical, Medicare, Medicaid, and the Total of lines of business. We created and examined various distributions that describe the accuracy and conservativeness of the industry. Also, we analyzed how past reserving trends relate to future trends and identified various attributes that are correlated with reserve estimates. Lastly, using our analysis and 2016 to 2018 data, we created a model to forecast future reserve adequacy. We tested the model by predicting 2019 reserve accuracy and comparing it to the actual 2019 data.

## Analysis

**Title:** Stability Properties of a Crack Inverse Problem in Half Space**Student:** Andrew Murdza (MA)**Advisor:** Darko Volkov**Academic Year:** 2019-2020

This project focuses on the analysis of a partial differential equation model relevant in the field of geophysics where sensors can capture seismic and displacement data. The question studied involves whether the geometry of faults, total slip between plates, and accumulated mechanical stress can be determined given data that results in an overdetermined system.

## Computational Biomechanics

**Title:** A Marker-Point Model for Simulation of Elastic Surface Deformation**Student:** Kamryn P. Spinelli (MA)**Advisor: **Min Wu**Academic Year:** 2020-2021

This project developed a higher-accuracy method to simulate the deformation of rotationally-symmetric surfaces such as plant cells. Such a model increases computational efficiency and could be used to better understand how plant cells grow or how deformation can encourage diffusion on the cellular or nuclear level. The project incorporates elements of numerical analysis and differential geometry.

## Discrete Mathematics

**Title: **Tiling of Prime and Composite Kirchhoff Graphs**Student:** Jessica Wang (MA)**Advisors: **Joseph D. Fehribach (MA)**Academic Year:** 2021-2022

A Kirchhoff graph is a connected vector graph (a graph whose edges are vectors) whose cycles are orthogonal to its vertex cuts. For a given matrix, the cycles of a Kirchhoff graph for the matrix form a basis for its null space, while the vertex cuts of the graph lie in its row space. If the Kirchhoff graph is vector 2-connected, then all edge vectors will appear with the same multiplicity. This property is used to develop an exhaustive search algorithm to find all the Kirchhoff graphs for the given matrix up to a certain multiplicity. The algorithm in turn is used to study the structure of families of Kirchhoff graphs, particularly prime, composite and fundamental Kirchhoff graphs. For a given matrix, there are no non-trivial Kirchhoff graphs smaller than a critical multiplicity, and a countable infinite number of prime Kirchhoff graphs whose multiplicity is some multiple of this critical multiplicity.

**Title: **The Calkin-Wilf Tree: Extensions and Applications**Student:** Ben Gobler (MA)**Advisors: **Brigitte Servatius and Herman Servatius**Academic Year:** 2021-2022

Continued fractions are of current interest in mathematics. In a recent publication, Jack E. Graver describes a method for computing terms in the Calkin-Wilf sequence, a list of the positive rationals introduced by Neil Calkin and Herbert S. Wilf in 2000. This paper explores an original method which uses continued fractions to evaluate and locate terms in the Calkin-Wilf sequence, as well as its natural extension to include all of the rational numbers. A generalization of the Calkin-Wilf tree leads to a characterization of rational numbers by continued fractions with integer coefficients. Finally, the meaning of infinite continued fractions and irrational numbers is studied using the structure of the Calkin-Wilf tree. We characterize the irrational numbers which have periodic continued fractions by developing a matrix representation of the setup, and we explain why irrational square root numbers have periodic continued fractions with palindromic coefficients.

## Financial Mathematics

**Title**: Deep Learning for Reflected Backwards Stochastic Differential Equations**Students**: Frederick "Forrest" Miller**Advisors**: Stephan Sturm**Academic Year**: 2022-2023

In this work, we in investigate the theory and numerics of reflected backwards stochastic differential equations (RBSDEs). We review important concepts from stochastic calculus, as well as key theoretical properties of (R)BSDEs. We provide an overview of feedforward neural networks and their applications to functional approximation for numerical implementations. We also discuss the key application of RBSDEs to the field of mathematical finance, in particular indifference pricing of put options. Lastly, we present preliminary theoretical and numerical results of Risk Indifference pricing of American from both the Buyer’s and Seller’s perspectives.

**Title**: Computations in Option Pricing Engines**Students**: Vital Mendonca Filho (MA), Pavee Phongsopa (MA), Nicholas Wotton (MA)**Advisors**: Yanhua Li (CS), Qingshuo Song (MA), Gu Wang (MA),**Academic Year**: 2019-2020

This project explores how machine learning techniques can be utilized in financial models such as option pricing methods.

## Financial Statistics

**Title**: Analyzing the Dynamic Relationship Between Intraday Trading Activity and Volatility Using High-Frequency Data**Students**: Emily Baker (MA), Ryan Candy (MA), Isadora Coughlin (MA)**Advisors**: Fangfang Wang (MA), Jian Zou (MA)**Academic Year**: 2020-2021

Measuring equity volatility is an important metric and understanding, describing, and predicting volatility using trading activity provides insight into navigating the stock market. Taking inspiration from existing research analyzing volatility in the stock market, we explore the dynamic relationship between trading volume, trading frequency, and volatility on an intraday basis across ten stocks in the consumer discretionary sector of the S&P 100 for the fourth quarter of 2013. Using three different volatility measures we implement variations of the heterogeneous autoregressive model and vector autoregressive model to investigate the lead-and-lag relationship between volatility and trading activity. Our quantitative analysis provides strong empirical evidence that current trading frequency and trading volume can be used to predict 30-minute measures of volatility and that the prior day rolling average and lagged trading measures are useful predictors in modeling the volatility measures.

## Machine Learning

**Title**: Player Performance Prediction Automation for Draftkings**Students**: Kayleigh Cambell (MA), Diego Gonzalez Villalobos (ECE), Benjamin Huang (ECE), Gabriel Katz (MA), Skylar O'Connell (CS)**Advisors**: Donald Brown (ECE), Randy Paffenroth (MA)**Sponsor: **Draftkings**Academic Year**: 2020-2021

In this Major Qualifying Project, we worked with DraftKings, an online daily fantasy sports company, to build a model that would predict the number of points, rebounds, and assists a NBA basketball player would score in a game, as well as the closing lines for NBA games. Our data sources were numerical and categorical data from DraftKings as well as data gathered from third-party sources. Then, we utilized algorithms such as linear regression, Lasso regression, random forest, and neural networks to predict the number of points, rebounds, and assists a NBA basketball player would score in a game, as well as the game lines for NBA games. We were able to reduce the standard deviation of our prediction error substantially for both game lines and for the number of points, rebounds, and assists a NBA basketball player would score in a game.

## Mathematical Biology

**Title:** Parameter Estimation of Cancer Cell Dynamics**Student**: Lynne Moore (MA)**Advisors**: Andrea Arnold and Sarah Olson**Co-Advisor: **Mike Lee (UMass Medical School)**Academic Year**: 2019-2020

To screen different drugs as potential cancer therapies, high throughput experiments have been designed to study the dynamics of cell growth and death. Due to experimental costs and limitations, the cell data can only be recorded at a minimal number of time points. In this project, through parameter estimation techniques, we studied the optimal timing of data in order understand time points most relevant to capture cell dynamics.

## Probability and Statistics

**Title:** Incorporation of PPI in LRT**Student**: Dongni Zhang (MA)**Advisor**: Zheyang Wu**Academic Year**: 2014-2015

Statistical association studies have contributed significantly in the detection of novel genetic factors associated with complex diseases. Incorporation of biological information that reflects the complex mechanism of disease development is likely to increase the power of association tests. In this study, we develop a statistical framework for association studies that integrates the information of the functional effect of SNPs to the disease related protein-protein interactions. Based on both real and simulated phenotypes of hypertension, the method is compared with multiple well-known association tests for sequencing data.

View additional MQP’s in our project database.