AbstractThis is the abstract.

Start with some statements about how bad cancer is.

Paragraph about the importance of mathematical modeling in understanding cancer and developing treatment strategies.

Paragraph about why properly describing tumor growth is important.

Paragraph about previous work in this field.

Summary/outline paragraph.

According to the CDC, cancer is the second leading cause of death in the United States (Citation?).

Understanding the pattern of growth in a tumor is principle in understanding how to treat cancer. Without a thorough understanding of the factors that cause tumor growth, treatment options are limited in scope. That is to say that the best way to slow or even stop tumor growth is to understand what causes it primarily. While there exist many promising treatments for tumor growth, any of these would be greatly improved by a grounding in empirically tested mathematical models. These mathematical models will allow research pertaining to cancer and tumor growth to find parameters for their proposed treatments and to test these proposed treatments quickly and efficiently. Such a mathematical model would need to be appropriately complex, based in empirical evidence, and able to give insights into the nature of cancer growth that are useful for research purposes.

In tumor growth mapping, there are multiple possible ways to model the growth rate. Different tumors behave differently, although we can find a general idea for a certain type of tumor.

\[\label{eq:ExponentialFit} \dot{V}=\lambda V\]

\[\label{eq:PowerRuleFit} \dot{V}=\lambda V^{a}\]

\[\label{eq:LogisticFit} \dot{V}=\lambda V\left(1-\frac{V}{K}\right)\]

\[\label{eq:LinearFit} \dot{V}=\frac{aV}{V+b}\]

\[\label{eq:SurfaceFit} \dot{V}=\frac{aV}{(V+b)^{\frac{1}{3}}}\]

\[\label{eq:GompertzFit} \dot{V}=aV\ln\left(\frac{b}{V+c}\right)\]

\[\label{eq:BertalanffyFit} \dot{V}=aV^{\frac{2}{3}}-bV\]

These are the seven best fits for a tumor. Once we get enough samples, we are able to select one equation for a generic case of a lung tumor, pancreatic tumor, or any other type. In order to understand what the equations mean, firstly we define \(\dot{V}\), the tumor cell population growth rate. Setting \(V\) as the tumor cell population, we define: \[\dot{V}=\frac{dV}{dt}\] It isn’t necessary to use \(\dot{V}\) instead of \(\frac{dV}{dt}\), but is helpful when writing out the equations. Each variable in these equations has some effect on the tumor growth. One example is \(\lambda\), the “intrinsic growth rate” of the tumor. Another is \(a\), a placeholder for an exponent. Then \(K\) is the “carrying capacity” of the tumor, and \(b\) is shorthand for \(\frac{1}{K}\), the inverse of the carrying capacity.

Now that we know what the equations define, we can differentiate between them. \eqref{eq:ExponentialFit} is the Exponential Fit Equation, named for having

After selecting the data, we fit each model to the data sets we gathered. For each of the seven models we tested, we gave each parameter the ability to change freely in a way so that the Sum of Squared Residuals was normalized and then minimized between the data and the curve. The normalization of the SSR values served the purpose of preventing the larger values at the end of data sets from being weighted more than those at the beginning. Without this normalization, many curves fit only to the initial and final points, while the data contained within these values were ignored by the minimization of the SSR function. The normalization applied an inverse square law to the SSR values so that the data contained in later values was reduced by a great amount and the data contained in early values was reduced by a small amount. The data as represented on our graphs was not normalized, only the SSR values that correspond to each point were.

To select the best fit model we used the Akaike Information Criterion corrected for finite sample sizes. This measure describes how much information is gained by extra parameters when fitting models. Therefore, a better AICc means a balance between the ability of a model to predict the data and the minimization of number of parameters. This measure has an advantage over similar equations, because the main limiting factor is the number of parameters. It is especially useful when compared to the Bayesian Information Criterion that operates under the assumption that the number of data points is much greater than the number of parameters, which is an issue when many sources for tumor growth data do not include enough data points to satisfy this condition. (Citation?) (Also a weirdly worded sentence, I’ll fix that). The AICc does require a certain number of data points, but is explicit in that if not enough data is used, the equation will render a zero in the denominator returning an error message. This is helpful in that it prevents conclusions from being drawn from unsubstantial data sets.

Describe general characteristics of the data sets, cancer type, substrate, etc.

Make a table with all the data sets (include in supplementary material, but refer to it here).

Make a bar graph of how many times each model is the best fit.

Discribe normalized duration and why we use it.

Make a bar graph or box & whiskers plot of normalized duration vs. model.

Make bar graphs of how many times each model is the best fit for in vitro and in vivo.

Discuss chi-squared and multinomial regression results.

Make bar graphs of how many times each model is the best fit for different types of cancer.

Discuss chi-squared and multinomial regression results.