Statistics
Permanent URI for this community
Browse
Browsing Statistics by Title
Now showing 1 - 20 of 189
Results Per Page
Sort Options
Item A Bayesian Integrative Model for Genetical Genomics with Spatially Informed Variable Selection(Libertas Academica, 2014) Cassese, Alberto; Guindani, Michele; Vannucci, MarinaWe consider a Bayesian hierarchical model for the integration of gene expression levels with comparative genomic hybridization (CGH) array measurements collected on the same subjects. The approach defines a measurement error model that relates the gene expression levels to latent copy number states. In turn, the latent states are related to the observed surrogate CGH measurements via a hidden Markov model. The model further incorpo-rates variable selection with a spatial prior based on a probit link that exploits dependencies across adjacent DNA segments. Posterior inference is carried out via Markov chain Monte Carlo stochastic search techniques. We study the performance of the model in simulations and show better results than those achieved with recently proposed alternative priors. We also show an application to data from a genomic study on lung squamous cell carcinoma, where we identify potential candidates of associations between copy number variants and the transcriptional activity of target genes. Gene ontology (GO) analyses of our findings reveal enrichments in genes that code for proteins involved in cancer. Our model also identifies a number of potential candidate biomarkers for further experimental validation.Item A Bayesian model for the identification of differentially expressed genes in Daphnia magna exposed to munition pollutants(Wiley, 2015) Cassese, Alberto; Guindani, Michele; Antczak, Philipp; Falciani, Francesco; Vannucci, MarinaIn this article we propose a Bayesian hierarchical model for the identification of differentially expressed genes in Daphnia magna organisms exposed to chemical compounds, specifically munition pollutants in water. The model we propose constitutes one of the very first attempts at a rigorous modeling of the biological effects of water purification. We have data acquired from a purification system that comprises four consecutive purification stages, which we refer to as "ponds," of progressively more contaminated water. We model the expected expression of a gene in a pond as the sum of the mean of the same gene in the previous pond plus a gene-pond specific difference. We incorporate a variable selection mechanism for the identification of the differential expressions, with a prior distribution on the probability of a change that accounts for the available information on the concentration of chemical compounds present in the water. We carry out posterior inference via MCMC stochastic search techniques. In the application, we reduce the complexity of the data by grouping genes according to their functional characteristics, based on the KEGG pathway database. This also increases the biological interpretability of the results. Our model successfully identifies a number of pathways that show differential expression between consecutive purification stages. We also find that changes in the transcriptional response are more strongly associated to the presence of certain compounds, with the remaining contributing to a lesser extent. We discuss the sensitivity of these results to the model parameters that measure the influence of the prior information on the posterior inference.Item A Bayesian Nonparametric Approach for Functional Data Classification with Application to Hepatic Tissue Characterization(Libertas Academica, 2015) Fronczyk, Kassandra M.; Guindani, Michele; Hobbs, Brian P.; Ng, Chaan S.; Vannucci, MarinaComputed tomography perfusion (CTp) is an emerging functional imaging technology that provides a quantitative assessment of the passage of fluid through blood vessels. Tissue perfusion plays a critical role in oncology due to the proliferation of networks of new blood vessels typical of cancer angiogenesis, which triggers modifications to the vasculature of the surrounding host tissue. In this article, we consider a Bayesian semiparametric model for the analysis of functional data. This method is applied to a study of four interdependent hepatic perfusion CT characteristics that were acquired under the administration of contrast using a sequence of repeated scans over a period of 590 seconds. More specifically, our modeling framework facilitates borrowing of information across patients and tissues. Additionally, the approach enables flexible estimation of temporal correlation structures exhibited by mappings of the correlated perfusion biomarkers and thus accounts for the heteroskedasticity typically observed in those measurements, by incorporating change-points in the covariance estimation. This method is applied to measurements obtained from regions of liver surrounding malignant and benign tissues, for each perfusion biomarker. We demonstrate how to cluster the liver regions on the basis of their CTp profiles, which can be used in a prediction context to classify regions of interest provided by future patients, and thereby assist in discriminating malignant from healthy tissue regions in diagnostic settings.Item A Bayesian Nonparametric Spiked Process Prior for Dynamic Model Selection(Project Euclid, 2019) Cassese, Alberto; Zhu, Weixuan; Guindani, Michele; Vannucci, MarinaIn many applications, investigators monitor processes that vary in space and time, with the goal of identifying temporally persistent and spatially localized departures from a baseline or “normal” behavior. In this manuscript, we consider the monitoring of pneumonia and influenza (P&I) mortality, to detect influenza outbreaks in the continental United States, and propose a Bayesian nonparametric model selection approach to take into account the spatio-temporal dependence of outbreaks. More specifically, we introduce a zero-inflated conditionally identically distributed species sampling prior which allows borrowing information across time and to assign data to clusters associated to either a null or an alternate process. Spatial dependences are accounted for by means of a Markov random field prior, which allows to inform the selection based on inferences conducted at nearby locations. We show how the proposed modeling framework performs in an application to the P&I mortality data and in a simulation study, and compare with common threshold methods for detecting outbreaks over time, with more recent Markov switching based models, and with spike-and-slab Bayesian nonparametric priors that do not take into account spatio-temporal dependence.Item A convex-nonconvex strategy for grouped variable selection(Project Euclid, 2023) Liu, Xiaoqian; Molstad, Aaron J.; Chi, Eric C.This paper deals with the grouped variable selection problem. A widely used strategy is to augment the negative log-likelihood function with a sparsity-promoting penalty. Existing methods include the group Lasso, group SCAD, and group MCP. The group Lasso solves a convex optimization problem but suffers from underestimation bias. The group SCAD and group MCP avoid this estimation bias but require solving a nonconvex optimization problem that may be plagued by suboptimal local optima. In this work, we propose an alternative method based on the generalized minimax concave (GMC) penalty, which is a folded concave penalty that maintains the convexity of the objective function. We develop a new method for grouped variable selection in linear regression, the group GMC, that generalizes the strategy of the original GMC estimator. We present a primal-dual algorithm for computing the group GMC estimator and also prove properties of the solution path to guide its numerical computation and tuning parameter selection in practice. We establish error bounds for both the group GMC and original GMC estimators. A rich set of simulation studies and a real data application indicate that the proposed group GMC approach outperforms existing methods in several different aspects under a wide array of scenarios.Item A CRISPR toolbox for generating intersectional genetic mouse models for functional, molecular, and anatomical circuit mapping(Springer Nature, 2022) Lusk, Savannah J.; McKinney, Andrew; Hunt, Patrick J.; Fahey, Paul G.; Patel, Jay; Chang, Andersen; Sun, Jenny J.; Martinez, Vena K.; Zhu, Ping Jun; Egbert, Jeremy R.; Allen, Genevera; Jiang, Xiaolong; Arenkiel, Benjamin R.; Tolias, Andreas S.; Costa-Mattioli, Mauro; Ray, Russell S.The functional understanding of genetic interaction networks and cellular mechanisms governing health and disease requires the dissection, and multifaceted study, of discrete cell subtypes in developing and adult animal models. Recombinase-driven expression of transgenic effector alleles represents a significant and powerful approach to delineate cell populations for functional, molecular, and anatomical studies. In addition to single recombinase systems, the expression of two recombinases in distinct, but partially overlapping, populations allows for more defined target expression. Although the application of this method is becoming increasingly popular, its experimental implementation has been broadly restricted to manipulations of a limited set of common alleles that are often commercially produced at great expense, with costs and technical challenges associated with production of intersectional mouse lines hindering customized approaches to many researchers. Here, we present a simplified CRISPR toolkit for rapid, inexpensive, and facile intersectional allele production.Item A factorial analysis of the combined effects of hydrogel fabrication parameters on the in vitro swelling and degradation of oligo(poly(ethylene glycol) fumarate) hydrogels(Wiley, 2014) Lam, Johnny; Kim, Kyobum; Lu, Steven; Tabata, Yasuhiko; Scott, David W.; Mikos, Antonios G.; Kasper, F. KurtisIn this study, a full factorial approach was used to investigate the effects of poly(ethylene glycol) (PEG) molecular weight (MW; 10,000 vs. 35,000 nominal MW), crosslinker-to-macromer carbon–carbon double bond ratio (DBR; 40 vs. 60), crosslinker type (PEG-diacrylate (PEGDA) vs. N,N′-methylene bisacrylamide (MB)), crosslinking extent of incorporated gelatin microparticles (low vs. high), and incubation medium composition (with or without collagenase) on the swelling and degradation characteristics of oligo[(poly(ethylene glycol) fumarate)] (OPF) hydrogel composites as indicated by the swelling ratio and the percentage of mass remaining, respectively. Each factor consisted of two levels, which were selected based on previous in vitro and in vivo studies utilizing these hydrogels for various tissue engineering applications. Fractional factorial analyses of the main effects indicated that the mean swelling ratio and the mean percentage of mass remaining of OPF composite hydrogels were significantly affected by every factor. In particular, increasing the PEG chain MW of OPF macromers significantly increased the mean swelling ratio and decreased the mean percentage of mass remaining by 5.7 ± 0.3 and 17.2 ± 0.6%, respectively. However, changing the crosslinker from MB to PEGDA reduced the mean swelling ratio and increased the mean percentage of mass remaining of OPF composite hydrogels by 4.9 ± 0.2 and 9.4 ± 0.9%, respectively. Additionally, it was found that the swelling characteristics of hydrogels fabricated with higher PEG chain MW or with MB were more sensitive to increases in DBR. Collectively, the main and cross effects observed between factors enables informed tuning of the swelling and degradation properties of OPF-based hydrogels for various tissue engineering applications.Item A genome-wide trans-ethnic interaction study links theᅠPIGR-FCAMRᅠlocus to coronary atherosclerosis via interactions between genetic variants and residential exposure to traffic(Public Library of Science, 2017) Ward-Caviness, Cavin K.; Neas, Lucas M.; Blach, Colette; Haynes, Carol S.; LaRocque-Abramson, Karen; Grass, Elizabeth; Dowdy, Z. Elaine; Devlin, Robert B.; Diaz-Sanchez, David; Cascio, Wayne E.; Miranda, Marie Lynn; Gregory, Simon G.; Shah, Svati H.; Kraus, William E.; Hauser, Elizabeth R.Air pollution is a worldwide contributor to cardiovascular disease mortality and morbidity. Traffic-related air pollution is a widespread environmental exposure and is associated with multiple cardiovascular outcomes such as coronary atherosclerosis, peripheral arterial disease, and myocardial infarction. Despite the recognition of the importance of both genetic and environmental exposures to the pathogenesis of cardiovascular disease, studies of how these two contributors operate jointly are rare. We performed a genome-wide interaction study (GWIS) to examine gene-traffic exposure interactions associated with coronary atherosclerosis. Using race-stratified cohorts of 538 African-Americans (AA) and 1562 European-Americans (EA) from a cardiac catheterization cohort (CATHGEN), we identify gene-by-traffic exposure interactions associated with the number of significantly diseased coronary vessels as a measure of chronic atherosclerosis. We found five suggestive (P<1x10-5) interactions in the AA GWIS, of which two (rs1856746 and rs2791713) replicated in the EA cohort (P < 0.05). Both SNPs are in the PIGR-FCAMR locus and are eQTLs in lymphocytes. The protein products of both PIGR and FCAMR are implicated in inflammatory processes. In the EA GWIS, there were three suggestive interactions; none of these replicated in the AA GWIS. All three were intergenic; the most significant interaction was in a regulatory region associated with SAMSN1, a gene previously associated with atherosclerosis and B cell activation. In conclusion, we have uncovered several novel genes associated with coronary atherosclerosis in individuals chronically exposed to increased ambient concentrations of traffic air pollution. These genes point towards inflammatory pathways that may modify the effects of air pollution on cardiovascular disease risk.Item A Hierarchical Bayesian Model for the Identification of PET Markers Associated to the Prediction of Surgical Outcome after Anterior Temporal Lobe Resection(Frontiers Media S.A., 2017) Chiang, Sharon; Guindani, Michele; Yeh, Hsiang J.; Dewar, Sandra; Haneef, Zulfi; Stern, John M.; Vannucci, MarinaWe develop an integrative Bayesian predictive modeling framework that identifies individual pathological brain states based on the selection of fluoro-deoxyglucose positron emission tomography (PET) imaging biomarkers and evaluates the association of those states with a clinical outcome. We consider data from a study on temporal lobe epilepsy (TLE) patients who subsequently underwent anterior temporal lobe resection. Our modeling framework looks at the observed profiles of regional glucose metabolism in PET as the phenotypic manifestation of a latent individual pathologic state, which is assumed to vary across the population. The modeling strategy we adopt allows the identification of patient subgroups characterized by latent pathologies differentially associated to the clinical outcome of interest. It also identifies imaging biomarkers characterizing the pathological states of the subjects. In the data application, we identify a subgroup of TLE patients at high risk for post-surgical seizure recurrence after anterior temporal lobe resection, together with a set of discriminatory brain regions that can be used to distinguish the latent subgroups. We show that the proposed method achieves high cross-validated accuracy in predicting post-surgical seizure recurrence.Item A joint modeling approach for longitudinal microbiome data improves ability to detect microbiome associations with disease(Public Library of Science, 2020) Luna, Pamela N.; Mansbach, Jonathan M.; Shaw, Chad A.Changes in the composition of the microbiome over time are associated with myriad human illnesses. Unfortunately, the lack of analytic techniques has hindered researchers’ ability to quantify the association between longitudinal microbial composition and time-to-event outcomes. Prior methodological work developed the joint model for longitudinal and time-to-event data to incorporate time-dependent biomarker covariates into the hazard regression approach to disease outcomes. The original implementation of this joint modeling approach employed a linear mixed effects model to represent the time-dependent covariates. However, when the distribution of the time-dependent covariate is non-Gaussian, as is the case with microbial abundances, researchers require different statistical methodology. We present a joint modeling framework that uses a negative binomial mixed effects model to determine longitudinal taxon abundances. We incorporate these modeled microbial abundances into a hazard function with a parameterization that not only accounts for the proportional nature of microbiome data, but also generates biologically interpretable results. Herein we demonstrate the performance improvements of our approach over existing alternatives via simulation as well as a previously published longitudinal dataset studying the microbiome during pregnancy. The results demonstrate that our joint modeling framework for longitudinal microbiome count data provides a powerful methodology to uncover associations between changes in microbial abundances over time and the onset of disease. This method offers the potential to equip researchers with a deeper understanding of the associations between longitudinal microbial composition changes and disease outcomes. This new approach could potentially lead to new diagnostic biomarkers or inform clinical interventions to help prevent or treat disease.Item A longitudinal cohort study of malaria exposure and changing serostatus in a malaria endemic area of rural Tanzania(BioMed Central, 8/2/2017) Simmons, Ryan A.; Mboera, Leonard; Miranda, Marie L.; Morris, Alison; Stresman, Gillian; Turner, Elizabeth L.; Kramer, Randall; Drakeley, Chris; O’Meara, Wendy P.Abstract Background Measurements of anti-malarial antibodies are increasingly used as a proxy of transmission intensity. Most serological surveys are based on the use of cross-sectional data that, when age-stratified, approximates historical patterns of transmission within a population. Comparatively few studies leverage longitudinal data to explicitly relate individual infection events with subsequent antibody responses. Methods The occurrence of seroconversion and seroreversion events for two Plasmodium falciparum asexual stage antigens (MSP-1 and AMA-1) was examined using three annual measurements of 691 individuals from a cohort of individuals in a malaria-endemic area of rural east-central Tanzania. Mixed-effect logistic regression models were employed to determine factors associated with changes in serostatus over time. Results While the expected population-level relationship between seroprevalence and disease incidence was observed, on an individual level the relationship between individual infections and the antibody response was complex. MSP-1 antibody responses were more dynamic in response to the occurrence and resolution of infection events than AMA-1, while the latter was more correlated with consecutive infections. The MSP-1 antibody response to an observed infection seemed to decay faster over time than the corresponding AMA-1 response. Surprisingly, there was no evidence of an age effect on the occurrence of a conversion or reversion event. Conclusions While the population-level results concur with previously published sero-epidemiological surveys, the individual-level results highlight the more complex relationship between detected infections and antibody dynamics than can be analysed using cross-sectional data. The longitudinal analysis of serological data may provide a powerful tool for teasing apart the complex relationship between infection events and the corresponding immune response, thereby improving the ability to rapidly assess the success or failure of malaria control programmes.Item A mathematical model as a tool to identify microRNAs with highest impact on transcriptome changes(BioMed Central, 2019) Mura, Marzena; Jaksik, Roman; Lalik, Anna; Biernacki, Krzysztof; Kimmel, Marek; Rzeszowska-Wolny, Joanna; Fujarewicz, KrzysztofBackground: Rapid changes in the expression of many messenger RNA (mRNA) species follow exposure of cells to ionizing radiation. One of the hypothetical mechanisms of this response may include microRNA (miRNA) regulation, since the amounts of miRNAs in cells also vary upon irradiation. To address this possibility, we designed experiments using cancer-derived cell lines transfected with luciferase reporter gene containing sequences targeted by different miRNA species in its 3′- untranslated region. We focus on the early time-course response (1 h past irradiation) to eliminate secondary mRNA expression waves. Results: Experiments revealed that the irradiation-induced changes in the mRNA expression depend on the miRNAs which interact with mRNA. To identify the strongest interactions, we propose a mathematical model which predicts the mRNA fold expression changes, caused by perturbation of microRNA-mRNA interactions. Model was applied to experimental data including various cell lines, irradiation doses and observation times, both ours and literature-based. Comparison of modelled and experimental mRNA expression levels given miRNA level changes allows estimating how many and which miRNAs play a significant role in transcriptome response to stress conditions in different cell types. As an example, in the human melanoma cell line the comparison suggests that, globally, a major part of the irradiation-induced changes of mRNA expression can be explained by perturbed miRNA-mRNA interactions. A subset of about 30 out of a few hundred miRNAs expressed in these cells appears to account for the changes. These miRNAs play crucial roles in regulatory mechanisms observed after irradiation. In addition, these miRNAs have a higher average content of GC and a higher number of targeted transcripts, and many have been reported to play a role in the development of cancer. Conclusions: Our proposed mathematical modeling approach may be used to identify miRNAs which participate in responses of cells to ionizing radiation, and other stress factors such as extremes of temperature, exposure to toxins, and drugs.Item A Network Biology Approach Identifies Molecular Cross-Talk between Normal Prostate Epithelial and Prostate Carcinoma Cells(Public Library of Science, 2016) Trevino, Victor; Cassese, Alberto; Nagy, Zsuzsanna; Zhuang, Xiaodong; Herbert, John; Antzack, Philipp; Clarke, Kim; Davies, Nicholas; Rahman, Ayesha; Campbell, Moray J.; Guindani, Michele; Bicknell, Roy; Vannucci, Marina; Falciani, FrancescoThe advent of functional genomics has enabled the genome-wide characterization of the molecular state of cells and tissues, virtually at every level of biological organization. The difficulty in organizing and mining this unprecedented amount of information has stimulated the development of computational methods designed to infer the underlying structure of regulatory networks from observational data. These important developments had a profound impact in biological sciences since they triggered the development of a novel data-driven investigative approach. In cancer research, this strategy has been particularly successful. It has contributed to the identification of novel biomarkers, to a better characterization of disease heterogeneity and to a more in depth understanding of cancer pathophysiology. However, so far these approaches have not explicitly addressed the challenge of identifying networks representing the interaction of different cell types in a complex tissue. Since these interactions represent an essential part of the biology of both diseased and healthy tissues, it is of paramount importance that this challenge is addressed. Here we report the definition of a network reverse engineering strategy designed to infer directional signals linking adjacent cell types within a complex tissue. The application of this inference strategy to prostate cancer genome-wide expression profiling data validated the approach and revealed that normal epithelial cells exert an anti-tumour activity on prostate carcinoma cells. Moreover, by using a Bayesian hierarchical model integrating genetics and gene expression data and combining this with survival analysis, we show that the expression of putative cell communication genes related to focal adhesion and secretion is affected by epistatic gene copy number variation and it is predictive of patient survival. Ultimately, this study represents a generalizable approach to the challenge of deciphering cell communication networks in a wide spectrum of biological systems.Item A Post Keynesian Analysis of the Black-Scholes Option Pricing Model(Rice University, 1999) Thompson, J.R.; Williams, E.E.Item A simple tree planting framework to improve climate, air pollution, health, and urban heat in vulnerable locations using non-traditional partners(Wiley, 2022) Hopkins, Loren P.; January-Bevers, Deborah J.; Caton, Erin K.; Campos, Laura A.Societal Impact Statement Planting trees is considered an effective method for climate change adaptation and mitigation. This framework provides a replicable blueprint to improve health, urban heat, flooding, and air pollution via a multisectoral, collaborative, environmental data-driven approach. Native tree species with targeted ecosystem services are selected, and sites are strategically identified based on environmental and health benefits, with the intent of engaging community involvement through education and large-scale tree plantings. Including non-traditional partners in the framework provides heightened awareness of the relationship between climate change and health, thus catalyzing decision-making regarding sustainable actions that reduce effects of climate change. This native tree planting framework is highly adaptable in other cities. Summary A multidisciplinary framework is presented for a data-driven, climate change adaptation and climate change and air pollution mitigation project. This framework leverages heightened awareness of the connections between climate change, air pollution, and health to expand the cadre and societal impacts of those working to intervene in resilience planning and implementation. The framework, implemented in Houston, Texas, USA, beginning in 2019, consists of three parts: (1) identification of optimal native tree species for climate change adaptations and air pollution mitigation around variables important locally; (2) selection of large-scale native tree planting locations where populations are already disproportionately experiencing flooding, increased heat, and air pollution-related health effects that could be further exacerbated from climate change; and (3) engagement of multisectoral leadership broadened beyond those traditionally working on climate change resilience through heightening awareness of the link to human health. Native tree species were identified that had the highest combination of absorption of carbon dioxide, other air pollutants, and water absorption (aiding in flood adaptation and air pollution/heat mitigation). Thousands of the top tree species were planted in locations that experience substantial flooding during large rain events, have high rates of health effects exacerbated by air pollution (e.g., cardiac arrest and asthma attacks), and experience multiple days of elevated heat and air pollution. This multidisciplinary framework addresses a critical need to provide interventions accessible to the community; educate on the connection between climate change adaptation, air pollution mitigation, and health; and foster multisectoral leadership to accelerate local resilience actions.Item A spatiotemporal case-crossover model of asthma exacerbation in the City of Houston(Wiley, 2021) Schedler, Julia C.; Ensor, Katherine B.Case-crossover design is a popular construction for analyzing the impact of a transient effect, such as ambient pollution levels, on an acute outcome, such as an asthma exacerbation. Case-crossover design avoids the need to model individual, time-varying risk factors for cases by using cases as their own ‘controls’, chosen to be time periods for which individual risk factors can be assumed constant and need not be modelled. Many studies have examined the complex effects of the control period structure on model performance, but these discussions were simplified when case-crossover design was shown to be equivalent to various specifications of Poisson regression when exposure is considered constant across study participants. While reasonable for some applications, there are cases where such an assumption does not apply due to spatial variability in exposure, which may affect parameter estimation. This work presents a spatiotemporal model, which has temporal case-crossover and a geometrically aware spatial random effect based on the Hausdorff distance. The model construction incorporates a residual spatial structure in cases when the constant assumption exposure is not reasonable and when spatial regions are irregular.Item A statistical model for removing inter-device differences in spectroscopy(Optical Society of America, 2014) Wang, Lu; Lee, Jong Soo; Lane, Pierre; Atkinson, E. Neely; Zuluaga, Andres; Follen, Michele; MacAulay, Calum; Cox, Dennis D.We are investigating spectroscopic devices designed to make in vivo cervical tissue measurements to detect pre-cancerous and cancerous lesions. All devices have the same design and ideally should record identical measurements. However, we observed consistent differences among them. An experiment was designed to study the sources of variation in the measurements recorded. Here we present a log additive statistical model that incorporates the sources of variability we identified. Based on this model, we estimated correction factors from the experimental data needed to eliminate the inter-device variability and other sources of variation. These correction factors are intended to improve the accuracy and repeatability of such devices when making future measurements on patient tissue.Item A systems biology approach reveals common metastatic pathways in osteosarcoma(BioMed Central, 2012) Flores, Ricardo J.; Li, Yiting; Yu, Alexander; Shen, Jianhe; Rao, Pulivarthi H.; Lau, Serrine S.; Vannucci, Marina; Lau, Ching C.; Man, Tsz-KwongBackground: Osteosarcoma (OS) is the most common malignant bone tumor in children and adolescents. The survival rate of patients with metastatic disease remains very dismal. Nevertheless, metastasis is a complex process and a single-level analysis is not likely to identify its key biological determinants. In this study, we used a systems biology approach to identify common metastatic pathways that are jointly supported by both mRNA and protein expression data in two distinct human metastatic OS models. Results: mRNA expression microarray and N-linked glycoproteomic analyses were performed on two commonly used isogenic pairs of human metastatic OS cell lines, namely HOS/143B and SaOS-2/LM7. Pathway analysis of the differentially regulated genes and glycoproteins separately revealed pathways associated to metastasis including cell cycle regulation, immune response, and epithelial-to-mesenchymal-transition. However, no common significant pathway was found at both genomic and proteomic levels between the two metastatic models, suggesting a very different biological nature of the cell lines. To address this issue, we used a topological significance analysis based on a “shortest-path” algorithm to identify topological nodes, which uncovered additional biological information with respect to the genomic and glycoproteomic profiles but remained hidden from the direct analyses. Pathway analysis of the significant topological nodes revealed a striking concordance between the models and identified significant common pathways, including “Cytoskeleton remodeling/TGF/WNT”, “Cytoskeleton remodeling/Cytoskeleton remodeling”, and “Cell adhesion/Chemokines and adhesion”. Of these, the “Cytoskeleton remodeling/TGF/WNT” was the top ranked common pathway from the topological analysis of the genomic and proteomic profiles in the two metastatic models. The up-regulation of proteins in the “Cytoskeleton remodeling/TGF/WNT” pathway in the SaOS-2/LM7 and HOS/143B models was further validated using an orthogonal Reverse Phase Protein Array platform. Conclusions: In this study, we used a systems biology approach by integrating genomic and proteomic data to identify key and common metastatic mechanisms in OS. The use of the topological analysis revealed hidden biological pathways that are known to play critical roles in metastasis. Wnt signaling has been previously implicated in OS and other tumors, and inhibitors of Wnt signaling pathways are available for clinical testing. Further characterization of this common pathway and other topological pathways identified from this study may lead to a novel therapeutic strategy for the treatment of metastatic OS.Item Accuracy of optical spectroscopy for the detection of cervical intraepithelial neoplasia without colposcopic tissue information; a step toward automation for low resource settings(Society of Photo-Optical Instrumentation Engineers, 2012-04) Yamal, Jose-Miguel; Zewdie, Getie A.; Cox, Dennis D.; Atkinson, E. Neely; Cantor, Scott B.; MacAulay, Calum; Davies, Kalatu; Adewole, Isaac; Buys, Timon P. H.; Follen, MicheleOptical spectroscopy has been proposed as an accurate and low-cost alternative for detection of cervical intraepithelial neoplasia. We previously published an algorithm using optical spectroscopy as an adjunct to colposcopy and found good accuracy (sensitivity ¼ 1.00 [95% confidence interval ðCIÞ ¼ 0.92 to 1.00], specificity ¼ 0.71 [95% CI ¼ 0.62 to 0.79]). Those results used measurements taken by expert colposcopists as well as the colposcopy diagnosis. In this study, we trained and tested an algorithm for the detection of cervical intraepithelial neoplasia (i.e., identifying those patients who had histology reading CIN 2 or worse) that did not include the colposcopic diagnosis. Furthermore, we explored the interaction between spectroscopy and colposcopy, examining the importance of probe placement expertise. The colposcopic diagnosis-independent spectroscopy algorithm had a sensitivity of 0.98 (95% CI ¼ 0.89 to 1.00) and a specificity of 0.62 (95% CI ¼ 0.52 to 0.71). The difference in the partial area under the ROC curves between spectroscopy with and without the colposcopic diagnosis was statistically significant at the patient level (p ¼ 0.05) but not the site level (p ¼ 0.13). The results suggest that the device has high accuracy over a wide range of provider accuracy and hence could plausibly be implemented by providers with limited training.Item Adverse Health Outcomes Following Hurricane Harvey: A Comparison of Remotely-Sensed and Self-Reported Flood Exposure Estimates(Wiley, 2023) Ramesh, Balaji; Callender, Rashida; Zaitchik, Benjamin F.; Jagger, Meredith; Swarup, Samarth; Gohlke, Julia M.Remotely sensed inundation may help to rapidly identify areas in need of aid during and following floods. Here we evaluate the utility of daily remotely sensed flood inundation measures and estimate their congruence with self-reported home flooding and health outcomes collected via the Texas Flood Registry (TFR) following Hurricane Harvey. Daily flood inundation for 14 days following the landfall of Hurricane Harvey was acquired from FloodScan. Flood exposure, including number of days flooded and flood depth was assigned to geocoded home addresses of TFR respondents (N = 18,920 from 47 counties). Discordance between remotely-sensed flooding and self-reported home flooding was measured. Modified Poisson regression models were implemented to estimate risk ratios (RRs) for adverse health outcomes following flood exposure, controlling for potential individual level confounders. Respondents whose home was in a flooded area based on remotely-sensed data were more likely to report injury (RR = 1.5, 95% CI: 1.27–1.77), concentration problems (1.36, 95% CI: 1.25–1.49), skin rash (1.31, 95% CI: 1.15–1.48), illness (1.29, 95% CI: 1.17–1.43), headaches (1.09, 95% CI: 1.03–1.16), and runny nose (1.07, 95% CI: 1.03–1.11) compared to respondents whose home was not flooded. Effect sizes were larger when exposure was estimated using respondent-reported home flooding. Near-real time remote sensing-based flood products may help to prioritize areas in need of assistance when on the ground measures are not accessible.