Data Science MS
Kirsten Keihl, Assistant Director
Valdes, G., Chang, A.J., Interian, Y., Owen, K., Jensen, S.T., Ungar, L.H., Cunnan, A., Solberg, T.D., Hsu, I. (2018). HDR salvage brachytherapy: Multiple hypothesis testing vs. machine learning analysis. International Journal of Radiation Oncology Biology Physics: https://doi.org/10.1016/j.ijrobp.2018.03.001.
Valdes, G., Interian, Y. (2018). Comment on 'Deep convolutional neural network with transfer learning for rectum toxicity prediction in cervical cancer radiotherapy: a feasibility study'. Physics in Medicine & Biology, 63(6), http://iopscience.iop.org/article/10.1088/1361-6560/aaae23/meta.
Interian, Y., Rideout, V., Keanery, V.P., Efstathios, G., Morin, O., Cheung, J., Solberg, T., and Valdes, G. (2018). Deep nets vs expert-designed features in medical physics: An IMRT QA case study. Forthcoming from Medical Physics.
Ma, J., Ovalle, A., Woodbridge, D.M. (2018). Medication adherence monitoring using machine learning. IEEE International Conference on Biomedical and Health Informatics (BHI), Las Vegas.
Chen, L., Li, R., Liu, Y., Zhang, R., & Woodbridge, D.M. (2017). Machine learning-based product recommendation using Apache Spark. IEEE UIC International Workshop on Data Science and Computational Intelligence (DSCI), San Francisco.
Da Rocha, L.T. and Stevens, N.T. (2017). Comparing two measurement systems using the probability of agreement web app. Quality Engineering. DOI:10.1080/08982112.2017.1361538
Stevens, N.T. and Anderson-Cook, C.M. (2017). Quantifying similarity in reliability surfaces using the probability of agreement. Quality Engineering, 29(3), 395–408.
Stevens, N.T. and Anderson-Cook, C.M. (2017). Comparing the reliability of related populations with the probability of agreement. Technometrics, 59(3), 371–380.
Stevens, N.T., Steiner, S.H., and MacKay, R.J. (2017). Comparing heteroscedastic measurement systems with the probability of agreement. Statistical Methods in Medical Research. DOI: 10.1177/0962280217702540.
Goodkind, A., Guy Brizan, D., and Rosenberg, A. (2017). Utilizing overt and latent linguistic structure to improve keystroke-based authentication. Image and Vision Computing, 58, 230–238.
Wilson, J.D., Desmarais, B., Cranmer, S., Denny, M., and Bhamidi, S. (2017). Stochastic weighted graphs: Flexible model specification and simulation. Social Networks, 49, 37–47.
Wilson, J.D., Palowitch, J., Bhamidi, S., and Nobel, A.B. (2017). Community extraction in multilayer networks with heterogeneous community structure. Journal of Machine Learning Research, 18.
Woodall, W.H., Zhao, M., Paynabar, K., Sparks, R., and Wilson, J.D. (2017). An overview and perspective on social network monitoring. IISE Transactions, 49:3, 354–365.
Stillman, P.E., Wilson, J.D., Denny, M.J., Desmarais, B., Bhamidi, S., Cranmer, S., and Lu, Z.L. (2017). Statistical modeling of the default mode brain network reveals a segregated highway structure. Scientific Reports, 7(1), 11694.
Parr, T. and Vinju, J. (2016). Towards a universal code formatter through machine learning. In Proceedings of The 9th ACM SIGPLAN International Conference on Software Language Engineering. (Awarded The Distinguished Paper)
Wilson, J.D., Desmarais, B., Cranmer, S., Denny, M. and Bhamidi, S. (2016). Stochastic weighted graphs: flexible model specification and simulation. Social Networks, 49.
Stevens, N.T. and Anderson-Cook, C.M. (2016). Comparing the reliability of related populations with the probability of agreement. Technometrics. DOI:10.1080/00401706.2016.1214180.
Stevens, N.T. and Jones-Farmer, L.A. (2016). Discussion of "Analyzing behavioral big data: Methodological, practical, ethical, and moral issues.” Quality Engineering, 29(1), 84–86.
Jones-Farmer, L.A. and Stevens, N.T. (2016). Discussion of "Bridging the gap between theory and practice in basic statistical process monitoring.” Quality Engineering, 29(1), 22–26.
Guy Brizan, D., Gallagher, K., Jahangir, A., and Brown, T. (2016). Predicting citation patterns: Defining and determining influence. Scientometrics 108 (1), 183-200.
Parker, K.S., Wilson, J.D., Marschall, J., Mucha, P.J., and Henderson, J.P. (2015). Network analysis reveals sex and antibiotic resistance associated antivirulence targets in clinical uropathogens. American Chemical Society: Infectious Diseases, 1(11), 523–532.
Szekely, E., Pappa, I., Wilson, J.D., Bhamidi, S., Jaddoe, V., Verhulst, H.T., and Shaw, P. (2015). Childhood peer network characteristics: Genetic influences and links with early mental health trajectories. Journal of Child Psychology and Psychiatry. DOI: 10.1111/jcpp.12493
Bertozzi, A.L., Kolokolnikov, T., Sun, H., Uminsky, D., and von Brecht, J. (2015). Ring patterns and their bifurcations in a nonlocal model of biological swarms. Communications in Mathematical Sciences, 13 (4).
Dixon, M.F. (2015). A pattern oriented approach for designing scalable analytics applications. Invited Paper, 2nd ACM Workshop on Parallel Computing for Analytics Applications, PPoPP'15.
Dixon, M.F., Lotze, J., and Zubair, M. (2015). A portable, fast, and flexible stochastic volatility model calibration using multi and many-core processors. Journal of Concurrency and Computation: Practice and Experience.
Stevens, N.T., Steiner, S.H., and MacKay, R.J. (2015). Assessing agreement between two measurement systems: An alternative to the limits of agreement approach. Statistical Methods in Medical Research.
Stevens, N.T., Steiner, S.H., and MacKay, R.J. (2015). Being smart about parts. Quality Progress, March, 32–37.
Brost, R., Phillips, C., Robinson, D., Stracuzzi, D., Wilson, A. and Woodbridge, D.M. (2015). Computing quality scores and uncertainty for approximate pattern matching in geospatial semantic graphs. Statistical Analysis and Data Mining (SADM), 8(5-6), 340–352.
Woodbridge, D.M., Wilson, A.T., Rintoul, M.D., and Goldstein, R.H. (2015). Time series discord detection in medical data using a parallel relational database. IEEE International Conference on Bioinformatics and Biomedicine, Washington, DC.
Goodkind, A., Guy Brizan, D., and Rosenberg, A. (2015). Improvements to keystroke-based authentication by adding linguistic context. International Conference on Biometrics: Theory, Applications and Systems, Arlington, Virginia.
An, G., Guy Brizan, D., Ma, M., Morales, M., Raza Syed, A., and Rosenberg, A. (2015). Automatic recognition of unified Parkinson’s disease rating from speech with acoustic, i-Vector and phonotactic features. Interspeech Conference, Dresden, Germany.
Guy Brizan, D., Goodkind, A., Koch, P., Balagani, K., Phoha, V.V., and Rosenberg, A. (2015). Utilizing linguistically-enhanced keystroke dynamics to predict typist cognition and demographics. International Journal of Human-Computer Studies.
Locklear, H., Govindarajan, S., Sitova, Z., Goodkind, A., Guy Brizan, D., Rosenberg, A., Phoha, V., Gasti, P., and Balagani, K.S. (2014). Continuous authentication with cognition-centric text production and revision features. International Joint Conference on Biometrics (IJCB), Clearwater, Florida.
Katerenchuk, D., Guy Brizan, D., and Rosenberg, A. (2014). “Was that your mother on the phone?”: Classifying interpersonal relationships between dialog participants with lexical and acoustic properties. Interspeech Conference, Singapore.
Brost, R.C., McLendon, W.C., III, Parekh, O., Rintoul, M.D., Strip, D.R., and Woodbridge, D.M. (2014). A computational framework for ontologically storing and analyzing very large overhead image sets. Presented at ACM SIGSPATIAL International Workshop on Analytics for Big Geospatial Data, Dallas.
Lindsay, S. and Woodbridge, D.M. (2014). Spacecraft state-of-health (SOH) analysis via Data Mining. AIAA International Conference on Space Operations (SpaceOps), Pasadena, CA.
Anderson, P., McGuee, J., and Uminsky, D. (2014). Data science as an undergraduate degree. The 45th ACM Technical Symposium on Computer Science Education (SIGCSE 2014).
Dixon, M.F. (2014). Risk decomposition for fund managers. Proceedings of R/Finance.
Dixon, M.F., Lotze, J., and Zubair, M. (2014). A portable and fast stochastic volatility model calibration using multi and many-core processors. ACM Proceedings of the 7th Workshop on High Performance Computational Finance, SC'14, New Orleans, LA.
Dixon, M.F., and Chong, J. (2014). A Bayesian approach to ranking private companies based on predictive indicators. Journal of AI Communications: Special Track on Soft Computing in Finance and Economics, 28(2).
Dixon, M.F., Khan, S., and Zubair, M. (2014). gpusvcalibration: A R package for fast stochastic volatility model calibration using GPUs. Proceedings of R/Finance.
Dixon, M.F., Khan, S., and Zubair, M. (2014). Accelerating option risk analytics in R using GPUs. Proceedings of the 22nd High Performance Computing Symposium (HPC 14), April 13-16.
Parr, T., Harwell, S., and Fisher, K. (2014). Adaptive LL(*) parsing: The power of dynamic analysis. OOPSLA. Portland, OR.
Ramgopal, S., Thome-Souza, S., Jackson, M., Kadish, N.E., Sanchez Fernandez, I., Klehm, J., Bosl, W., Reinsberger, C., Schachter, S., and Loddenkemper, T. (2014). Seizure detection, seizure prediction, and closed-loop warning systems in epilepsy. Epilepsy Behavior, 37C, 291–307.
Wilson, J.D., Wang, S., Mucha, P.J., Bhamidi, S., and Nobel, A.B. (2014). A testing based extraction algorithm for identifying significant communities in networks. Annals of Applied Statistics, 8(3), 1853–1891.
Broxton, T., Interian, Y., Vaver, J.V., & Wattenhofer, M. (2013). Catching a viral video. Journal of Intelligent Information Systems, 40(2), 241–259.
Dixon, M.F., & Zubair, M. (2013). Calibration of stochastic volatility models on a multi-core CPU cluster. ACM Proceedings of the Workshop on High Performance Computing, SC’13, Denver, CO.
Dixon, M.F., Aiello, S.P., Fapohunda, F., and Goldstein, W. (2013). Detecting mobility patterns in mobile phone data from the Ivory Coast. Netmob.
Engle, S., and Gates, C. (2013). Reflecting on visualization for cyber security. Proceedings of the 2013 IEEE International Conference on Intelligence and Security Informatics (ISI), from the Evaluating Security Visualizations Workshop, Seattle, Washington, 275–277.
Intrevado, P. Abel, P. and Oszen, L. (2013). Inpatient pharmacy operations: An inter-professional literature review. American Journal of Health-System Pharmacy, 70(11).
Parr, T. (2013). The definitive ANTLR 4 reference. Dallas, TX: Pragmatic Bookshelf.
Rousseaux, G., Levy, R., and Uminsky, D. (2013). Flukeprints of cetaceans and the corresponding shear-flow phenomenon. The International Symposium on Turbulence and Shear Flow Phenomena (TSFP-8).
Stevens, N.T., Steiner, S.H., Browne, R., and MacKay, R.J. (2013). Gauge R&R studies that incorporate baseline information. IIE Transactions, 45(11), 1166–1175.
Wilson, J.D., Bhamidi, S., and Nobel, A.B. (2013). Measuring the statistical significance of local connections in directed networks. Neural Information Processing Systems Workshop on Frontiers of Network Analysis: Methods, Models and Applications.
von Brecht, J., Laurent, T., Bresson, X., and Uminsky, D. (2013). Multiclass total variation clustering. Advances in Neural Information Processing Systems 26 (NIPS 2013), 1421–1429.
Serwadda, A., Wang, Z., Koch, P., Govindarajan, S., Pokala, R., Goodkind, A., Guy Brizan, D., Rosenberg, A., Phoha, V.V., and Balagani, K.S. (2013). Scan-based evaluation of continuous keystroke authentication systems. IT Professional, 15(4): 20–23.
An, G., Guy Brizan, D., and Rosenberg, A. (2013). Detecting laughter and filled pauses using syllable-based features. Interspeech, Lyon, France.
Suh, M., Lan, M., Samy, L., Alshurafa, N., Ghasemzadeh, H., Sarrafzadeh, M. and Macabasco-O'Connell, A. (2012). WANDA: An end-to-end remote health monitoring and analytics system for heart failure patients. Presented at Wireless Health Conference, San Diego.
Suh, M., Nahapetian, A., Woodbridge, J., Rofouei, M. and Sarrafzadeh, M. (2012). Machine learning-based adaptive wireless interval training guidance system. Mobile Networks and Applications, 17(2), 163-177.
Suh, M., Woodbridge, J., Moin, T., Lan, M., Alshurafa, N., Samy, L., Mortazavi, B., Ghasemzadeh, H., Bui, A., Ahmadi, S. and Sarrafzadeh, M. (2012). Dynamic task optimization in remote diabetes monitoring systems. IEEE International Conference on Healthcare Informatics, Imaging and Systems Biology (HISB), San Jose.
Suh, M., Moin, T., Woodbridge, J., Lan, M., Ghasemzadeh, H., Ahmadi, S., Bui, A., and Sarrafzadeh, M. (2012). Dynamic self-adaptive remote health monitoring system for diabetics. International Conference of the IEEE Engineering in Medicine and Biology Society (EMBC), San Diego.
Bishop, M., Engle, S., Howard, D., & Whalen, S. (2012). A taxonomy of buffer overflow characteristics. IEEE Transactions on Dependable and Secure Computing (TDSC), Volume 9, Issue 3, 305–317.
Bishop, M., Engle, S., Peisert, S., D., & Whalen, S. (2012). Network-theoretic classification of parallel computation patterns. International Journal of High Performance Computing Applications (IJHPCA), Volume 26, Number 2, 159–169.
Bosl, W. (2012). Neurotechnology and psychiatric biomarkers. In D. Ghista, (ed), Biomedical Engineering – Book 3. InTech Publishers.
Bresson, X., Laurent, T., Uminsky, D., & von Brecht, J.H. (2012). Convergence and energy landscape for Cheeger cut clustering. Advances in Neural Information Processing Systems 25 (NIPS 2012), 1394-1402.
Devlin, S., & Treloar, T. (2012). Network-based criterion for the success of cooperation in an evolutionary prisoner's dilemma. Devlin, S., & Treloar, T. (2012). Phys. Rev. E 86, 26-113.
Engle, S. & Whalen, S. (2012). Visualizing distributed memory computations with hive plots. Proceedings of the Ninth International Symposium on Visualization for Cyber Security (VizSec), Seattle, Washington, 56–63, October 2012.
Hamrick, J., Russ, J., Bu, K., & Cizdziel, J. (2012). Laser ablation - Inductively coupled plasma-mass spectometry analysis of lower Pecos rock paints and possible pigment sources. Collaborative Endeavors in the Chemical Analysis of Art and Cultural Heritage Materials. Washington, D.C.: American Chemical Society.
Sun, H., Uminsky, D., & Bertozzi, A.L. (2012). Stability and clustering of self-similar solutions of aggregation equations. Journal of Mathematical Physics, Volume 53, 115-610.
Uminsky, D., Wayne, C.E., & Barbaro, A. (2012). A multi-moment vortex method for 2D viscous fluids. Journal of Computational Physics, Volume 231(1), 1705-1727.
von Brecht, J. & Uminsky, D. (2012). On soccer balls and linearized inverse statistical mechanics. Journal of Nonlinear Science, Volume 22, Issue 6, 935-959.
Bosl, W., Tager-Flusberg, H., Tierney, A., & Nelson, C.A. (2011). EEG complexity as a biomarker for autism spectrum disorder risk. BMC Medicine, 9:18.
Suh, M., Chen, C.A., Woodbridge, J., Tu, M.K., Kim, J.I., Nahapetian, A., Evangelista, L.S. and Sarrafzadeh, M. (2011). "A Remote Patient Monitoring System for Congestive Heart Failure." Journal of Medical Systems, 35(5), 1165-1179.
Suh, M., Woodbridge, J., Lan, M., Bui, A., Evangelista, L.S. and Sarrafzadeh, M. (2011). Missing Data Imputation for Remote CHF Patient Monitoring. Presented at International Conference of the IEEE Engineering in Medicine and Biology Society (EMBC), Boston.
Dixon, M.F., Bai, Z., Brush, C.F., Chung, F.I., Dogrul, E.C., & Kadir, T.N. (2011). Error control of iterative linear solvers for integrated groundwater models. Groundwater 49 N. 6, 859-865.
Dixon, M.F., Bradley, T., Chong, J. & Keutzer, K. (2011). Financial market value-at-risk estimation using the monte carlo method. In W. Hwu, M. Kaufmann (Eds.), GPU Computing Gems Jade Edition, (pp. 337-358). Waltham, MA: Morgan Koffman Publishers.
Dixon, M.F., Chong, J. & Keutzer, K. (2011). Enabling technology for more pervasive and responsive market risk management systems. In M. Wong (Ed.), Risk of Investment Products (pp. 205-224), Singapore: World Scientific Publishing Co.
Dixon, M.F., Chong, J., & Keutzer, K., (2011). Accelerating value-at-risk estimation on highly parallel architectures. WHPCF Special Issue of the Journal of Concurrency and Computation: Practice and Experience.
Dorai-Raj, S., Interian, Y., Naverniouk, I., & Zigmond, D. (2011). Adapting online advertising techniques to television. Online Multimedia Advertising: Techniques and Technologies (pp. 148-165). Hershey, PA: Information Science Reference.
Hamrick, J., Taqqu, M.S., & Pecatti, G. (2011). Practical implementation using Mathematica. Appendix A, Wiener chaos: Moments, cumulants, and diagrams. Milan, Italy: Bocconi-Springer.
Hamrick, J. (2011). Using local correlation to explain success in baseball. Journal of Quantitative Analysis in Sports, Volume 7: Issue 4.
Hamrick, J., Kardaras, K., Taqqu, M.S., & Huang, Y. (2011). Maximum penalized quasi-likelihood estimation of the diffusion function. Quantitative Finance, 11:11.
Hamrick, J., & Rasp, J. (2011). Using local correlation to explain success in baseball. Journal of Quantitative Analysis in Sports. Volume 7, Issue 4.
Hamrick, J., & Rasp, J. (2011). The connection between race and called strikes and balls. Journal of Sports Economics, 00(0), 1-21.
Parr, T., Fisher, K. (2011). The foundation of the ANTLR parser generator. Programming language design and implementation (PLDI), San Jose, CA.
Steiner, S.H., Stevens, N.T., Browne, R., & MacKay, R.J. (2011). Planning and analysis of measurement reliability studies. Canadian Journal of Statistics, 39(2), 344–355.
Stevens, N.T., Smith, I.R., Steiner, S.H. & MacKay, R.J. (2011). Monitoring radiation in cardiology imaging equipment. Medical Physics, 38(1), 317–326.
Devlin, S., & Treloar, T. (2010). Reply to comment on cooperation in an evolutionary prisoners dilemma on networks with degree-degree correlations. Phys Rev. E. 82, 038-102.
Dorai-Raj, S., Interian, Y., & Zigmond, D. (2010). Evaluating TV ad campaigns using set-top box data. Re:Think 2010.
Simidchieva, B.I., Engle, S., Clifford, M., Jones, A.C., Peisert, S., Bishop, M., Clarke, L.A., & Osterweil, L.J. (2010). Modeling and analyzing faults to improve election process robustness. In the Proceedings of the USENIX Electronic Voting Technology Workshop/Workshop on Trustworthy Elections (EVT/WOTE).
Stevens, N.T., Browne, R., Steiner, S.H. & MacKay, R.J. (2010). Augmented measurement system assessment. Journal of Quality Technology, 42(4), 388–399.
Suh, M., Evangelista, L.S., Chen, C.A., Han, K., Kang, J., Tu, M.K., Chen, V., Nahapetian, A. and Sarrafzadeh, M. (2010). An Automated Vital Sign Monitoring System for Congestive Heart Failure Patients. Presented at ACM SIGHIT International Health Informatics Symposium (IHI), Miami.
Suh, M., Evangelista, L.S., Chen, V., Hong, W.S., Macbeth, J., Nahapetian, A., Figueras, F.J. and Sarrafzadeh, M. (2010). WANDA B.: Weight and Activity with Blood Pressure Monitoring System for Heart Failure Patients. Presented at The Second International IEEE WoWMoM Workshop on Interdisciplinary Research on E-Health Services and Systems (IREHSS), Lucca, Italy.
Devlin, S., & Treloar, T. (2009). Cooperation in an evolutionary prisoners dilemma game on networks with degree-degree correlations. Phys. Rev. E 80, 26-105.
Devlin, S., & Treloar, T. (2009). Evolution of cooperation through the heterogeneity of random networks. Phys. Rev. E 79, 16-107.
Interian, Y., Dorai-Raj, S., Naverniouk, I., Opalinski, P. J., Kaustuv, & Zigmond, D. (2009). Ad quality on TV: Predicting television audience retention. Proceedings of International workshop on Data Mining and Audience Intelligence for Advertising (ADKDD).
Interian, Y., Dorai-Raj, S., Naverniouk, I., Opalinski, P. J., Kaustuv, & Zigmond, D. (2009). Do Viewers Care? Understanding the impact of ad creatives on TV viewing behavior. Re:Think 2009 .
Zigmond, D., Interian, Y., Lanning, S., Hawkins, J., Mirisola, R., Rowe, S., & Volovich, Y. (2009). When viewers control the schedule: Measuring the impact of digital video recording on TV viewership. Key Issues Forums at ARF Audience Measurement Conference, 2009.
Dixon, M.F., Chong, J., & Keutzer, K. (2009). Acceleration of market value-at-risk estimation. WHPCF '09: ACM Proceedings of the 2nd Workshop on High Performance Computational Finance (pp. 1-8). Held in conjunction with Supercomputing 09.
Parr, T. (2009). Language Implementation Patterns. Dallas, TX: Pragmatic Bookshelf.
Suh, M., Lee, K., Heu, A., Nahapetian, A. and Sarrafzadeh, M. (2009). Bayesian Networks-Based Interval Training Guidance System for Cancer Rehabilitation. Presented at Conference on Mobile Computing, Applications, and Services (MobiCASE), San Diego.
Suh, M., Dorman, K., Yahyanejad, M., Nahapetian, A., Sarrafzadeh, M., McCarthy, W. and Kaiser, W. (2009). Nutrition Monitor: A Food Purchase and Consumption Monitoring Mobile System. Presented at Conference on Mobile Computing, Applications, and Services (MobiCASE), San Diego.
Suh, M., Lee, K., Nahapetian, A. and Sarrafzadeh, M. (2009). Interval Training Guidance System with Music and Wireless Group Exercise Motivations. Presented at IEEE Symposium on Industrial Embedded Systems (SIES), Lausanne, Switzerland.
Suh, M., Rofouei, M., Nahapetian, A., Kaiser, W.J. and Sarrafzadeh, M. (2009). Optimizing Interval Training Protocols Using Data Mining Decision Trees. Presented at Body Sensor Networks (BSN).
Intrevado, P., Abel, S.R. & Hartgrove, K. (2008). Validation study: Clarity Multistrip Urocheck. Clinical Laboratory Science, Vol. 21, No. 3.
Intrevado, P., Abel, S.R., Kelm, M., & Jackson, H. (2008). Interdisciplinary analysis of chemotherapy preparation at a pediatric hospital. Journal for Healthcare Quality - Special Pediatric Issue, Vol. 30, No. 5.
Bosl, W. (2007). Systems biology by the rules: Hybrid intelligent systems for pathway discovery and analysis. BMC Systems Biology, 1:13.
Intrevado, P., Kopach, R., DeLaurentis, P-C., Lawley, M., Muthuraman, K., Ozsen, L., Rardin, R, Wan, H., Qu, X., & Willis, D. (2007). Effects of clinical characteristics on successful open access scheduling. Journal of Health Care Management Science, Vol.10, No. 2.
Lawson, B., Orrison, M., & Uminsky, D. (2006). Spectral analysis of the supreme court. Mathematics Magazine, 79(5), 340-346.
Guy Brizan, D., and Uz Tansel, A. (2006). A survey of entity resolution and record linkage methodologies. Communications of the IIMA. 6(3).
Guy Brizan, D. (2006). Evaluation of Equivalence of French/English Idiomatic Pairs. Technical Report. San Francisco State University.