2024-2025 Academic Catalog
Welcome to Virginia Tech! We are excited that you are here planning your time as a Hokie.
Welcome to Virginia Tech! We are excited that you are here planning your time as a Hokie.
Statistics courses are offered at both the undergraduate and the graduate levels for students preparing for professions in statistics, for students who need statistical tools to engage in scientific research, and for students who want to acquire knowledge of the important concepts of probability and statistical inference.
Statistics courses for graduate students and programs leading to the M.S. and Ph.D. degrees in statistics are described in the Graduate Catalog and in a special bulletin available from the department.
All statistics majors are required to own specified personal computers and software. Consult the department for details.
Internship positions are available in industry and government, offering valuable practical experience. Students participating in such an experience can receive academic credit which will count towards graduation requirements.
Please visit the University Registrar website at http://registrar.vt.edu/graduation-multi-brief/index1.html to view requirements for the minor.
The department reserves the right to withhold credit if a student takes a course, the content of which is partially duplicated in a course already taken (see "Course Duplications" below).
Associated with the Department, the Statistical Applications and Innovations Group (SAIG) provides assistance for research projects to participating members of the University community and outside organizations. Statistics Department faculty members and students collaborate to design studies, analyze data, and interpret results for Virginia Tech affiliated clients and external clients in business, industry, government, and non-profit organizations. SAIG provides both experiential learning for statistics students and service to the University and beyond. To learn more, visit https://saig.stat.vt.edu/.
University policy requires that students who are making satisfactory progress toward a degree meet minimum criteria toward the Pathways to General Education and toward the degree.
Satisfactory progress requirements toward the B.S. in Statistics can be found on the major checksheet by visiting the University Registrar website at https://www.registrar.vt.edu/graduation-multi-brief/checksheets.html.
Many statistics courses involve the use of statistics software, primarily MINITAB, SAS, JMP or R. Experience with the software is not expected, but students should have familiarity with either the Windows or Macintosh operating system and have access to a computer.
Many of the upper-division courses include a project, generally to be completed in small groups. These projects are designed to give students the kind of insight and experience in realistic statistical practice that cannot be obtained in classroom lectures or short-term homework assignments.
Head: D. Higdon
Professors: P. Du, M. Ferreira, R. Fricker, R. Gramacy, F. Guo, D. Higdon, Y. Hong, I. Hoeschelle, I. Kim, J. Morgan, E. Smith, G. Vining.
Associate Professors: X. Deng, C. Franck, P, L. House, L. Johnson, Leman, G. Terrell, X. Wu, H. Zhu
Professor of Practice: A. Hanlon, J. Van Mullekom, T. Woteki
Associate Professor of Practice: F. Faltin, A. Patterson
Assistant Professors: M. Liu, X. Xing, and J. Datta
Collegiate Associate Professors: A. Driscoll, J. Robertson Evia
Collegiate Assistant Professors: C. Lucero, H. Mahmoud, F. McCarty, Sierra Merkes
Research Professor: L. Freeman
Research Associate Professor: A. Tegge
Instructors: J. Loda, J. Russell, H. Tavera, Z. Zhang
Introduction to the field of statistics and aspects of college life for first year students. Topics included: history of the statistics; key roles of statisticians in field, such as actuarial sciences, pharmaceutical, medical, and bioinformatics industries, governmental agencies, academia; fundamental principles of statistical fields of study and applications; exploring data sets; and aspects of college life for first-year students.
Develop and practice the process of thinking critically with data in the context of real world problems. Import, manage, summarize, and visualize data using programmable, statistical software. Make data discoveries, make decisions, generate hypotheses, and/or communicate findings in data. Consider laws of probability and personal biases to weigh decisions. Recognize ethical issues and vulnerabilities in analyses when learning from data and extrapolating to large populations.
Fundamental concepts and methods of statistics with emphasis on interpretation of statistical arguments and statistical reasoning. Using modern, accessible statistical software and technology, an introduction to design of experiments (including data collection), data analysis, data visualization, correlation and regression, concepts of probability theory, sampling errors, confidence intervals, and hypothesis tests. Include real-world applications to develop problem-solving skills and consider ethical implications within the context of learning from data. No credit will be given for 2004 if taken with or after any other statistics course, except STAT 2984.
Introduction to R/RStudio programming techniques with an emphasis on basic statistical visualizations, descriptive and summary statistics, and elementary inferential statistics. Topics include data types, data structures, importing/exporting, and manipulating datasets, functions, packages, and RMarkdown.
Use of Python code and libraries (SciPy and NumPy) to support basic statistical tasks, create graphical displays, and perform statistical inference and hypothesis tests to evaluate datasets. Use of editors and AI to generate Python code.
Honors section.
3005: Basic statistical methodology: exploratory data techniques, estimation, inference, comparative analysis by parametric, nonparametric, and robust procedures. Analysis of variance (one-way), multiple comparisons, and categorical data. Includes real-world examples. Develops problem-solving skills and ethical reasoning within the context of learning from data. 3006: Analysis of variance, simple and multiple, linear and nonlinear regression, analysis of covariance. Use of MINITAB. STAT 3005 duplicates STAT 3615 and STAT 4604, only one may be taken for credit. STAT 3006 duplicates STAT 3616, STAT 4604 and STAT 4706, only one may be taken for credit.
3005: Basic statistical methodology: exploratory data techniques, estimation, inference, comparative analysis by parametric, nonparametric, and robust procedures. Analysis of variance (one-way), multiple comparisons, and categorical data. Includes real-world examples. Develops problem-solving skills and ethical reasoning within the context of learning from data. 3006: Analysis of variance, simple and multiple, linear and nonlinear regression, analysis of covariance. Use of MINITAB. STAT 3005 duplicates STAT 3615 and STAT 4604, only one may be taken for credit. STAT 3006 duplicates STAT 3616, STAT 4604 and STAT 4706, only one may be taken for credit.
Introduction to basic programming techniques: creating DATA and PROC statements, libraries, functions, programming syntax and formats. Other topics include loops, SAS Macros and PROC IML. Emphasis is placed on using these tools for statistical analyses. The pre-requisite may be substituted for an equivalent course.
Probability theory, including set theoretic and combinatorial concepts; in-depth treatment of discrete random variables and distributions, with some introduction to continuous random variables; introduction to estimation and hypothesis testing.
Using quantitative and qualitative thinking to develop a working knowledge of data visualization considerations, methods and techniques that lead to: understanding the audience(s); creating ethical data stories; data visualization as a method of storytelling; ethical and appropriate data exploration, manipulation, and cleaning; design considerations; types of visualizations; tools and resources for creating visualizations.
Introduction to sports analytics, sources of sports analytics data and data collection methods, visualization techniques, game performance statistics, inferential statistics and predictive modeling techniques for sports data. Role and applications of data analytics in the sports industry.
Statistical methodology based on ranks, empirical distributions, and runs. One and two sample tests, ANOVA, correlation, goodness of fit, and rank regression, R-estimates and confidence intervals. Comparisons with classical parametric methods. Emphasis on assumptions and interpretation.
Statistical methods for nominal, ordinal, and interval levels of measurement. Topics include descriptive statistics, elements of probability, discrete and continuous distributions, one and two sample tests, measures of association. Emphasis on comparison of methods and interpretations at different measurement levels. Includes real-world applications to develop problem-solving skills and ethical reasoning within the context of learning from data.
Descriptive and inferential statistics in a biological context with real-world examples. In analytical contexts, develops problem-solving skills and ethical reasoning. 3615: Fundamental principles, one- and two-sample parametric inference, simple linear regression, frequency data. 3616: One- and two-way ANOVA, multiple regression, correlation, nonparametrics, using a computer package. STAT 3615 partially duplicates STAT 3005 and STAT 4604, only one may be taken for credit. STAT 3616 partially duplicates STAT 3006, 4604 and 4706, only one may be taken for credit.
Descriptive and inferential statistics in a biological context with real-world examples. In analytical contexts, develops problem-solving skills and ethical reasoning. 3615: Fundamental principles, one- and two-sample parametric inference, simple linear regression, frequency data. 3616: One- and two-way ANOVA, multiple regression, correlation, nonparametrics, using a computer package. STAT 3615 partially duplicates STAT 3005 and STAT 4604, only one may be taken for credit. STAT 3616 partially duplicates STAT 3006, 4604 and 4706, only one may be taken for credit.
Basic principles and techniques in data analytics; methods for the collection of, storing, accessing, and manipulating standard-size and large datasets; data visualization; and identifying sources of bias.
Introduction to statistical methodology with emphasis on engineering experimentation: probability distributions, estimation, hypothesis testing, regression, and analysis of variance. Only one of the courses 3704, 4604, 4705, and 4714 may be taken for credit.
Computationally intensive computer methods used in statistical analyses. Statistical univariate and multivariate graphics; resampling methods including bootstrap estimation and hypothesis testing and simulations; classification and regression trees; scatterplot smoothing and splines.
Theory and examples of effective communication in the context of statistical collaborations. Practice developing the communication skills necessary to be effective statisticians using peer feedback and self-reflection. Topics include helping scientists answer their research questions, writing about and presenting statistical concepts to a non-statistical audience, and managing an effective statistical collaboration meeting. Senior standing in the Department of Statistics.
Introduction to R programming techniques with an emphasis on statistical analyses. Topics include: data objects, loops, importing/exporting datasets, graphics, functions, t-tests, ANOVA, linear regression, nonparametric tests, and logistic regression.
4105: Probability theory, counting techniques, conditional probability; random variables, moments; moment generating functions; multivariate distributions; transformations of random variables; order statistics. 4106: Convergence of sequences of random variables; central limit theorem; methods of estimation; hypothesis testing; linear models; analysis of variance. STAT 4105 partially duplicates STAT 4705, STAT 4714, and STAT 4724, only one may be taken for credit.
4105: Probability theory, counting techniques, conditional probability; random variables, moments; moment generating functions; multivariate distributions; transformations of random variables; order statistics. 4106: Convergence of sequences of random variables; central limit theorem; methods of estimation; hypothesis testing; linear models; analysis of variance. STAT 4105 partially duplicates STAT 4705, STAT 4714, and STAT 4724, only one may be taken for credit.
Fundamental principles of designing and analyzing experiments with application to problems in various subject matter areas. Discussion of completely randomized, randomized complete block, and Latin square designs, analysis of covariance, split--plot designs, factorial and fractional designs, incomplete block designs.
Multiple regression including variable selection procedures; detection and effects of multicollinearity; identification and effects of influential observations; residual analysis; use of transformations. Non-linear regression, the use of indicator variables, and logistic regression. Use of SAS.
Statistical analysis of sports data. Game performance statistics and expected scores. Analysis of player performance, player tracking, team performance, and sports betting. Bayesian methods and prediction models applied to sports data. Decision-making. Assessing sports analytics research and literature.
Statistical methods for bioinformatics and genetic studies, with an emphasis on statistical analysis, assumptions, and problem-solving. Topics include: commonly used statistical methods for gene identification, association mapping and other related problems. Focus on statistical tools for gene expression studies and association studies, multiple comparison procedures, likelihood inference and preparation for advanced study in the areas of bioinformatics and statistical genetics.
Introduction to Bayesian methodology with emphasis on applied statistical problems: data displaying, prior distribution elicitation, posterior analysis, models for proportions, means and regression.
Non-mathematical study of multivariate analysis. Multivariate analogs of univariate test and estimation procedures. Simultaneous inference procedures. Multivariate analysis of variance, repeated measures, inference for dispersion and association parameters, principal components analysis, discriminate analysis, cluster analysis. Use of SAS.
Statistical approaches to analyze categorical data. Probability computation and distribution specification, interval estimation and hypothesis testing, formulating and fitting generalized linear models including logistic and Poisson regression, algorithms used for model fitting, variable selection, and classification trees and supervised learning.
Statistical methods for the design and analysis of survey sampling. Fundamental survey designs. Methods of randomization specific to various survey designs. Estimation of population means, proportions, totals, variances, and mean squared errors. Design of questionnaires and organization of a survey.
Applied course in time series analysis methods. Topics include regression analysis, detecting and address autocorrelation, modeling seasonal or cyclical trends, creating stationary time series, smoothing techniques, forecasting and forecast errors, and fitting autoregressive integrated moving average models.
Introduction to those topics in advanced calculus and linear algebra needed by statistics majors. Infinite sequences and series. Orthogonal matrices, projections, quadratic forms. Extrema of functions of several variables. Multiple integrals, including convolution and nonlinear coordinate changes.
Introduction to statistical methodology with emphasis on engineering applications: probability distributions, estimation, hypothesis testing, regression, analysis of variance, quality control. Only one of the courses 4604, 4705, and 4714 may be taken for credit. STAT 4604 partially duplicates STAT 3005, STAT 3615, STAT 3006, STAT 3616 and STAT 4706. Only one may be taken for credit.
A technical analytics course. Covers supervised and unsupervised learning strategies, including regression, generalized linear models, regularization, dimension reduction methods, tree-based methods for classification, and clustering. Upper-level analytical methods shown in practice: e.g., advanced naive Bayes and neural networks.
Stochastic modeling methods with an emphasis in computing are taught. Select concepts from the classical and Bayesian paradigms are explored to provide multiple perspectives for how to learn from complex, datasets. There is particular focus on nested, spatial, and time series models.
Basic concepts of probability and statistics with emphasis on engineering applications. 4705: Probability, random variables, sampling distributions, estimation, hypothesis testing, simple linear regression correlation, one-way analysis of variance. 4706: Multiple regression, analysis of variance, factorial and fractional experiments. Only one of the courses 3704, 4604, 4705, 4714, and 4724 may be taken for credit.
Basic concepts of probability and statistics with emphasis on engineering applications. 4705: Probability, random variables, sampling distributions, estimation, hypothesis testing, simple linear regression correlation, one-way analysis of variance. 4706: Multiple regression, analysis of variance, factorial and fractional experiments. Only one of the courses 3704, 4604, 4705, and 4714 may be taken for credit.
Introduction to the concepts of probability, random variables, estimation, hypothesis testing, regression, and analysis of variance with emphasis on application in electrical engineering. Only one of the courses 3704, 4604, 4705, 4714 and 4724 may be taken for credit.
Introduction to deep learning, including algorithms, theoretical motivations, and implementation in practice. Basic neural network architectures and optimizations. Multilayer perceptrons, backpropagation, automatic differentiation, and stochastic gradient descent. Convolutional neural networks, recurrent neural networks and the attention mechanism. Generative models, variational autoencoders, and generative adversarial networks. Reinforcement learning, Q learning and design of simple AI systems. Python programming language. Emphasis on efficient implementation, optimization, and scalability. Creation of deep learning models in the context of different types of real applications such as image classification and language processing.
Economic applications of mathematical and statistical techniques: regression, estimators, hypothesis testing, lagged variables, discrete variables, violations of assumptions, simultaneous equations.
Honors section.
Honors section.
Print this page.
The PDF will include all information unique to this page.