Skip to main content

Statistical Programming and Data Management

Study Course Description

Course Description Statuss:Approved
Course Description Version:5.00
Study Course Accepted:04.09.2023 14:59:47
Study Course Information
Course Code:SL_108LQF level:Level 7
Credit Points:4.00ECTS:6.00
Branch of Science:Mathematics; Theory of Probability and Mathematical StatisticsTarget Audience:Life Science
Study Course Supervisor
Course Supervisor:Andrejs Ivanovs
Study Course Implementer
Structural Unit:Statistics Unit
The Head of Structural Unit:
Contacts:23 Kapselu street, 2nd floor, Riga, statistikaatrsu[pnkts]lv, +371 67060897
Study Course Planning
Full-Time - Semester No.1
Lectures (count)12Lecture Length (academic hours)2Total Contact Hours of Lectures24
Classes (count)12Class Length (academic hours)2Total Contact Hours of Classes24
Total Contact Hours48
Part-Time - Semester No.1
Lectures (count)12Lecture Length (academic hours)1Total Contact Hours of Lectures12
Classes (count)12Class Length (academic hours)2Total Contact Hours of Classes24
Total Contact Hours36
Study course description
Preliminary Knowledge:
No specific prerequisites are demanded, however computer skills, high-school level algebra and statistics concepts will be used in the course.
Objective:
Skills of the data analysis and management are critical to modern applied statistical research. The aim of the course is to introduce students to two statistical software tools used in biostatistics research: R and Jamovi. Thus, the objectives of the course are: • Introduce the students to the statistical programming and data management using R and Jamovi statistical software; • Prepare students for computer work in other courses of the Biostatistics study programme.
Topic Layout (Full-Time)
No.TopicType of ImplementationNumberVenue
1Introduction to R language and R Studio. Interface, workflow, scripts and coding basics. Introduction to data visualization with ggplot – aesthetic mappings, geometric objects and statistical transformations.Lectures1.00computer room
2Practice using R interface. Practice answering simple questions about data through creating ggplot2 visualizations of R built-in datasets.Classes1.00computer room
3Introduction to data wrangling (transformation) with dplyr package: filter, arrange, select, create new variables and summarize. Introducing R projects workflow.Lectures1.00computer room
4Practice various data transformations with R dplyr package on R built-in datasets.Classes1.00computer room
5Exploratory data analysis: assessing variation and covariation. Statistical summaries with boxplots. Reading various types of data into R.Lectures1.00computer room
6Practice exploratory data analysis in R: data variation (barplots, histograms, boxplots) and covariation (visualizing two variable relation). Practicing readr package for reading data.Classes1.00computer room
7Principles of consistently organizing data in R with tidyr package: gather and spread datasets, deal with NA values. Introducing R Markdown reports system, various Markdown formats and principles of writing mathematical symbols.Lectures1.00computer room
8Importing and tidying a real-life dataset. Creating an R Markdown report.Classes1.00computer room
9Introduction to data management in R. Relational data principles: relations, keys, joins and set operations. Some common non-numeric variable types in R. Organizing multiple operations with pipes.Lectures1.00computer room
10Managing relational data (i.e., multiple tables) with dplyr. Practice dealing with non-numeric variable types in R: factors, strings, dates.Classes1.00computer room
11Advanced R programming: writing your own R functions, function vectorization, conditional execution (if, else statements), for loops and map functions.Lectures1.00computer room
12Exercises on function writing and programming elements.Classes1.00computer room
13The first stage of data preparation. Data structuring and preparation for the Jamovi software.Lectures1.00computer room
14Practical class in data structuring.Classes1.00computer room
15Second stage of data preparation. Data cleaning, creation and transformation of new variables.Lectures1.00computer room
16Practical class in data cleaning and creation of new variables.Classes1.00computer room
17Introduction to Power Query.Lectures1.00computer room
18Use of artificial intelligence in data analysis.Classes1.00computer room
19Introduction to Jamovi software. Data input, export. Teaching R using Jamovi.Lectures1.00computer room
20Functions: Filter, Compute, Transform.Classes1.00computer room
21Practical lesson with functions: Filter, Compute, Transform.Lectures1.00computer room
22Fast data exploration. Data visualization with Jamovi.Classes1.00computer room
23Work in groups. From paper to prepared data.Lectures1.00computer room
24Work in groups. From paper to prepared data.Classes1.00computer room
Topic Layout (Part-Time)
No.TopicType of ImplementationNumberVenue
1Introduction to R language and R Studio. Interface, workflow, scripts and coding basics. Introduction to data visualization with ggplot – aesthetic mappings, geometric objects and statistical transformations.Lectures1.00computer room
2Practice using R interface. Practice answering simple questions about data through creating ggplot2 visualizations of R built-in datasets.Classes1.00computer room
3Introduction to data wrangling (transformation) with dplyr package: filter, arrange, select, create new variables and summarize. Introducing R projects workflow.Lectures1.00computer room
4Practice various data transformations with R dplyr package on R built-in datasets.Classes1.00computer room
5Exploratory data analysis: assessing variation and covariation. Statistical summaries with boxplots. Reading various types of data into R.Lectures1.00computer room
6Practice exploratory data analysis in R: data variation (barplots, histograms, boxplots) and covariation (visualizing two variable relation). Practicing readr package for reading data.Classes1.00computer room
7Principles of consistently organizing data in R with tidyr package: gather and spread datasets, deal with NA values. Introducing R Markdown reports system, various Markdown formats and principles of writing mathematical symbols.Lectures1.00computer room
8Importing and tidying a real-life dataset. Creating an R Markdown report.Classes1.00computer room
9Introduction to data management in R. Relational data principles: relations, keys, joins and set operations. Some common non-numeric variable types in R. Organizing multiple operations with pipes.Lectures1.00computer room
10Managing relational data (i.e., multiple tables) with dplyr. Practice dealing with non-numeric variable types in R: factors, strings, dates.Classes1.00computer room
11Advanced R programming: writing your own R functions, function vectorization, conditional execution (if, else statements), for loops and map functions.Lectures1.00computer room
12Exercises on function writing and programming elements.Classes1.00computer room
13The first stage of data preparation. Data structuring and preparation for the Jamovi software.Lectures1.00computer room
14Practical class in data structuring.Classes1.00computer room
15Second stage of data preparation. Data cleaning, creation and transformation of new variables.Lectures1.00computer room
16Practical class in data cleaning and creation of new variables.Classes1.00computer room
17Introduction to Power Query.Lectures1.00computer room
18Use of artificial intelligence in data analysis.Classes1.00computer room
19Introduction to Jamovi software. Data input, export. Teaching R using Jamovi.Lectures1.00computer room
20Functions: Filter, Compute, Transform.Classes1.00computer room
21Practical lesson with functions: Filter, Compute, Transform.Lectures1.00computer room
22Fast data exploration. Data visualization with Jamovi.Classes1.00computer room
23Work in groups. From paper to prepared data.Lectures1.00computer room
24Work in groups. From paper to prepared data.Classes1.00computer room
Assessment
Unaided Work:
• Individual work with lecture material preparing to all lectures of study course. • Independently prepare assigned 2 computer projects practicing the concepts studied in the course. In order to evaluate the quality of the study course as a whole, the student must fill out the study course evaluation questionnaire on the Student Portal.
Assessment Criteria:
Assessment on the 10-point scale according to the RSU Educational Order: • Two computer projects to be signed in (one related to R and one related to Jamovi): 25% × 2 = 50% • Final written exam – 50%.
Final Examination (Full-Time):Exam (Written)
Final Examination (Part-Time):Exam (Written)
Learning Outcomes
Knowledge:• Know, select and use independently main programming principles in R and Jamovi. • Use the database management principles in R and Jamovi. • Learn and operate advanced programming elements such as conditional execution, cycles and customized functions.
Skills:After this course, students will be able to: • Examine various types of data into the software and organize it for analysis. • Complete data transformation and visualization with R and Jamovi. • Write and use their own R functions to automate common tasks.
Competencies:Students will be able to: • Use qualitatively R and Jamovi software for statistical analysis in other biostatistics courses. • Recognize the differences between R and Jamovi programs and choose the most suitable for their analysis. • Profound their statistical programming skills independently, to perform research or analyse health-related data.
Bibliography
No.Reference
Required Reading
1Wickham, H. and Grolemund, G. 2016. R for Data Science. Import, Tidy, Transform, Visualize, and Model Data. O'Reilly.
2Navarro, D. J. and Foxcroft, D. R. 2022. learning statistics with jamovi: a tutorial for psychology students and other beginners. (Version 0.75). DOI: 10.24384/hgc3-7p15.
Other Information Sources
1https://www.statmethods.net/
2https://rstudio.com/resources/cheatsheets/
3https://ggplot2.tidyverse.org/