
Cleaning Data Project

This work was developed as the final project of the course Getting and Cleaning Data offered by The Johns Hopkins University through the Coursera platform.

Abstract about the data

Human Activity Recognition database built from the recordings of 30 subjects performing activities of daily living (ADL) while carrying a waist-mounted smartphone with embedded inertial sensors. The experiments were carried out with a group of 30 volunteers within an age bracket of 19-48 years. Each person performed six activities (WALKING, WALKING_UPSTAIRS, WALKING_DOWNSTAIRS, SITTING, STANDING, LAYING) wearing a smartphone (Samsung Galaxy S II) on the waist. Using its embedded accelerometer and gyroscope, 3-axial linear acceleration and 3-axial angular velocity were captured at a constant rate of 50 Hz. For more details, see the Human Activity Recognition Using Smartphones Data Set. Note: all of this information was collected from the official repository.

Data Organization

The experiment data were separated into two large sets, one for training and one for testing. There are 30 individuals who performed 6 different types of activities, and for each observation there are 561 variables holding normalized values ranging from -1 to 1. For more details, consult the README of the data.
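
As an illustration only, the sketch below shows one way such a dataset can be assembled in R, assuming the standard "UCI HAR Dataset" folder layout (X_*.txt feature matrices, y_*.txt activity codes, subject_*.txt identifiers, plus features.txt and activity_labels.txt); the object names are illustrative and not necessarily those used by run_analysis.R.

```r
# A minimal sketch of assembling the full dataset, assuming the standard
# "UCI HAR Dataset" folder layout. Object names here are illustrative.
features   <- read.table("UCI HAR Dataset/features.txt",
                         col.names = c("index", "name"),
                         stringsAsFactors = FALSE)
activities <- read.table("UCI HAR Dataset/activity_labels.txt",
                         col.names = c("code", "label"),
                         stringsAsFactors = FALSE)

read_set <- function(set) {
  x <- read.table(file.path("UCI HAR Dataset", set, paste0("X_", set, ".txt")),
                  col.names = features$name)
  y <- read.table(file.path("UCI HAR Dataset", set, paste0("y_", set, ".txt")),
                  col.names = "code")
  subject <- read.table(file.path("UCI HAR Dataset", set, paste0("subject_", set, ".txt")),
                        col.names = "subject")
  # Attach subject ids and descriptive activity names to the 561 feature columns.
  cbind(subject, activity = activities$label[match(y$code, activities$code)], x)
}

# Bind the training and test sets into one full dataset.
full_data <- rbind(read_set("train"), read_set("test"))
```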

Project Proposal

The goal of the project is to deliver three artifacts. The first is the complete dataset containing the training and test data, with correctly named columns and descriptive activity names. The second and third artifacts are analyses that must be performed on the full dataset (both are sketched in the code after the list):

  • Second artifact: a data set that contains the mean and standard deviation of each of the 561 variables.
  • Third artifact: a data set containing the average of the 88 mean and standard deviation variables, segmented by the activity to which they belong and the subject who performed them.
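
A minimal base-R sketch of both analyses, assuming a merged data frame full_data whose first two columns are subject and activity and whose remaining columns are the selected features (as in the sketch above); again, the names are illustrative, not the script's actual identifiers.

```r
# Names of the feature columns (everything except subject and activity).
feature_cols <- setdiff(names(full_data), c("subject", "activity"))

# Second artifact: mean and standard deviation of every feature column.
artifact2 <- data.frame(
  variable = feature_cols,
  mean     = sapply(full_data[feature_cols], mean),
  sd       = sapply(full_data[feature_cols], sd)
)

# Third artifact: the average of each feature, grouped by activity and subject.
artifact3 <- aggregate(full_data[feature_cols],
                       by = list(activity = full_data$activity,
                                 subject  = full_data$subject),
                       FUN = mean)
```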

"run_analysis.R" Script

This script completes all the steps described in CodeBook.md: it reads all the data, assembles the datasets (training, test, and full), filters them down to the mean and standard deviation variables, and performs the analyses described in the Project Proposal.
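
The filtering step could look roughly like the following, assuming the feature columns keep the order of features.txt (after the subject and activity columns) and that the target features are the ones whose names contain "mean()" or "std()"; this is a sketch, not the script's exact code.

```r
# Keep subject, activity, and only the mean()/std() feature columns,
# assuming features are in the same order as features.txt.
mean_std_idx  <- grep("mean\\(\\)|std\\(\\)", features$name)
filtered_data <- full_data[, c(1, 2, mean_std_idx + 2)]
```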

Finished Datasets

The data folder contains the three project artifacts, exported in both .csv and .txt formats.
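
As an illustration, exporting one of the artifacts in both formats might look like this (the file names below are hypothetical, not the repository's actual artifact names):

```r
# Hypothetical export of one artifact in both formats.
write.csv(artifact3, "data/artifact3.csv", row.names = FALSE)
write.table(artifact3, "data/artifact3.txt", row.names = FALSE)
```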
