Uploaded by MRIGAANK JASWAL

dt8

advertisement
University Institute of Engineering
Department ofComputer Science & Engineering
Experiment number : 98
Student Name:Mrigaank Jaswal
UID:22BCS14681
Branch: Computer Science & Engineering
Section/Group: 404/B
Semester: First
Date of Performance: 7 DEC 2022
Subject Name: DISRUPTIVETECHNOLOGIES-I
Subject Code: 22ECH-102
1.Aim of the practical: Apply appropriate machinelearning model for
accurateprediction of air quality index.
2. Tool Used: Google Colab , Pandas Module , Numpy module
Panda module 
pandas is a Python package that provides fast, flexible, and expressive data structures
designed to make working with "relational" or "labeled" data both easy and intuitive. It aims
to be the fundamental high-level building block for doing practical, real world data analysis
in Python. Additionally, it has the broader goal of becoming the most powerful and flexible
open source data analysis / manipulation tool available in any language. It is already well
on its way towards this goal.
Numpy module 
NumPy is a module for Python. The name is an acronym for "Numeric Python" or "Numerical
Python". It is pronounced /ˈnʌmpaɪ/ (NUM-py) or less often /ˈnʌmpi (NUM-pee)). It is an
extension module for Python, mostly written in C. This makes sure that the precompiled
mathematical and numerical functions and functionalities of Numpy guarantee great execution
speed.
University Institute of Engineering
Department ofComputer Science & Engineering
3. Basic Concept/ Command Description:
Machine Learning 
Machine learning (ML) is a field of inquiry devoted to understanding and building
methods that 'learn', that is, methods that leverage data to improve performance on some
set of tasks.[1] It is seen as a part of artificial intelligence. Machine learning algorithms
build a model based on sample data, known as training data, in order to make predictions
or decisions without being explicitly programmed to do so.[2] Machine learning
algorithms are used in a wide variety of applications, such as in medicine, email
filtering, speech recognition, agriculture, and computer vision, where it is difficult or
unfeasible to develop conventional algorithms to perform the needed tasks.[3][4] A
subset of machine learning is closely related to computational statistics, which focuses on
making predictions using computers, but not all machine learning is statistical learning.
The study of mathematical optimization delivers methods, theory and application
domains to the field of machine learning. Data mining is a related field of study, focusing
on exploratory data analysis through unsupervised learning.[6][7] Some implementations
of machine learning use data and neural networks in a way that mimics the working of
a biological brain.[8][9] In its application across business problems, machine learning is
also referred to as predictive analytics.
4. Code:
Task 1  Download required modules . Download Numpy and Pandas
Task 2  Now Input the data set into Your Project file and provide Variables to the
File path .
University Institute of Engineering
Department ofComputer Science & Engineering
Task 3  Now Import the Data and subsetting the station .
Task 4  Now We will use head() function to view the first n rows of a pandas
DataFrame.
University Institute of Engineering
Department ofComputer Science & Engineering
Task 5  Now We will use Group the Data with pandas DataFrame.
And later we will do Sub Indexing calculations
Task 6 Now we Calculate sub Indexing of various Groups and will provides their in
Comments
University Institute of Engineering
Department ofComputer Science & Engineering
University Institute of Engineering
Department ofComputer Science & Engineering
University Institute of Engineering
Department ofComputer Science & Engineering
\
University Institute of Engineering
Department ofComputer Science & Engineering
University Institute of Engineering
Department ofComputer Science & Engineering
Task 7 Now We have to find AQI values of Given by Rounding off all the sub indexing
University Institute of Engineering
Department ofComputer Science & Engineering
University Institute of Engineering
Department ofComputer Science & Engineering
Task 8 Now We have to Print Number of condition found as result of AQI Values
University Institute of Engineering
Department ofComputer Science & Engineering
Task 9  Now we will have to Read the CSV data in Panda
University Institute of Engineering
Department ofComputer Science & Engineering
Task 10  Now we will have to load the CSV data in Panda and print the result
University Institute of Engineering
Department ofComputer Science & Engineering
Learning outcomes (What I have learnt):
1. I get Know about Air Quality Index
2. Learnt about Panda
3. Learnt About Numpy
4. I get know about the accuracy of the Dataset
Evaluation Grid (To be filled by Faculty):
Sr. No.
1.
2.
3.
Parameters
Student Performance (task
implementation and result evaluation)
Viva-Voce
Worksheet Submission (Record)
Signature of Faculty (with Date):
Marks Obtained
Maximum Marks
12
10
Total Marks Obtained:
8
30
Download