what is colaboratory(Colab)?
the environment where user can study data analysis using Python online(browser)
why do we use colab?
-don't need to install python
-access everywhere when the internet is enabled)
-faster than my computer
-similar to Java
1) Variable
can hold one value in one variable
if value is digit, they can do four basic operations( +, - , * , /)
2) List
can hold multiple values sequentially
When declaring individual variables for large number of values, using list can be helpful for managing them
3) Dictionary
can hold multiple values
key:value pair structure
1) import Panda library
2) go and get data
3) read data
4) remove blank field
4-1) identify the number of null value
4-2) remove null value
var.head() - print the first 5 rows of the data ( 5 - default/you can edit the number if needed) var.isnull().sum() - identify the number of null values var.dropna() -remove blank fields
5)analyze data - find out correlation
1) import matplotlib
2) draw graph
3) leave the part you will use for graph
3-1) specify the row to draw
4) remove unnecessary part of the graph
5) change to the bar graph
+numpy
used to data operations
can calculate large amount of complex numbers
+seaborn
used for data visualization
has various style of graph
Today I learned
-how to read and analyze data using Python
-visualize data on the graph based on the analyzed data
-characteristics of the libraries