๐Ÿผ pandas๊ฐ€ ๋ญ˜๊นŒ?

yeeun leeยท2020๋…„ 7์›” 1์ผ
1

์–ผ๋งˆ ์ „ ์„œ์ ์—์„œ ๊ฐœ๋ฐœ ๊ด€๋ จ ์„œ์ ์„ ์ฐพ์•„๋ณด๋‹ค๊ฐ€, pandas ๊ด€๋ จ ์ฑ…๋“ค์„ ๋งŽ์ด ๋ณด์•˜๋‹ค. ์ด๋ฆ„์€ ํŒ๋‹ค ๊ฐ™์•„์„œ ๊ท€์—ฌ์šด๋ฐ ๋ฌด์—‡์ผ์ง€ ๊ถ๊ธˆํ•ด์„œ ์ฐพ์•„๋ณด์•˜๋‹ค.

Definition

์œ„ํ‚ค๋ฐฑ๊ณผ๊ฐ€ ๋งํ•˜๊ธธ pandas๋Š” ๋ฐ์ดํ„ฐ๋ฅผ ๋‹ค๋ฃจ๊ณ  ๋ถ„์„ํ•˜๊ธฐ ์œ„ํ•œ python library๋‹ค. ํŠนํžˆ ์ˆซ์ž ๊ฐ’์œผ๋กœ ์ด๋ฃจ์–ด์ง„ ํ‘œ๋‚˜ ์‹œ๊ณ„์—ด ์ž๋ฃŒ์˜ ๋ฐ์ดํ„ฐ ๊ตฌ์กฐ ๋ฐ ์šด์˜์„ ๋‹ค๋ฃจ๋Š” ๊ฒƒ์„ ๋•๋Š”๋‹ค.

In computer programming, pandas is a software library written for the Python programming language for data manipulation and analysis.

In particular, it offers data structures and operations for manipulating numerical tables and time series. It is free software released under the three-clause BSD license.

Dataframes

pandas๋Š” ์ฃผ๋กœ ๋ฐ์ดํ„ฐ ๋ถ„์„์„ ์œ„ํ•ด ํ™œ์šฉ๋œ๋‹ค. csv, json, sql, exel ๋“ฑ ๋‹ค์–‘ํ•œ ํŒŒ์ผ ํ˜•ํƒœ๋ฅผ importํ•  ์ˆ˜ ์žˆ๋‹ค.
pandas๋ฅผ ํ†ตํ•ด ๋ฐ์ดํ„ฐ๋ฅผ ํ•ฉ์น˜๊ฑฐ๋‚˜ ๋ชจ์–‘์„ ๋ฐ”๊พธ๊ณ , ์„ ํƒํ•˜๊ณ , ๊นจ๋—ํ•˜๊ฒŒ ๋งŒ๋“ค๊ณ , ๋žญ๊ธ€๋งํ•  ์ˆ˜ ์žˆ๋‹ค.

  • ๋ฐ์ดํ„ฐ ๋žญ๊ธ€๋ง: ํ˜น์€ ๋ฐ์ดํ„ฐ ๋จผ์ง•์€ ์›์ž๋ฃŒ๋ฅผ ๋˜๋‹ค๋ฅธ ํ˜•ํƒœ๋กœ ์ˆ˜์ž‘์—…์œผ๋กœ ์ „ํ™˜ํ•˜๊ฑฐ๋‚˜ ๋งคํ•‘ํ•˜๋Š” ๊ณผ์ •์ด๋‹ค. ์ด๋ฅผ ํ†ตํ•ด์„œ ๋ฐ˜์ž๋™ํ™” ๋„๊ตฌ์˜ ๋„์›€์œผ๋กœ ๋ฐ์ดํ„ฐ๋ฅผ ์ข€๋” ํŽธ๋ฆฌํ•˜๊ฒŒ ์†Œ๋น„ํ•œ๋‹ค.

Pandas is mainly used for data analysis. Pandas allows importing data from various file formats such as comma-separated values, JSON, SQL, Microsoft Excel.

Pandas allows various data manipulation operations such as merging, reshaping, selecting, as well as data cleaning, and data wrangling features.

profile
์ด์‚ฌ๊ฐ„ ๋ธ”๋กœ๊ทธ: yenilee.github.io

0๊ฐœ์˜ ๋Œ“๊ธ€