[SK쉴더스 루키즈 24기] 머신러닝 기초(6) 모델 저장 및 로드와 Colab

아굥·2025년 1월 2일

SK shieldus Rookies machine learning python

SK Shieldus Rookies

목록 보기

13/32

모델 파일로 저장과 활용

1. joblib

큰 NumPy 배열을 포함하는 객체를 저장하고 로드하는데 효율적

데이터 저장

import joblib
from sklearn.linear_model import LinearRegression
from sklearn.datasets import make_regression

# 샘플 데이터 생성 및 학습
X, y = make_regression(n_samples=100, n_features=1, noise=0.1)
model = LinearRegression()
model.fit(X, y)

# 파일 저장
joblib.dump(model, 'linear_model.pkl', compress=3)
print("Model saved successfully.")

데이터 로드

import joblib
from sklearn.datasets import make_regression

X, y = make_regression(n_samples=100, n_features=1, noise=0.1)

# 모델 로드
loaded_model = joblib.load('linear_model.pkl')
print("Model loaded successfully.")

# 모델 사용
y_pred = loaded_model.predict(X)
print(y_pred)

2. pickle

Python 객체를 직렬화/역직렬화 하는데 사용

데이터 저장

import pickle
from sklearn.linear_model import LinearRegression
from sklearn.datasets import make_regression

# 샘플 데이터 생성 및 학습
X, y = make_regression(n_samples=100, n_features=1, noise=0.1)
model = LinearRegression()
model.fit(X, y)

# 모델 저장
with open('linear_model2.pkl', 'wb') as f:		# 'wb': 쓰기 모드로 저장
    pickle.dump(model, f)

print("Model saved successfully.")

데이터 로드

import pickle
from sklearn.datasets import make_regression

X, y = make_regression(n_samples=100, n_features=1, noise=0.1)

# 모델 로드
with open('linear_model2.pkl', 'rb') as f:
    loaded_model = pickle.load(f)
print("Model loaded successfully.")

# 모델 사용
y_pred = loaded_model.predict(X)
print(y_pred)

3. 딥러닝 모델

데이터 저장

import numpy as np
from tensorflow.keras.models import Sequential
from tensorflow.keras.layers import Dense
from tensorflow.keras.utils import to_categorical

# 데이터 생성
np.random.seed(42)
X = np.random.rand(100, 10)
y = np.random.randint(0, 2, 100)
y = to_categorical(y)

# 모델 생성 및 학습
model = Sequential([
    Dense(32, activation='relu', input_shape=(10,)),
    Dense(2, activation='softmax')
])
model.compile(optimizer='adam', loss='categorical_crossentropy', metrics=['accuracy'])
model.fit(X, y, epochs=5, batch_size=10, verbose=0)

# 모델 저장 (.keras 형식)
model.save('classification_model.keras')
print("Model saved successfully in .keras format.")

.keras 형식: Tensorflow/Keras에서 제공하는 새로운 표준

데이터 로드

import numpy as np
from tensorflow.keras.models import load_model

# 모델 로드
loaded_model = load_model('classification_model.keras')
print('Model loaded successfully.')

# 새로운 데이터에 대한 예측
sample_data = np.random.rand(1, 10)
prediction = loaded_model.predict(sample_data)
print(f'Prediction for {sample_data}: {prediction}')

Colab

Google Cloud 주피터(Jupyer) 노트북 기반으로 머신러닝 교육 및 연구에 널리 사용되는 툴

1. 특징

브라우저를 통해 무료로 코드 작성 및 실행
별도의 설정 X
클라우드에서 완전히 실행되는 무료 주피터(Jupyter) 노트북 환경
기본적인 머신러닝 라이브러리가 설치되어 있음(Pandas, NumPy, Tensorflow, Sklearn ...)
Free Computation Power: 코드는 전용 가상머신에서 실행, GPU 무료 사용
Easy Sharing and Collaboration: 작성한 코드를 Google 드라이브를 통해 저장 및 공유 가능
Integrated with GitHub: Colab 환경의 GitHub에서 노트북을 쉽게 저장 및 로드 가능

2. Google Colab 실행

https://colab.research.google.com

Google 계정 필요

아굥

열심히 살아보아요

이전 포스트

[SK쉴더스 루키즈 24기] 머신러닝 기초(5) CNN, DCNN, RNN

다음 포스트

[SK쉴더스 루키즈 24기] 머신러닝 기초(6) 모델 저장 및 로드와 Colab

SK Shieldus Rookies

모델 파일로 저장과 활용

1. joblib

데이터 저장

데이터 로드

2. pickle

데이터 저장

데이터 로드

3. 딥러닝 모델

데이터 저장

데이터 로드

Colab

1. 특징

2. Google Colab 실행

[SK쉴더스 루키즈 24기] 머신러닝 기초(5) CNN, DCNN, RNN

[SK쉴더스 루키즈 24기] 모듈 프로젝트1

0개의 댓글