[DL] MNIST with CNN

융·2023년 6월 28일

[Machine Learning & Deep Learning]

목록 보기

10/16

데이터 가져오기

import tensorflow as tf

mnist = tf.keras.datasets.mnist

(x_train, y_train), (x_test, y_test) = mnist.load_data()
x_train, x_test = x_train/255, x_test/255 # 픽셀의 최대값이 255이므로 255로 나눠서 스케일링

데이터 전처리

데이터를 28x28 크기의 2차원 이미지 형태에서 28x28x1 크기의 3차원 이미지 형태로 변환

60000 : 데이터의 개수를 나타내는 차원
28, 28 : 이미지의 높이와 너비를 나타낸다.
1 : 이미지의 채널 수, 흑백 이미지의 경우 채널 수가 1이다 (컬러는 3)

X_train = X_train.reshape((60000, 28, 28, 1))
X_test = X_test.reshape((10000, 28, 28, 1))

모델 생성

from tensorflow.keras import layers, models

model = models.Sequential([
    layers.Conv2D(32, kernel_size=(5,5), strides=(1,1), padding = 'same', activation = 'relu', input_shape=(28,28,1)),
    layers.MaxPool2D(pool_size=(2,2), strides=(2,2)),
    layers.Conv2D(64, (2,2), activation = 'relu', padding = 'same'),
    layers.MaxPool2D(pool_size=(2,2), strides=(2,2)),
    layers.Dropout(0.25),
    layers.Flatten(),
    layers.Dense(1000, activation = 'relu'),
    layers.Dense(10, activation = 'softmax')
    ])

Total params: 3,156,098

Model: "sequential"
_________________________________________________________________
 Layer (type)                Output Shape              Param #   
=================================================================
 conv2d (Conv2D)             (None, 28, 28, 32)        832       
                                                                 
 max_pooling2d (MaxPooling2D  (None, 14, 14, 32)       0         
 )                                                               
                                                                 
 conv2d_1 (Conv2D)           (None, 14, 14, 64)        8256      
                                                                 
 max_pooling2d_1 (MaxPooling  (None, 7, 7, 64)         0         
 2D)                                                             
                                                                 
 dropout (Dropout)           (None, 7, 7, 64)          0         
                                                                 
 flatten (Flatten)           (None, 3136)              0         
                                                                 
 dense (Dense)               (None, 1000)              3137000   
                                                                 
 dense_1 (Dense)             (None, 10)                10010     
                                                                 
=================================================================
Total params: 3,156,098
Trainable params: 3,156,098
Non-trainable params: 0

optimizer, loss 설정

model.compile(optimizer = 'adam', loss = 'sparse_categorical_crossentropy', metrics=['accuracy'])

모델 훈련

hist = model.fit(X_train, y_train, epochs=5, verbose=1, validation_data=(X_test, y_test))

성능 확인

score = model.evaluate(X_test, y_test)
score

loss: 0.0297 - accuracy: 0.9906

[0.02974139340221882, 0.9905999898910522]

예측 실패 데이터 확인

모델 저장

model.save('MNIST_CNN_model.h5')

융

개발하고싶은사람

이전 포스트

[DL] Convolutional Neural Network

다음 포스트

[DL] MNIST with CNN

[Machine Learning & Deep Learning]

데이터 가져오기

데이터 전처리

모델 생성

optimizer, loss 설정

모델 훈련

성능 확인

예측 실패 데이터 확인

모델 저장

[DL] Convolutional Neural Network

[NLP] BERT (Bidirectional Encoder Representations from Transformers)

0개의 댓글

관련 채용 정보