Regularization

March 31, 2022 최대 1 분 소요

Regularization

Fundamental concept

Normalization is used as a way to prevent over-fitting of the model.
In over-fitting, the error between the execution result and the correct answer data is very small, but the error between the result of executing the model with test data and the correct answer data is very large.

Sample Code

import numpy as np
from sklearn.preprocessing import PolynomialFeatures
from sklearn.linear_model import Ridge
from sklearn.metrics import mean_squared_error

train_size = 20

test_size = 12

train_X = np.random.uniform(low = 0, high = 1.2, size = train_size)
test_X = np.random.uniform(low = 0.1, high = 1.3, size = test_size)

train_y = np.sin(train_X * 2 *np.pi) + np.random.normal(0, 0.2, train_size)
test_y = np.sin(test_X * 2 * np.pi) + np.random.normal(0, 0.2, test_size)

poly = PolynomialFeatures(6)

train_poly_X = poly.fit_transform(train_X.reshape(train_size, 1))
test_poly_X = poly.fit_transform(test_X.reshape(test_size, 1))

model = Ridge(alpha = 1.0)

model.fit(train_poly_X, train_y)

train_pred_y = model.predict(train_poly_X)
test_pred_y = model.predict(test_poly_X)

print(mean_squared_error(train_pred_y, train_y))
print(mean_squared_error(test_pred_y, test_y))

0.14826465725405447
0.40761309030001885

참고문헌

秋庭伸也 et al. 머신러닝 도감 : 그림으로 공부하는 머신러닝 알고리즘 17 / 아키바 신야, 스기야마 아세이, 데라다 마나부 [공] 지음 ; 이중민 옮김, 2019.

Twitter Facebook LinkedIn

✨️Going Beyond Local: Global Graph-Enhanced Personalized News Recommendation

July 24, 2025 12 분 소요

✨️Going Beyond Local: Global Graph-Enhanced Personalized News Recommendation 논문 정보 Boming Yang, Dairui Liu, Toyotaro Suzumura, Ruihai Dong, Irene Li Rec...

LLM, RAG, KG, RecSys 트렌드 리뷰

July 20, 2025 11 분 소요

LLM, RAG, KG, RecSys 트렌드 리뷰 LLM: Large Language Model https://ko.wikipedia.org/wiki/대형_언어_모델 RAG: Retrieval-Augmented Generation ...

사고 실험보단 컴퓨터공학, 세상의 이치로 풀어써본 P, NP, NP-난해, NP-완전

June 2, 2025 4 분 소요

사고 실험보단 컴퓨터공학, 세상의 이치로 풀어써본 P, NP, NP-난해, NP-완전 P-NP 문제 컴퓨터 과학, 수학계의 최종 보스인 밀레니엄 문제 중 하나 P 집합과 NP 집합이 같은지 다른지를 증명해야 하는 문제 P 집합은 이미(당연히) NP 집합의 부분집합 ...

싱글 머신에서의 K8s 기반 DistDGL 분산 학습 환경 구축 - 5편, DistDGL 분산 학습

February 20, 2025 2 분 소요

K8s 클러스터에서의 DistDGL 분산학습 1. 단일 컨테이너에서의 DistDGL 학습 1-1. WSL2 환경 구축 WSL2 Ubuntu 24.04 LTS 환경을 기반으로 단일 노드에서 작동하는 DistDGL dockerfile을 작성할 것이다 관련 문서는 아래 홈페이지...

Juyeong Shin

Regularization

Regularization

Fundamental concept

Sample Code

참고문헌

공유하기

댓글남기기

참고

✨️Going Beyond Local: Global Graph-Enhanced Personalized News Recommendation

LLM, RAG, KG, RecSys 트렌드 리뷰

사고 실험보단 컴퓨터공학, 세상의 이치로 풀어써본 P, NP, NP-난해, NP-완전

싱글 머신에서의 K8s 기반 DistDGL 분산 학습 환경 구축 - 5편, DistDGL 분산 학습