Juyeong Shin

Juyeong Shin

KHU CSE 19
KHU DKE Lab.

GraphSAGE

July 3, 2023 2 분 소요

Inductive Representation Learning on Large Graphs, NIPS 2017

거대 그래프를 위한 Inductive 학습 기법

저자

Wiliam L. Hamilton
Rex Ying
Jure Leskovec

0. Preliminary Knowledge

1. Graph??

그래프 데이터란?

2. Transductive Learning vs Inductive Learning

Transductive Learning vs Inductive Learning

1. Introduction

1. 기존 고차원 노드 피쳐 벡터 임베딩 기법

머신러닝 기반 임베딩 방법을 주로 사용
이때, 사용되는 데이터는 오직 노드 피쳐 벡터
- 머신러닝 기법: PCA, NMA, t-SNE
위와 같은 기법으로 임베딩 된 피쳐는 Node Classification, Clustering, Link Prediction 등에 사용
혹은, ChebNet, GCN과 같은 Transductive 딥러닝 기법을 사용
- Transductive 성질로 인해, 실세계에 적용시키기 어려운 한계점을 지님
  - 미니배치 학습 불가
  - 온라인 학습 불가
  - 분산 학습 불가
  - 시간 효율성 떨어짐
  - 메모리 부족 문제

2. 제안하는 Inductive Learning Method 장점

미니배치 학습 가능
온라인 학습 및 추론 가능
- 실세계 거대 그래프에서 응용될 수 있음

3. GraphSAGE 특징

노드 임베딩을 구하기 위해 사용되는 범용적 framework
Inductive Learning Model은 Neighborhood Sampling과 Aggregation 과정을 통해 노드 임베딩을 구함
Aggregation은 이웃 노드 피쳐와 최종 레이어의 임베딩 결과를 도출
최종 임베딩 결과는 NN(Neural Network, 신경망) Model Parameter Update에 사용
비지도학습, 지도학습 모두 가능

2. 관련 연구

1. Factorization-based embedding approaches

low dimensional embeddings using random walk
- baseline algorithm: PageRank Algorithm
- Deepwalk: Online learning of social representations. In KDD, 2014.
- node2vec: Scalable feature learning for networks. In KDD, 2016.
matrix factorization-based learning objectives
- baseline algorithm: Planetoid-$I$
- Line: Large-scale information network embedding. In WWW, 2015.
- Structural deep network embedding. In KDD, 2016.

2. Supervised learning over graphs

supervised learning over graph-structured data
- Discriminative embeddings of latent variable models for structured data. In ICML, 2016.
- A new model for learning in graph domains. In IEEE International Joint Conference on Neural Networks, volume 2, pages 729–734, 2005.
- Gated graph sequence neural networks. In ICLR, 2015.
- The graph neural network model. IEEE Transactions on Neural Networks, 20(1):61–80, 2009.

3. Spectral Graph Convolutional Networks

Spectral Method Based Graph Convolutional Networks
- Spectral Networks and Locally Connected Networks on Graphs
- Convolutional Neural Networks on Graphs with Fast Localized Spectral Filtering
- Semi-Supervised Classification with Graph Convolutional Networks

3. 제안 기법: GraphSAGE

핵심 아이디어
- 이웃 노드들을 무작위 추출 (샘플링)
- 임베딩을 구하고자 하는 노드의 지엽적 이웃의 피쳐 정보를 Aggregate하여 NN 모델 학습
- GD (Gradient Descent)만 가능한 기존 Spectral Method 기반 GCN과 달리, SGD (Stochastic Gradient Descent)가 가능

1. 임베딩 생성 알고리즘 (GraphSAGE Forward Propagation Algorithm)

Notations

Graph: $G(V, E)$
input features: $x_v$
depth: $K$
neighborhood: $N:v\to{2^v}$
$K$ aggregator functions: $AGGREGATE_k, \forall{k}\in{1, …, K}$
set of weight matrices: $W^k, \forall{k}\in{1, …, K}$
- used to propagate information between different layers of the model or “search depths”
non-linearity activation: $\sigma$

GraphSAGE Forward Propagation Algorithm

3. 실험

4. 결론

Graph Inductive Learning Method 개발
Speed, Scalability

5. 참고문헌

https://proceedings.neurips.cc/paper_files/paper/2017/hash/5dd9db5e033da9c6fb5ba83c7a7ebea9-Abstract.html

공유하기

Twitter Facebook LinkedIn

댓글남기기

참고

✨️Going Beyond Local: Global Graph-Enhanced Personalized News Recommendation

July 24, 2025 12 분 소요

✨️Going Beyond Local: Global Graph-Enhanced Personalized News Recommendation 논문 정보 Boming Yang, Dairui Liu, Toyotaro Suzumura, Ruihai Dong, Irene Li Rec...

LLM, RAG, KG, RecSys 트렌드 리뷰

July 20, 2025 11 분 소요

LLM, RAG, KG, RecSys 트렌드 리뷰 LLM: Large Language Model https://ko.wikipedia.org/wiki/대형_언어_모델 RAG: Retrieval-Augmented Generation ...

사고 실험보단 컴퓨터공학, 세상의 이치로 풀어써본 P, NP, NP-난해, NP-완전

June 2, 2025 4 분 소요

사고 실험보단 컴퓨터공학, 세상의 이치로 풀어써본 P, NP, NP-난해, NP-완전 P-NP 문제 컴퓨터 과학, 수학계의 최종 보스인 밀레니엄 문제 중 하나 P 집합과 NP 집합이 같은지 다른지를 증명해야 하는 문제 P 집합은 이미(당연히) NP 집합의 부분집합 ...

싱글 머신에서의 K8s 기반 DistDGL 분산 학습 환경 구축 - 5편, DistDGL 분산 학습

February 20, 2025 2 분 소요

K8s 클러스터에서의 DistDGL 분산학습 1. 단일 컨테이너에서의 DistDGL 학습 1-1. WSL2 환경 구축 WSL2 Ubuntu 24.04 LTS 환경을 기반으로 단일 노드에서 작동하는 DistDGL dockerfile을 작성할 것이다 관련 문서는 아래 홈페이지...