Dataset


Attribution Graphs

Name Type #nodes #edges #attributes #labels URL
Cora directed 2,708 5,429 1,433 7 [raw]
Citeseer directed 3,327 4,732 3,703 6 [raw]
Pubmed directed 19,717 44,338 500 3 [raw]
BlogCatalog undirected 5,196 343,486 8,189 6 [raw]
PPI undirected 56,944 81,8716 50 121 [raw]
Flickr undirected 7,575 47,9476 12,047 9 [raw]
Reddit undirected 232,965 11,606,919 602 41 [raw]
Ogbn-ArXiv undirected 169,343 1,157,799 128 40 [raw]

These datasets are also available in Pytorch-Geometric. Node attributes can be loaded as a sparse matrix using the following code

from scipy import sparse
features = sparse.load_npz("attrs.npz")

Simple Graphs

Name #nodes #edges #labels Type URL
com-Amazon 334,863 925,872 151,037 undirected [raw]
com-DBLP 317,080 1,049,866 13,477 undirected [raw]
com-Youtube 1,134,890 2,987,624 8,385 undirected [raw]
com-LiveJournal 3,997,962 34,681,189 287,512 undirected [raw]
com-Orkut 3,072,441 117,185,083 6,288,363 undirected [raw]
com-Friendster 65,608,366 1,806,067,135 957,154 undirected [raw]
com-Friendster-small 7,944,949 447,219,610 100 undirected [raw] [raw]
Pokec 1,632,803 30,622,564 - directed [raw]
Twitter 41,652,230 1,468,365,182 - directed [raw]
LiveJournal 4,847,571 68,475,391 - directed [raw]

Dataset Repositories

Name Type Collected by
SNAP Graphs & Networks Stanford
KONECT Graphs & Networks Jérôme Kunegis
Aminer Aminer Academic Networks AMiner
LAW Graphs & Networks UNIMI
Network Repository Graphs & Networks -
UCI Network Data Repository Graphs & Networks UCI Datalab
Open Academic Graph Academic Networks Microsoft
Open Graph Benchmark Graphs & Networks Stanford
UCI Machine Learning Repository Multivariate/Text/Time-Series UCI
Yelp Open Dataset businesses/reviews/user data Yelp
BioSNAP Biomedical Networks Stanford
BioGRID Biomedical Networks Toronto
TianChi Graphs & Networks Tianchi