[2012 OSDI] PowerGraph: Distributed Graph-Parallel Computation on Natural Graphs
The key contributions are:
- GAS (gather, apply, scatter) programming model
- Using vertex cuts instead of edge cuts to lay out data for power-law graphs
- Balancing computation & minimizing communication
- 1. Introduction
- 2. Graph-Parallel Abstractions
  - 1. Pregel
  - 2. GraphLab
  - 3. Characterization
- 3. Challenges of Natural Graphs
- 4. PowerGraph Abstraction
  - 1. GAS Vertex-Programs
  - 2. Delta Caching
  - 3. Initiating Future Computation
    - 1. Bulk Synchronous Execution
    - 2. Asynchronous Execution
  - 4. Comparison with GraphLab/Pregel
- 5. Distributed Graph Placement
  - 1. Balanced p-way Vertex-Cut
  - 2. Greedy Vertex-Cuts
- 6. Abstraction Comparison
  - 1. Computation Imbalance
  - 2. Communication Imbalance
  - 3. Runtime Comparison
- 7. Implementation and Evaluation
  - 1. Graph Loading and Placement
  - 2. Synchronous Engine (Sync)
  - 3. Asynchronous Engine (Async)
  - 4. Async. Serializable Engine (Async+S)
  - 5. Fault Tolerance
  - 6. MLDM Applications
- 8. Related Work
- 9. Conclusions and Future Work
- Background 1: Natural Graphs
  - Graphs IRL (e.g., social networks/the web) follow a power-law degree distribution
  - A small subset of the vertices have very high degrees, while most vertices have a small degree
  - Existing graph-parallel frameworks depend on a balanced degree distribution for performance
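To make the skew concrete, here is a small sketch (mine, not from the paper) that samples degrees from a power-law distribution P(d) ∝ d^(-α) with α = 2, roughly the exponent typical of natural graphs, and reports how concentrated the edge endpoints are. The vertex count and degree cap are arbitrary illustrative values.

```python
# Sketch: how skewed a power-law degree distribution is.
# alpha, the vertex count, and the degree cap are illustrative values, not from the paper.
import random

def sample_power_law_degree(alpha=2.0, d_min=1, d_max=10**6):
    """Sample a degree d with P(d) ~ d^(-alpha), via the inverse CDF of a
    truncated continuous power law, rounded down to an integer."""
    u = random.random()
    a = d_min ** (1 - alpha)
    b = d_max ** (1 - alpha)
    return int((a + u * (b - a)) ** (1 / (1 - alpha)))

random.seed(0)
degrees = sorted((sample_power_law_degree() for _ in range(100_000)), reverse=True)
total = sum(degrees)
top_1pct = sum(degrees[: len(degrees) // 100])
print(f"top 1% of vertices account for {100 * top_1pct / total:.1f}% of edge endpoints")
```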

- Challenges of natural graphs
  - Work balancing: Existing graph-parallel frameworks treat vertices symmetrically, and their storage/communication/computation costs are linear in degree
  - Partitioning: Pregel/GraphLab depend on partitioning the graph, which is hard to do well for natural graphs; their fallback, random (hashed) partitioning, cuts most edges
  - Communication/storage: Major bottlenecks at high-degree vertices due to the skewed distribution
  - Computation: Existing frameworks do not parallelize individual vertex-programs, limiting their scalability on skewed graphs
- The GAS (gather, apply, scatter) vertex-program model
  - Gather: Information from adjacent vertices/edges is reduced by a generalized "sum" operation (commutative and associative)
  - Apply: The gathered sum is combined with the current vertex value to produce the new vertex value
  - Scatter: The new value is used to update data on adjacent edges (and can signal neighboring vertices)
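As a concrete illustration, here is a minimal single-machine sketch of a GAS vertex-program for PageRank (the paper's running example). The class and method names are illustrative, not the actual PowerGraph C++ API.

```python
# Minimal single-machine sketch of a GAS vertex-program for PageRank.
# Class/method names are illustrative, not the actual PowerGraph C++ API.

class PageRankProgram:
    def gather(self, nbr_rank, nbr_out_degree):
        # gather: one in-neighbor's contribution
        return nbr_rank / nbr_out_degree

    def sum(self, a, b):
        # the commutative, associative "sum" that combines gather results
        return a + b

    def apply(self, old_rank, acc, damping=0.85):
        # apply: combine the gathered accumulator with the vertex's own state
        return (1 - damping) + damping * acc

    def scatter(self, old_rank, new_rank, tol=1e-3):
        # scatter: here, decide whether neighbors should be re-activated
        return abs(new_rank - old_rank) > tol

prog = PageRankProgram()
acc = prog.sum(prog.gather(1.0, 4), prog.gather(0.5, 2))   # two in-neighbors
new_rank = prog.apply(1.0, acc)
print(new_rank, prog.scatter(1.0, new_rank))
```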
- Edge-Cuts
  - Every vertex is placed on a machine, and edges may span machines
  - If adjacent vertices are on different machines, "ghost" vertices are used -> changes must be synchronized to the ghosts
  - In natural graphs, many edges span machines; balanced edge-cut algorithms perform poorly, so GraphLab and Pregel fall back to randomized placement (bad)
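A quick toy estimate (mine, with arbitrary numbers) of why random placement under an edge-cut hurts: hashing vertices to p machines cuts roughly a (1 - 1/p) fraction of the edges, and each cut edge then needs a ghost and synchronization.

```python
# Toy sketch: random (hashed) vertex placement under an edge-cut.
# Graph size and machine counts are arbitrary; expected cut fraction is 1 - 1/p.
import random

def cut_fraction(edges, num_machines):
    place = lambda v: v % num_machines            # stand-in for hashing vertices to machines
    cut = sum(1 for u, v in edges if place(u) != place(v))
    return cut / len(edges)

random.seed(1)
edges = [(random.randrange(10_000), random.randrange(10_000)) for _ in range(50_000)]
for p in (2, 8, 64):
    print(f"p={p:2d}: {cut_fraction(edges, p):.1%} of edges cut "
          f"(each needs a ghost; expected ~{1 - 1 / p:.1%})")
```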
- Vertex-Cuts
  - Every edge is placed on a machine, and vertices may span machines
  - Intuition: The distribution of vertex degree is highly skewed, but the number of vertices adjacent to a given edge is constant (always 2)
  - Each vertex is replicated ("mirrors") across the machines where its adjacent edges lie
  - This results in a better balance for natural graphs
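Below is a simplified sketch in the spirit of the paper's greedy vertex-cut heuristic: edges are streamed one at a time and placed on a machine that already holds mirrors of both endpoints if possible, otherwise of either endpoint, otherwise the least-loaded machine. The paper's tie-breaking on remaining degree is omitted, and the graph and machine count are toy values, so the printed replication factor is only illustrative.

```python
# Simplified sketch of a greedy vertex-cut: stream edges, prefer machines that already
# hold mirrors of both endpoints, then of either endpoint, then the least-loaded machine.
# Omits the paper's tie-breaking on remaining degree; graph and machine count are toy values.
import random
from collections import defaultdict

def greedy_vertex_cut(edges, p):
    load = [0] * p                         # number of edges assigned to each machine
    mirrors = defaultdict(set)             # vertex -> machines holding one of its mirrors
    for u, v in edges:
        both = mirrors[u] & mirrors[v]
        either = mirrors[u] | mirrors[v]
        candidates = both or either or set(range(p))
        m = min(candidates, key=lambda i: load[i])   # least-loaded admissible machine
        load[m] += 1
        mirrors[u].add(m)
        mirrors[v].add(m)
    # replication factor = average number of mirrors per vertex
    return sum(len(s) for s in mirrors.values()) / len(mirrors)

random.seed(2)
edges = [(random.randrange(5_000), random.randrange(5_000)) for _ in range(40_000)]
print("replication factor on 16 machines:", round(greedy_vertex_cut(edges, 16), 2))
```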


- Delta caching
  - Each vertex caches its gathered accumulator value; scatter can return a delta that is applied directly to the neighboring vertex's cached accumulator, avoiding a full re-gather
  - If no delta is returned, the neighbor's cached accumulator is invalidated and must be recomputed by a full gather
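A rough sketch of the bookkeeping behind delta caching, assuming a PageRank-style accumulator; the function names and cache layout are mine, not the PowerGraph API.

```python
# Rough sketch of the bookkeeping behind delta caching (PageRank-style accumulator).
# Function names and the cache layout are illustrative, not the PowerGraph API.

cache = {}   # vertex -> cached accumulator, or None meaning "stale, must re-gather"

def gather_full(v, in_neighbors, rank, out_degree):
    # the expensive path: full gather over all in-neighbors
    acc = sum(rank[u] / out_degree[u] for u in in_neighbors[v])
    cache[v] = acc
    return acc

def on_scatter(dst, delta):
    """Called when an in-neighbor of dst runs scatter. If scatter returned a
    numeric delta, patch dst's cached accumulator; otherwise invalidate it."""
    if delta is not None and cache.get(dst) is not None:
        cache[dst] += delta              # cheap path: patch the cache in place
    else:
        cache[dst] = None                # next activation of dst falls back to gather_full

# toy usage on a 2-vertex chain: 0 -> 1
rank, out_degree, in_neighbors = {0: 1.0, 1: 1.0}, {0: 1, 1: 1}, {1: [0]}
gather_full(1, in_neighbors, rank, out_degree)   # cache[1] == 1.0
on_scatter(1, 0.1)                               # vertex 0's scatter reports a +0.1 delta
print(cache[1])                                  # 1.1, no re-gather needed
```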
- Execution model: Sync vs. Async
  - Sync (bulk synchronous)
    - 3 "minor-steps": Gather for all active vertices -> Apply -> Scatter
    - Barrier after each minor-step; changes are committed at the end of each minor-step and visible in the next
  - Async (asynchronous)
    - Changes are immediately available to other vertices
    - Execute active vertices as cores become available
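To tie the pieces together, here is a single-machine toy of the Sync execution model for PageRank: the three minor-steps run over the whole active set, and new vertex values are committed only after apply, so gather always reads the previous superstep's values. Real PowerGraph distributes this with barriers between minor-steps; everything below is an illustrative simplification.

```python
# Toy single-machine version of the Sync (bulk synchronous) GAS execution model.
# The barriers between minor-steps are implicit in the sequential loop.

def sync_pagerank(out_edges, damping=0.85, tol=1e-3, max_supersteps=50):
    in_edges = {v: [] for v in out_edges}
    for u, nbrs in out_edges.items():
        for v in nbrs:
            in_edges[v].append(u)
    rank = {v: 1.0 for v in out_edges}
    active = set(out_edges)
    for _ in range(max_supersteps):
        if not active:
            break
        # minor-step 1: gather (reads only ranks committed in the previous superstep)
        acc = {v: sum(rank[u] / len(out_edges[u]) for u in in_edges[v]) for v in active}
        # minor-step 2: apply (new ranks are computed, then committed together)
        new_rank = {v: (1 - damping) + damping * acc[v] for v in active}
        # minor-step 3: scatter (re-activate out-neighbors of vertices that changed enough)
        next_active = set()
        for v in active:
            if abs(new_rank[v] - rank[v]) > tol:
                next_active.update(out_edges[v])
        rank.update(new_rank)        # commit: visible in the next superstep
        active = next_active
    return rank

toy = {0: [1, 2], 1: [2], 2: [0], 3: [0, 2]}
print(sync_pagerank(toy))
```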


