Property graphs are an intuitive way to model, analyze and visualize complex relationships among heterogeneous data objects, for example, as they occur in social, biological and information networks. These graphs typically contain thousands or millions of vertices and edges and their entire representation can easily overwhelm an analyst.
GRADOOP is an open-source system for graph analytics that enables handling of such graphs. GRADOOP is implemented on top of Apache Flink, a state-of-the-art distributed dataflow framework, and thus allows us to scale graph analytical programs across multiple machines.
In this application we demonstrate two GRADOOP operators: Graph Grouping and Cypher-based Graph Pattern Matching. A user can choose between different input graphs and adjust the operator parameters. The computation is executed on the user machine by automatically starting an Apache Flink cluster.
One way to reduce complexity of a large property graph is the grouping of vertices and edges to summary graphs. We developed an algorithm for graph grouping with support for attribute aggregation and structural summarization by user-defined vertex and edge properties. The operation is very similar to a GROUP BY operation known from relational databases but in addition summarizes the graph structure according to the computed vertex and edge groups.
Graph pattern matching is an important and challenging operation on graph data. Typical use cases are related to graph analytics. Since analysts are often non-programmers, a graph system will only gain acceptance, if there is a comprehensible way to declare pattern matching queries. However, respective query languages are currently only supported by graph databases but not by distributed graph processing systems. To enable pattern matching on a large scale, we implemented core features of the declarative graph query language Cypher within GRADOOP.
- Clone the repo
$ mvn clean install
$ mvn exec:java -Dexec.mainClass="org.gradoop.demo.server.Server"
- Navigate to
http://localhost:2342/gradoop/html/grouping.html
orhttp://localhost:2342/gradoop/html/cypher.html
- Select data set from drop down list
- Enjoy
- Create a CSV graph using Gradoop (see gradoop-examples)
- Copy CSV graph to
src/main/resources/data
- Add it to the input graph drop-down of the html files
- Restart the server