D3js force-directed graph on cloud file-sharing

We demo a graph of cloud file-sharing events in an organization over 1 day.

In our setting, we have two kinds of entities: users + files, and one kind of relationship: file sharing.

The idea can be generalized to other entities and/or relationships.

The original raw-data comes from a real cloud-activity log.

All data is anonymized. No actual user/file names are exposed.

Interactive Demo (can't embed, so not on github)

Demo source code:

Graphical elements of file sharing

Blue circle nodes represent files
Black circle nodes represent users
Red circles represent users outside the organization (email-id in another domain)
Edges (links between nodes) are always between a user and a (shared) file
Each user/file has a unique id and appears exactly once in the chart
The entities (file, user) are what the forces act on
The relations (links) between the entities (nodes) may have additional metadata describing the sharing event, e.g the cloud-application used for sharing
A red & thick edge represents sharing via an unusual/rare application

Interacting with the graph

Hover over circles to see the (anonymized) user/file ids
Click and drag some circle; the graph self-organizes again
Try to untangle some knots
Watch how the graph reacts when you release your mouse button
Click & drag an isolated small sub-graph over a long distance
Release & watch it settle in a new niche

What exactly is a force-directed graph?

The forces that make this graph self-organize are somewhat like the negative and positive charges in polar molecules. E.g. in a water (H2O) molecule, there's a 104.5 ° (less than 180 °) angle between the two H-0-H covalent bonds. The asymmetry makes each molecule a small dipole.

Opposite charges (a user & a file) attract each other
Similar charges (two files, or two users) repel each other
A global force: gravity pulls all objects towards the center of the canvas
A link-length (stand-off) parameter keeps linked objects from merging/colliding
A time-decay parameter gradually dampens forces as the graph settles

The attraction & repulsion forces cause the chart to self-organize into a minimum potential-energy state.

Whenever disturbed, the time-decay parameter is reinitialized to a high value, the forces increase, and the graph gradually settles into a new state of equilibrium as the time-decay parameter re-converges towards zero.

The settling into equilibrium is very similar to a typical machine learning training process, a gradient descent algo, iteratively minimizing some loss/error function until convergence.

The algorithm used to go from the initial state to equilibrium/stability is called Verlet integration and is described as a "numerical method used to integrate Newton's equations of motion."

Why is a force-directed graph valuable?

The biggest value of a force-directed visualization is that it creates a meaningful global holistic view from many small & unrelated local details.

There’s no model we need to build based on some pre-conceived structure. The whole is a natural and direct result of its small parts.

IOW: the data computes itself into a picture that makes all relationships very clear to the observer.

Another way to look at this method is that it reverses the traditional roles of client and server in ML. Normally, we train a machine-learning model on the server, and present the final results to the client. Here we’re doing the opposite: we have pure client-side (JavaScript in the browser) machine-learning. The computation heavy-lifting is off-loaded to the observer CPU.

A picture is worth a 1000 words

So the saying goes, but a self-organizing picture, which creates itself, and with which you can interact, may bring even more insight and understanding.

The graph tells many stories:

Some users share much more than others
Some small cliques of users as islands in the big picture
Some users share files only one person
Some teams are formed around single shared files, others have large shared file-sets.
Sub-graphs may represent sub-departments (or users who like to share with like minded colleagues)
Red circles may hint of a potential ex-filtration of data
Red links represent abnormal (non-sanctioned) sharing cloud apps
There's one isolated file (File 499), this is a data issue. In a picture, data-issues are often much easier to detect than by looking at the raw data. See also: A picture is worth a 1000 words.

Random notes

Graph object can be annotated to include more information in the (hover) tool-tips: for examples we can add date and application related to each file sharing, we can annotate the links as well.
Force related constants which affect the speed of settling into equilibrium (time-decay or theta), the global gravity force, the link stand-off lengths, the friction or "stiffness" of the chart, and others, can be configured in the code.
Without any sharing, all users are unrelated to each other and will float randomly in space. When two users share files among themselves, they become globally attracted to these files and end-up closer to each other. The more files are shared between two users, the harder it becomes to pull them apart.
The demo uses an old version (v3) of D3js, that is incompatible with the current (v7) version.

Demo in a different domain: Beatles songs by year

Beatles songs by year interactive demo (can't embed, so not on github)

Credits

Mike Bostock, observablehq.com and all contributors to the D3js framework.

I started from some of the Force directed graph published examples. Used a different dataset; separated the raw data from the code, and customized some parameters for better effect.

Name		Name	Last commit message	Last commit date
Latest commit History 72 Commits
.gitignore		.gitignore
README.md		README.md
d3.v3.min.js		d3.v3.min.js
datobj.json		datobj.json
force-directed-graph.js		force-directed-graph.js
index.html		index.html

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

D3js force-directed graph on cloud file-sharing

Demo source code:

Graphical elements of file sharing

Interacting with the graph

What exactly is a force-directed graph?

Why is a force-directed graph valuable?

A picture is worth a 1000 words

The graph tells many stories:

Random notes

Demo in a different domain: Beatles songs by year

Credits

About

Releases

Packages

Languages

arielf/fileshare-d3js

Folders and files

Latest commit

History

Repository files navigation

D3js force-directed graph on cloud file-sharing

Demo source code:

Graphical elements of file sharing

Interacting with the graph

What exactly is a force-directed graph?

Why is a force-directed graph valuable?

A picture is worth a 1000 words

The graph tells many stories:

Random notes

Demo in a different domain: Beatles songs by year

Credits

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages