Skip to content

Latest commit

 

History

History
74 lines (48 loc) · 3 KB

README.md

File metadata and controls

74 lines (48 loc) · 3 KB

Parallel Graph

Statement

Implement a generic thread pool, then use it to traverse a graph and compute the sum of the elements contained by the nodes. You will be provided with a serial implementation of the graph traversal and with most of the data structures needed to implement the thread pool. Your job is to write the thread pool routines and then use the thread pool to traverse the graph.

Support Code

The support code consists of the directories:

  • src/ is the skeleton parallel graph implementation.

  • utils/ utility files (used for debugging & logging)

Implementation

Thread Pool Description

A thread pool contains a given number of active threads that simply wait to be given specific tasks. The threads are created when the thread pool is created. Each thread continuously polls the task queue for available tasks. Once tasks are put in the task queue, the threads poll tasks, and start running them. A thread pool creates N threads upon its creation and does not destroy (join) them throughout its lifetime. That way, the penalty of creating and destroying threads ad-hoc is avoided.

  • enqueue_task(): Enqueue task to the shared task queue.
  • dequeue_task(): Dequeue task from the shared task queue.
  • wait_for_completion(): Wait for all worker threads.
  • create_threadpool(): Create a new thread pool.
  • destroy_threadpool(): Destroy a thread pool.

Graph Traversal

Once you have implemented the thread pool, you need to test it by doing a parallel traversal of all connected nodes in a graph. A serial implementation for this algorithm is provided in src/serial.c. To make use of the thread pool, you will need to create tasks that will be put in the task queue. A task consists of 2 steps:

  1. Add the current node value to the overall sum.
  2. Create tasks and add them to the task queue for the neighbouring nodes.

Implement this in the src/parallel.c (see the TODO items). You must implement the parallel and synchronized version of the process_node() function, also used in the serial implementation.

Synchronization

For synchronization I used mutexes, and condition variables.

Input Files

Reading the graphs from the input files is being taken care of the functions implemented in src/os_graph.c. A graph is represented in input files as follows:

  • First line contains 2 integers N and M: N - number of nodes, M - numbed or edges
  • Second line contains N integer numbers - the values of the nodes.
  • The next M lines contain each 2 integers that represent the source and the destination of an edge.

Data Structures

Graph

A graph is represented internally by the os_graph_t structure (see src/os_graph.h).

List

A list is represented internally by the os_queue_t structure (see src/os_list.h). You will use this list to implement the task queue.

Thread Pool

A thread pool is represented internally by the os_threadpool_t structure (see src/os_threadpool.h). The thread pool contains information about the task queue and the threads.