Site icon EdwardsLab

Project Update: Multi-threading or Cluster Computing?

Recently, I’ve been faced with a problem where I feel my metagenome comparator program is running too slow. The main reason behind it is that it’s performing operations that occur multiple times in a loop. These operations involve different tasks such as: reading lines from text, creating objects, inserting those objects into a data structure, retrieving those objects from the data structure, and writing the data structures to disk (just to name a few). So it would be natural to suggest to someone in my position to parallelize it all, and that’s exactly what I want to do. However, I’ve never written any type of parallel applications, and thus, I need to do a little bit of learning and researching into parallel programming. (More of my ramblings after the Read More break)

What I Know

What I Need

I am hoping to make a choice soon so I can get this rolling. I need to look into running jobs in our cluster and find out the peculiarities that come along with it. Multi-threading looks to be straightforward with only adjusting the necessary methods to accommodate the limited amount of memory available in Octopussy.

Exit mobile version