For many users, when starting a visualization project only two things are clear: They have an idea in mind - some message or story to be told - and they expect some meaningful result, an image, for sharing or publication. In between there are some more steps, though, that often remain somewhat fuzzy at first:
You need to get our hands on some suitable data, you need transform it into the required shape, and you need pick a proper visualization. On a first glance, this still sounds easy enough. But when we're looking into each of those steps, things are not quite as simple.
In order to get the data, often you need to find and access it first. and when you have the file in hand, you need to make sense of its content, its abbreviations etc.
Next, you need to transform it into something that you can actually use. This can mean to combine some datasets, or adding and removing some data.
Once this is done, you can to create the visualization to get your message across. you need to pick one type and tell your system which piece of your data goes where in the visualization.
Finally, you're done and can export the result. But also here, you need to keep a few things in mind: in particular, provenance and reproducibilty: You may want to be able to document what you did and you might want to do it again later.
You might need to go back a step or two and redo stuff, but overall, this is a pretty stereotypical workflow for static visualizations. For dynamic ones, there would be one more step adding some interaction. But for this thesis, we'll stick to just static visualizations.
While the workflow seems pretty straight forward, we'll now look at some of the issues that might happen along the way.