I've now written several posts on data organization and record keeping, and I realized I still haven't managed to say what I wanted to say. What I really wanted to say is:
Wow, data organization and record keeping is really important for scientists. You'd think with something so important, we would all do it really well, but actually, I think most of us, including me, could do it a lot better. I think part of the reason we don't do it well is a lack of training that applies to the large scale, unstructured level of post-undergraduate projects.
Perhaps the lack of training actually reflects a lack of a consensus on what is good data organization and record keeping. Should we write daily records in our lab notebooks? Or is that overkill? Should data be stored by date or by project? Should we keep all notes, all protocols and results on the path to the final result, all stages of analysis? Or should we trash that stuff as we reach more advanced levels of the project? Should we print copies of the shittiest gels on earth? Or is it OK to leave that out if we think we know why it went wrong? Should we print out programming code that is hundreds of lines long? Is data organization and record keeping too field or project specific to reach a consensus on how to do it well?
So, that's a summary of what I wanted to say, and some of the previous posts expound on the stuff in the 2nd paragraph. There still may be more to come on this topic, but that's it for now, anyway.