Not so long ago, studying gene expression was limited to isolating mRNA, separating on a gel and transfer on a nylon membrane (the good old Northern Blot) and finishing off with hybridization with a radioactive probe specific to your gene of interest. This approach had many limitations:
Slide-based hybridization techniques that started to show up at the beginning of the 1990's changed all this. By inverting the process, using the mRNA as a pool of probes rushing to hybridize themselves on a collection of known gene sequences attached to a solid substrate, you now got access to the whole picture of the mRNAs present in a given sample. Because arrays are always built the same way in an industrial process (assuring, in theory, stability and quality), you could use as many samples as you wanted, the limit being the financial resources of your lab . Since the information that is generated from an array is digital(essentially, emitted light intensity), it is now possible to use software tools to analyze the results at a genome-scale resolution.
To analyze gene expression data, you can use two different ways:
With our experience gathered in teaching the topic, we have observed that there is a need for both. The GUI road allows for a rapid understanding of the principles involved, it can get hard to use if you have a large dataset. The CLI methods on the other end are more flexible because you are free to do whatever you want on large datasets but the learning curve can be steep. This is why we have developed a tutorial serie for each approach.
If you are not using an Impilo server, you need to have the following installed on your machine: