Creating a program computing intron sizes


The final program that you will have to write is expected to do the following things:

  • Load a gtf file (e.g. this one)
  • Compute the size of introns.
  • Draw the distribution of intron size.


  • If needed/possible, use regular expression (e.g. to extract transcript name).
  • Use an argument parser so that one can call it from command line.
  • The color of the diagram should be proposed as an argument to the parser.

Note about regular expressions

In the regular expression below ([^"]+) the parentheses are used to capture a motif (here, a succession of characters that are different from ‘"’. The found motif can be recovered using the group() method.

## tx1