6 # Process STDIN (the interpro2go file from geneontology.org
7 # For each line, output to STDOUT the following:
8 # sourceaccession (IPR),targetaccession (GO)
9 # where dbid is the human-readable non-unique IPR reference,
10 # dbaccession is the unique IPR reference, and goaccession is the
11 # unique GO accession.
13 # Interpro lines look like this:
14 # InterPro:IPR000018 P2Y4 purinoceptor > GO:purinergic nucleotide receptor activity, G-protein coupled ; GO:0045028
17 s/^InterPro:(IPR\d+).*>.*;\s+(GO:\d+)/$1,$2/;