#@ This is the set of MIME types that will be compiled in.
#@ Official MIME types are registered at and managed by IANA.
#@ A more complete list of actually used MIME types can be found at
#@ <http://svn.apache.org/viewvc/tika/trunk/tika-core/src/main/resources/\
#@ org/apache/tika/mime/tika-mimetypes.xml>.
#@ This file adds some more; search for EXTENSION comments below.
#@ It also changes some TIKA things; search for CHANGE comments below.
# Lines starting with a number sign (#) (after possible whitespace) are
# comments and discarded, empty lines are ignored, remaining ones are
# interpreted according to the scheme
#    [TYPEMARKER ]MIMETYPE whitespace EXTENSION(S, whitespace separated)
# MIMETYPE may occur several times, values are joined.
# - Follow lines may be used: instead of repeating MIMETYPE, start the next
#   line with whitespace. (Intervening comment lines may be used.)
# - TYPEMARKER: an at sign (@) will tag the MIME type as plain text (unless
#   the user installs a specific handler for the type in question).
#   The meaning is identical to what is documented for command `mimetypes'.
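#
# A minimal sketch of the scheme described above, kept as comments so that
# it is discarded; the entries merely mirror ones found further below, and
# the layout is an assumption made for illustration:
#    @ text/x-diff diff
#       patch
#    application/x-xz xz txz
# Here text/x-diff is tagged as plain text and gains its second extension
# through a follow line, whereas application/x-xz carries no TYPEMARKER.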
# Syntax restrictions:
# - Comments must be lines of their own.
# - Only MIME types for which we have a type constant allocated in
#   mime-types.c:enum mime_type are allowed in this file.
# - MIME types without any file extensions are not handled.
# - Note that the order of types and extensions is preserved ...
# - Note that at least one built-in type is required (empty C array).
# Expected frequent use
# text/plain CHANGE: m4->text/x-m4, pod->text/x-pod; rc<-
@ text/plain txt text conf cfg def list log in properties rc
@ text/x-diff diff patch
@ text/troff t tr roff man me ms tmac 1 2 3 4 5 6 7 8 9
@h@ application/xhtml+xml xhtml xht
@h@ text/html html htm
application/xml xml xsl xsd
@ application/x-sh sh bash
@ text/x-c++hdr hpp hxx hh h++ hp
@ text/x-c++src cpp cxx cc c++
@ text/x-perl pl pm al perl
# CHANGE (tika: text/plain)
application/x-xz xz txz
application/x-lzma lzma tlz
application/zstd zst tzst
application/gzip tgz gz emz
application/x-bzip2 bz2 tbz2 boz
application/pgp-encrypted pgp
@q@ application/pgp-signature sig asc
application/pkcs7-mime p7m p7c
@q@ application/pkcs7-signature p7s
application/postscript ps eps epsf epsi
image/jpeg jpg jpeg jpe jif jfif jfi
# Remainder, in alphabetical order
@ application/javascript js
@ application/json json
@ application/mbox mbox
application/rdf+xml rdf owl
application/rss+xml rss
application/x-bcpio bcpio
application/x-bittorrent torrent
application/x-bzip bz tbz
application/x-compress z
application/x-cpio cpio
@ application/x-csh csh tcsh
@ application/x-latex latex
application/x-shar shar
@ application/x-tex tex
@ application/x-texinfo texinfo texi
application/x-x509-ca-cert der crt
application/x-xfig fig
application/xml-dtd dtd
application/xquery xq xquery
application/xslt+xml xslt
audio/midi mid midi kar rmi
audio/mp4 mp4a m4a m4b
audio/mpeg mpga mp2 mp2a mp3 m2a m3a
audio/x-pn-realaudio ram ra
image/svg+xml svg svgz
image/x-ms-bmp bmp dib
image/x-xwindowdump xwd
message/rfc822 eml mime mht mhtml
@ text/calendar ics ifb
@ text/x-assembly s S asm
# (Note that the tuple extensions will never match since text/plain wins!)
@ text/x-asciidoc asciidoc adoc ad ad.txt adoc.txt
# (I have written .cgi's in assembler myself, so don't "@ " mark that one)
# CHANGE (tika: text/plain)
@ text/x-php php php3 php4
@ text/x-rst rst rest restx
@ text/x-vcalendar vcs
@ text/markdown md markdown