repo.or.cz
/
tika.git
/
search
commit
grep
author
committer
pickaxe
?
search:
re
summary
|
log
|
graphiclog1
|
graphiclog2
|
commit
|
commitdiff
|
tree
|
refs
|
edit
|
fork
first
·
prev
·
next
TIKA-115: Tika package with all the dependencies
2008-06-06
Jukka Lauri Zitting
TIKA-115: Tika package w
i
th all the
dependencies
commit
|
commitdiff
|
tree
2008-06-06
J
u
kka
L
aur
i
Z
i
tt
i
ng
TIKA-11
5
:
Ti
k
a package with
all the depend
e
ncies
commit
|
commitdiff
|
tree
2008-06-06
Jukka Lauri Zit
t
ing
M
od
i
fied svn:ignore to c
o
ver things like "
.
checkstyl
e
"
.
commit
|
commitdiff
|
tree
2008-06-06
J
u
kka L
a
uri
Z
itting
TIKA-143: Add Parsi
n
gReader
commit
|
commitdiff
|
tree
2008-05-06
Jukk
a
Lauri Zitting
Simplif
i
ed log
4
j config
u
r
a
tion for unit tests
commit
|
commitdiff
|
tree
2008-05-06
Jukka L
a
uri Zittin
g
T
I
K
A-92: Image meta
d
a
t
a ext
r
action
commit
|
commitdiff
|
tree
2008-05-05
J
u
k
ka Lauri
Zittin
g
TIKA
-
8
7: MimeTypes should allow modifi
c
at
i
on of MIME
.
.
.
commit
|
commitdiff
|
tree
2008-04-11
Jukka
L
auri Zitting
T
IKA-1
3
9: Ad
d
a composite parser
commit
|
commitdiff
|
tree
2008-04-10
J
u
kka L
a
uri
Zitting
Replaced
tabs with spaces in tika-mimetypes
.
xml
commit
|
commitdiff
|
tree
2008-04-10
Jukka La
u
ri
Zitting
TI
K
A
-113: M
e
t
adata (such as tit
l
e) should not be
p
a
r
t
.
.
.
commit
|
commitdiff
|
tree
2008-04-08
Jukka Lauri Z
i
t
t
ing
TIKA-13
8
: Ignore HT
M
L style and script content
commit
|
commitdiff
|
tree
2008-03-28
Jukk
a
Lauri Zittin
g
T
I
K
A-134
:
m
v
n
pa
c
kage does not produce pa
c
kages fo
r
.
.
.
commit
|
commitdiff
|
tree
2008-03-28
Jukka Lauri Zitting
TIKA-123: Structured MS
O
ffi
c
e parsing
commit
|
commitdiff
|
tree
2008-03-28
J
ukka
L
auri
Z
i
tting
TIKA-
1
23: St
r
uct
u
r
e
d
MS Office parsing
commit
|
commitdiff
|
tree
2008-03-28
Jukka
L
auri
Zitti
n
g
T
IKA
-
132: Refactor
Ex
c
el extractor to
pars
e
per
sh
e
e
t
.
.
.
commit
|
commitdiff
|
tree
2008-03-27
Jukka Lauri Zit
t
ing
Refo
r
ma
t
ted N
O
TICE to b
e
less verbose
commit
|
commitdiff
|
tree
2008-03-27
Jukka Lauri Zitting
TIKA-
9
7: Tika GUI
commit
|
commitdiff
|
tree
2008-03-26
Jukka Lauri Zitting
TIKA-132: Refactor Excel extractor to parse p
e
r
s
heet
.
.
.
commit
|
commitdiff
|
tree
2008-03-26
Jukka
Lauri Zit
t
i
ng
TIKA-1
3
2:
R
efactor Exce
l
extractor to parse p
e
r
s
he
e
t
.
.
.
commit
|
commitdiff
|
tree
2008-03-26
Jukka Lauri Zi
t
t
ing
TIKA-132: Re
f
actor Ex
c
e
l
extractor
to
p
arse per
she
e
t
.
.
.
commit
|
commitdiff
|
tree
2008-03-26
Ju
k
k
a
La
u
ri Zitti
n
g
TIKA-132:
R
efactor Excel extract
o
r to parse per sheet
.
.
.
commit
|
commitdiff
|
tree
2008-03-26
Jukka L
a
u
ri
Z
i
tt
i
ng
TIKA-13
2
:
R
e
factor Ex
c
e
l e
x
t
ractor to pa
r
s
e per s
h
eet
.
.
.
commit
|
commitdiff
|
tree
2008-03-26
Jukka Lauri Z
i
t
t
ing
TIKA-132: Refactor Excel extracto
r
t
o parse per sheet
.
.
.
commit
|
commitdiff
|
tree
2008-03-26
Jukka L
a
uri
Zitting
TIKA-132: Refactor
Ex
c
el extractor t
o
p
a
r
se
p
er sheet
.
.
.
commit
|
commitdiff
|
tree
2008-03-26
Ju
k
ka Lauri
Z
i
t
t
ing
TIKA-132: Refactor Excel ex
t
r
actor to parse per shee
t
.
.
.
commit
|
commitdiff
|
tree
2008-03-26
J
ukka Lauri Zit
t
ing
TI
K
A-13
2
: R
e
fa
c
tor Excel ex
t
r
a
cto
r
to pa
r
se
p
er sheet
.
.
.
commit
|
commitdiff
|
tree
2008-03-26
Jukka Lauri Z
i
tting
TIKA-132:
R
efactor
E
x
cel extractor to parse per sheet
.
.
.
commit
|
commitdiff
|
tree
2008-03-26
Juk
k
a
Lauri Zitting
TIKA-97: Ti
k
a
GUI
commit
|
commitdiff
|
tree
2008-03-26
Jukka
Lauri Zit
t
ing
TIKA-133: TeeConten
t
Handl
e
r constructor should use
.
.
.
commit
|
commitdiff
|
tree
2008-03-19
J
u
k
k
a Lauri
Z
it
t
ing
TI
K
A-12
8
: H
T
ML pa
r
ser sho
u
ld produ
c
e XHTML SA
X
event
s
commit
|
commitdiff
|
tree
2008-03-19
Jukka Lau
r
i Zitting
TI
K
A-13
1
: Lazy XHTML pref
i
x generat
i
on
commit
|
commitdiff
|
tree
2008-03-18
Jukka Lauri Zit
t
ing
TIKA-130: self-or-de
s
cendant axis does
n
o
t
match
s
e
l
f
.
.
.
commit
|
commitdiff
|
tree
2008-03-18
Jukka Lauri Zitting
T
I
KA-129: node() support f
o
r the stre
a
mi
n
g XPath
u
t
ility
commit
|
commitdiff
|
tree
2008-03-09
J
u
k
ka Lauri Zitting
TIKA-1
2
7:
Add support for
V
i
sio
f
i
les
commit
|
commitdiff
|
tree
2008-03-09
Jukka Lauri Zi
t
ting
TI
K
A-126
:
Add P
a
rser
.
pa
r
se(Inpu
t
Stream, Metada
t
a) for
.
.
.
commit
|
commitdiff
|
tree
2008-03-09
Jukka L
a
uri Zit
t
ing
TI
K
A-123: Str
u
ctured MS
O
ff
i
c
e parsing
commit
|
commitdiff
|
tree
2008-03-09
Jukka L
a
uri Z
i
tting
TIKA-123: Str
u
ct
u
r
e
d MS Offi
c
e
p
arsing
commit
|
commitdiff
|
tree
2008-02-19
Jukka Lauri Zitti
n
g
TIKA-123: St
r
ucture
d
MS Office
p
arsing
commit
|
commitdiff
|
tree
2008-02-19
Jukka
L
a
uri Z
i
tting
TIKA-1
2
2
:
Use Commons IO 1
.
4
commit
|
commitdiff
|
tree
2008-02-18
Jukka
La
u
ri Zit
t
ing
TIKA-123: Struct
u
red MS Office parsin
g
commit
|
commitdiff
|
tree
2008-02-18
Jukka La
u
ri Zit
t
ing
TIKA
-
123: Structured MS
O
ffice
p
arsing
commit
|
commitdiff
|
tree
2008-02-18
Jukka Lauri Zitting
TIK
A
-123: Struct
u
red MS Office par
s
ing
commit
|
commitdiff
|
tree
2008-02-18
Ju
k
ka Lau
r
i
Zitting
T
IKA-103: Exc
e
l parsin
g
ignore
s
ce
l
l formating
commit
|
commitdiff
|
tree
2008-02-17
Jukka Lauri Z
i
tting
TIKA
-
123: Structured MS O
f
fice parsi
n
g
commit
|
commitdiff
|
tree
2008-02-17
J
u
k
ka La
u
ri Zitt
i
ng
T
IKA-123: Structured MS Office pa
r
sing
commit
|
commitdiff
|
tree
2008-02-17
Jukk
a
Lauri Zitti
n
g
TIKA-123: Struct
u
red M
S
Offi
c
e parsing
commit
|
commitdiff
|
tree
2008-02-17
Jukka Lauri Zitting
TIK
A
-
1
23
:
Struct
u
red MS
O
ffice
p
a
r
s
i
ng
commit
|
commitdiff
|
tree
2008-01-26
Ju
k
ka La
u
ri
Zitting
TIK
A
-118
:
Bouncy
Ca
s
tl
e
binari
e
s
r
e
quire US exports
.
.
.
commit
|
commitdiff
|
tree
2008-01-25
Jukka
L
auri Zitting
TI
K
A-96: Tika CLI
commit
|
commitdiff
|
tree
2008-01-22
J
ukka Lauri Zitting
TIKA-97:
Tika GUI
commit
|
commitdiff
|
tree
2008-01-22
Jukka Lauri Zitting
T
IKA-9
7
:
Tika GUI
commit
|
commitdiff
|
tree
2008-01-22
J
u
kka La
u
ri Zitti
n
g
TIKA-97: Tika GUI
commit
|
commitdiff
|
tree
2008-01-22
J
u
k
k
a
Lauri Zi
t
t
ing
TIKA-97: Tika
GUI
commit
|
commitdiff
|
tree
2008-01-21
Jukka La
u
ri Zi
t
t
ing
TIKA-115: Tika package with
all
the dependencies
commit
|
commitdiff
|
tree
2008-01-21
Jukka
L
a
u
ri Zitting
TIK
A
-117: Drop
J
DOM a
n
d Jaxen depend
e
ncies
commit
|
commitdiff
|
tree
2008-01-21
Juk
k
a Lauri Zitting
TIKA-116
:
Streaming parser
for
O
pen
D
o
cument files
commit
|
commitdiff
|
tree
2008-01-21
J
ukka Laur
i
Z
itti
n
g
TIKA-109: WordPa
r
se
r
fa
i
ls on some
Word fil
e
s
commit
|
commitdiff
|
tree
2008-01-20
Jukka L
a
uri Zitti
n
g
TIKA-105
:
Excel parser implementatio
n
b
ased on POI
.
.
.
commit
|
commitdiff
|
tree
2008-01-20
Juk
k
a Lauri Zitting
TI
K
A-1
0
5: Excel
p
arser implementation
based on POI
.
.
.
commit
|
commitdiff
|
tree
2008-01-20
Jukka Lauri Zitting
T
I
KA-1
0
9: WordParser
f
a
ils on som
e
Word f
i
les
commit
|
commitdiff
|
tree
2007-12-31
Jukka Lauri Zi
t
ting
pom
.
xml
:
Update
d
tru
n
k version to
0
.
2
-
S
N
A
PS
H
OT
commit
|
commitdiff
|
tree
2007-12-26
Jukka
Lauri Zittin
g
T
I
KA
-
111:
Miss
i
ng
l
icense head
e
rs
commit
|
commitdiff
|
tree
2007-12-26
Jukka
L
auri Z
i
tting
T
I
KA-1
1
0
:
A
d
d KEYS fil
e
for Tika
commit
|
commitdiff
|
tree
2007-12-21
Jukka Laur
i
Zitting
TIKA-105 -
Excel parser implementation base
d
on
POI
.
.
.
commit
|
commitdiff
|
tree
2007-12-21
J
ukka L
a
uri Zitting
TIKA
-
10
6
- Remove
dependency on Jakart
a
O
R
O
- use JDK
.
.
.
commit
|
commitdiff
|
tree
2007-12-21
Jukka Lauri Z
i
tting
TIKA-104 - Add
u
ti
l
ity methods to
throw I
O
Ex
c
eption
.
.
.
commit
|
commitdiff
|
tree
2007-12-21
Jukka Lauri Zitting
TIKA-
1
07
- Remove use of
a
ssertions for argument ch
e
cking
commit
|
commitdiff
|
tree
2007-11-25
Jukka Lauri Zitting
TI
K
A-102 -
P
a
rser
imp
l
ementa
t
ion
s
l
oa
d
ing
a large amo
u
n
t
.
.
.
commit
|
commitdiff
|
tree
2007-11-25
J
ukka Laur
i
Zitting
TIKA-
1
02 - Parse
r
implem
e
ntations loading a large amou
n
t
.
.
.
commit
|
commitdiff
|
tree
2007-11-20
J
u
kk
a
Lauri Zi
t
t
i
ng
TIKA-9
1
: Add pro
p
er attribution fo
r
code from
t
extmining
.
org
commit
|
commitdiff
|
tree
2007-11-13
Jukka Lauri Zitting
TIKA-100 - Structured PDF parsing
commit
|
commitdiff
|
tree
2007-11-06
J
ukka Lauri Zitt
i
ng
TIKA-8
7
- MimeType
s
should allow mo
d
ification of MIME
.
.
.
commit
|
commitdiff
|
tree
2007-11-05
J
ukka Lauri Zi
t
ting
TIK
A
-87 - MimeTypes shoul
d
allow modification of MIME
.
.
.
commit
|
commitdiff
|
tree
2007-11-04
Jukk
a
Lauri Zitt
i
ng
T
I
KA-87 - MimeTypes should a
l
l
ow mod
i
fication of MIME
.
.
.
commit
|
commitdiff
|
tree
2007-11-03
Jukka
L
auri Zitting
TIKA-
8
7 -
MimeTy
p
es should allow
m
o
d
ificat
i
on of MIME
.
.
.
commit
|
commitdiff
|
tree
2007-11-03
Jukka Lauri Z
i
tting
TIKA-87 - MimeTypes should
allow mod
i
f
ication of MIME
.
.
.
commit
|
commitdiff
|
tree
2007-10-23
Jukka Lauri
Zitt
i
ng
TIKA-
8
7 - MimeTypes
s
hould
a
llow modification of MIME
.
.
.
commit
|
commitdiff
|
tree
2007-10-22
Jukk
a
Lauri
Z
itting
TIKA
-
85 -
A
dd gl
o
b patterns from the ASF
s
v
n:eol-sty
l
e
.
.
.
commit
|
commitdiff
|
tree
2007-10-22
Jukka Lauri Zitting
TI
K
A
-
8
4 - Add MimeTypes
.
getMimeTy
p
e(Inp
u
tSt
r
eam)
commit
|
commitdiff
|
tree
2007-10-19
Jukka Lauri Zitting
TIKA-
8
4 - Add
M
imeTypes
.
g
et
M
imeType(I
n
p
utStream
)
commit
|
commitdiff
|
tree
2007-10-19
Juk
k
a
L
auri Zi
t
ti
n
g
TIKA-83 - Crea
t
e a org
.
apache
.
tika
.
s
a
x package for
.
.
.
commit
|
commitdiff
|
tree
2007-10-18
Jukka Lauri Zit
t
ing
Se
t
s
v
n:eol-style to
native
commit
|
commitdiff
|
tree
2007-10-18
J
u
kka Lauri Zitting
Correct indenting (four
s
paces instead of
one as the
.
.
.
commit
|
commitdiff
|
tree
2007-10-16
Jukka L
a
u
ri Zitting
TIKA-71 - Remove ParserConfig an
d
ParserFactory
commit
|
commitdiff
|
tree
2007-10-15
Jukka
Laur
i
Zitt
i
ng
Removed an extra debug
print
commit
|
commitdiff
|
tree
2007-10-15
Jukka
L
a
u
ri
Z
itting
TIKA-70 - Better MIME information
for the
Open Document
.
.
.
commit
|
commitdiff
|
tree
2007-10-15
Juk
k
a Lauri Zit
t
i
n
g
TIKA-70 - Better M
I
ME i
n
formatio
n
for the Open Documen
t
.
.
.
commit
|
commitdiff
|
tree
2007-10-15
Jukka Lau
r
i Zitti
n
g
TIKA-67 - Add an auto-detect
i
ng P
a
rser
implementation
commit
|
commitdiff
|
tree
2007-10-15
Jukka Lauri Zi
t
ting
TIKA-68
-
A
dd dum
m
y parser classes to be u
s
ed
a
s senti
n
els
commit
|
commitdiff
|
tree
2007-10-14
Jukka
Lauri Zitting
TIKA-66
-
Us
e
J
ava
5 featur
e
s in
o
rg
.
apache
.
tika
.
m
i
me
commit
|
commitdiff
|
tree
2007-10-14
J
ukk
a
Lauri Zi
t
ting
TIKA-6
3
- Avoid multipl
e
passes over the input
str
e
am
.
.
.
commit
|
commitdiff
|
tree
2007-10-14
J
ukka L
a
uri Zitting
TIKA-60 - Rename Micr
o
s
oft pa
r
se
r
classes
commit
|
commitdiff
|
tree
2007-10-14
Jukka Lau
r
i Zit
t
ing
T
I
KA-60 - Rename
M
icr
o
soft parser classes
commit
|
commitdiff
|
tree
2007-10-13
Jukka Lau
r
i Zitting
T
IKA-62 - Use
TikaConfig
.
getDefaultConfig() in
s
tead
.
.
.
commit
|
commitdiff
|
tree
2007-10-12
Jukka Lauri Zitt
i
ng
TIKA-57 - Rename o
r
g
.
apache
.
t
ika
.
ms to org
.
apache
.
ti
k
a
.
.
.
commit
|
commitdiff
|
tree
2007-10-12
Jukka Lau
r
i Zitting
TIKA-53 - XH
T
M
L
SAX e
v
ents f
r
om
p
arsers
commit
|
commitdiff
|
tree
2007-10-10
Jukka
L
aur
i
Zitt
i
n
g
TI
K
A-40 - Tika needs to sup
p
ort diverse character
encodin
g
s
commit
|
commitdiff
|
tree
2007-10-08
J
ukka
L
au
r
i Zitting
TIKA-41 - Resour
c
e fi
l
es oc
c
u
r
twice in jar file
commit
|
commitdiff
|
tree
2007-10-07
Jukka Lauri Zittin
g
TI
K
A-45 - Reread
a
bleInputStream needs
t
o
be able to
.
.
.
commit
|
commitdiff
|
tree
2007-10-07
Jukka Laur
i
Zitting
TIKA-
4
8 - Merge MS Extractors and Parse
r
s
commit
|
commitdiff
|
tree
next