repo.or.cz
/
tika.git
/
search
commit
grep
author
committer
pickaxe
?
search:
re
summary
|
log
|
graphiclog1
|
graphiclog2
|
commit
|
commitdiff
|
tree
|
refs
|
edit
|
fork
first
·
prev
·
next
TIKA-99: Support external parser programs
2008-07-12
ju
k
k
a
TIKA-99
:
Suppo
r
t ext
e
rnal parser pr
o
g
r
ams
commit
|
commitdiff
|
tree
2008-07-09
Jukka Lauri Zitting
TI
K
A-5
4
:
O
ut
l
ook msg p
a
rser
commit
|
commitdiff
|
tree
2008-07-01
Juk
k
a Lau
r
i Zitt
i
n
g
T
I
KA-
1
46: Upgrad
e
to
P
O
I 3
.
1
commit
|
commitdiff
|
tree
2008-07-01
J
uk
k
a
L
auri Z
i
ttin
g
TIKA-146
:
U
pg
r
ad
e
to
POI 3
.
1
commit
|
commitdiff
|
tree
2008-06-18
Ju
k
ka Lauri
Zitti
n
g
TI
K
A-
1
4
5: Separ
a
t
e
NOTICEs and
L
ICENS
E
s
f
o
r
b
in
a
ry
.
.
.
commit
|
commitdiff
|
tree
2008-06-18
Jukka
L
auri Zitting
TIKA-
1
4
4: Upgrade nekoh
t
ml
d
ependenc
y
commit
|
commitdiff
|
tree
2008-06-06
Jukka L
a
uri Zitting
TI
K
A
-
118:
B
ouncycastle binar
i
es
r
equires US exports
.
.
.
commit
|
commitdiff
|
tree
2008-06-06
Jukka L
a
u
r
i Zit
t
ing
t
y
p
o
commit
|
commitdiff
|
tree
2008-06-06
Ju
k
ka Lauri Zitting
TI
K
A-1
1
5: Tika packa
g
e
with
a
ll the
d
epen
d
enc
i
es
commit
|
commitdiff
|
tree
2008-06-06
J
ukka Lauri Zitt
i
ng
TIKA-11
5
:
Tika package with all the dependen
c
ies
commit
|
commitdiff
|
tree
2008-06-06
Jukk
a
Lauri Zitt
i
ng
M
odified
s
vn:ignor
e
to cover thin
g
s like "
.
checkst
y
le"
.
commit
|
commitdiff
|
tree
2008-06-06
J
u
kka La
u
ri Zitting
TIKA-143:
A
d
d
Pars
i
n
g
Reader
commit
|
commitdiff
|
tree
2008-05-06
J
u
kka La
u
r
i Zitting
Simplifi
e
d log4j configuration for unit tes
t
s
commit
|
commitdiff
|
tree
2008-05-06
Ju
k
k
a Lauri
Zitting
TIKA-92: I
m
a
ge metadata
e
x
tract
i
on
commit
|
commitdiff
|
tree
2008-05-05
Jukka
L
auri Zitting
TIKA-
8
7
:
M
i
meType
s
s
ho
u
ld
a
llow mo
d
i
fication of MIME
.
.
.
commit
|
commitdiff
|
tree
2008-04-11
Jukka Lauri Zittin
g
TIKA-139: A
d
d a compo
s
i
t
e p
a
rser
commit
|
commitdiff
|
tree
2008-04-10
J
ukka Lau
r
i Zitting
Replaced tabs
w
i
t
h spaces
in tika-mimetypes
.
xml
commit
|
commitdiff
|
tree
2008-04-10
Jukka Laur
i
Zit
t
ing
T
I
KA-113: Metad
a
ta (such as ti
t
le) should not be part
.
.
.
commit
|
commitdiff
|
tree
2008-04-08
Jukka L
a
u
ri Zitting
TIK
A
-
1
38:
I
gnore
HTM
L
style
and scr
i
pt conte
n
t
commit
|
commitdiff
|
tree
2008-03-28
J
u
kka
L
auri Zitting
TIK
A
-
134: m
v
n package
does not produce pac
k
ages for
.
.
.
commit
|
commitdiff
|
tree
2008-03-28
Jukka Lauri Zit
t
ing
TIKA-12
3
:
S
tru
c
tured
M
S
Office parsing
commit
|
commitdiff
|
tree
2008-03-28
Jukka L
a
uri Z
i
t
ting
TIKA-123: Structured MS Office
pa
r
sing
commit
|
commitdiff
|
tree
2008-03-28
J
u
kka
L
au
r
i
Z
itting
TIKA-132: Refactor Exc
e
l
extr
a
ctor to parse per she
e
t
.
.
.
commit
|
commitdiff
|
tree
2008-03-27
J
ukka Lauri Zitting
Re
f
ormat
t
e
d
NOTICE
to b
e
less ver
b
o
s
e
commit
|
commitdiff
|
tree
2008-03-27
Jukka Lauri Z
i
tting
TIKA-
9
7: Tika GUI
commit
|
commitdiff
|
tree
2008-03-26
Jukka Lauri Zitting
TIKA-132: Refactor Ex
c
el
e
x
tractor to
p
arse per
s
h
e
e
t
.
.
.
commit
|
commitdiff
|
tree
2008-03-26
Jukka Lauri Zitti
n
g
TIKA-13
2
: Refactor Ex
c
el extractor to parse per sheet
.
.
.
commit
|
commitdiff
|
tree
2008-03-26
Ju
k
k
a Lauri Z
i
tting
TI
K
A-132: R
e
facto
r
Exc
e
l ext
r
a
ctor to pa
r
s
e per sheet
.
.
.
commit
|
commitdiff
|
tree
2008-03-26
Jukka Lauri
Zittin
g
TIKA
-
132:
Refactor Excel extr
a
ctor to
pa
r
se p
e
r s
h
e
et
.
.
.
commit
|
commitdiff
|
tree
2008-03-26
Jukka Lauri Zitting
TIKA-1
3
2: Refactor Exce
l
e
x
tractor to
parse per she
e
t
.
.
.
commit
|
commitdiff
|
tree
2008-03-26
Juk
k
a L
a
uri Zitting
T
IKA-132: Refactor Excel extr
a
ctor
t
o
p
a
r
se p
e
r sheet
.
.
.
commit
|
commitdiff
|
tree
2008-03-26
Juk
k
a Lau
r
i Zitt
i
n
g
TIKA-132: Refactor Excel extra
c
tor to p
a
rse
per sheet
.
.
.
commit
|
commitdiff
|
tree
2008-03-26
Jukka Lauri Zitting
TIKA-132:
R
efactor E
x
cel extr
a
ctor to pars
e
per
s
heet
.
.
.
commit
|
commitdiff
|
tree
2008-03-26
Ju
k
ka Lauri Zitting
TI
K
A
-132: R
e
factor Excel extra
c
tor to parse per s
h
e
et
.
.
.
commit
|
commitdiff
|
tree
2008-03-26
Jukka Lauri Zittin
g
TIKA-1
3
2: Refactor Excel ex
t
ractor to parse per sheet
.
.
.
commit
|
commitdiff
|
tree
2008-03-26
Jukka Lauri Zitt
i
ng
T
I
KA-97: Tika G
U
I
commit
|
commitdiff
|
tree
2008-03-26
Jukka Lau
r
i Zitting
T
IKA-133: Te
e
ContentHandler
cons
t
ructor should use
.
.
.
commit
|
commitdiff
|
tree
2008-03-19
Jukk
a
Lauri Zitting
TIKA
-
128: HTML parse
r
should produc
e
X
HTML
S
AX
e
v
e
n
t
s
commit
|
commitdiff
|
tree
2008-03-19
Jukk
a
Lauri Z
i
tting
T
I
KA-131: Lazy XHTML prefix
g
e
n
eration
commit
|
commitdiff
|
tree
2008-03-18
Ju
k
k
a Lau
r
i Zitting
TI
K
A-
1
30: self
-
or-descendant axis
d
oes n
o
t m
a
tch se
l
f
.
.
.
commit
|
commitdiff
|
tree
2008-03-18
Jukka Lauri Zitting
TIKA-129: node() suppo
r
t for the s
t
reaming XPat
h
utility
commit
|
commitdiff
|
tree
2008-03-09
Jukka La
u
ri Zitting
TIKA-127: Add su
p
p
o
r
t for Vi
s
io fi
l
es
commit
|
commitdiff
|
tree
2008-03-09
Jukka
Lauri Zi
t
ti
n
g
TIKA
-
1
26: Add Pars
e
r
.
p
a
rs
e
(
InputStream
,
Metadata) fo
r
.
.
.
commit
|
commitdiff
|
tree
2008-03-09
Jukka La
u
ri Zi
t
ting
T
I
K
A-123: Structured M
S
Office parsin
g
commit
|
commitdiff
|
tree
2008-03-09
Ju
k
k
a Lau
r
i
Zitting
TIKA-123: Structured
MS Office parsing
commit
|
commitdiff
|
tree
2008-02-19
Jukka Lauri
Z
itting
TIKA-123
:
Structured MS Office
pa
r
sing
commit
|
commitdiff
|
tree
2008-02-19
Jukka Lauri
Zitting
TIKA-12
2
: Use Commons
IO 1
.
4
commit
|
commitdiff
|
tree
2008-02-18
J
uk
k
a Lauri Zi
t
t
ing
TI
K
A-123: Structured MS
Off
i
ce parsing
commit
|
commitdiff
|
tree
2008-02-18
Jukka Lauri Zitting
TI
K
A
-
12
3
: Structured MS Office p
a
r
s
ing
commit
|
commitdiff
|
tree
2008-02-18
Jukk
a
L
a
uri Zitting
TI
K
A-123: Structured MS Of
f
ice parsing
commit
|
commitdiff
|
tree
2008-02-18
Jukka Lauri Zitting
TIKA-103: Exce
l
parsing ignores cell f
o
rm
a
ting
commit
|
commitdiff
|
tree
2008-02-17
J
u
k
ka Lauri Zitting
TIKA-12
3
: Stru
c
t
ure
d
MS Office parsing
commit
|
commitdiff
|
tree
2008-02-17
Jukka La
u
r
i Zitting
TIKA-123
:
Structure
d
MS Offi
c
e parsi
n
g
commit
|
commitdiff
|
tree
2008-02-17
J
uk
k
a
Lauri Zitting
TIKA-1
2
3: Struct
u
red
MS Office parsing
commit
|
commitdiff
|
tree
2008-02-17
J
u
kk
a
Lauri
Zi
t
ting
TIKA-123:
S
tructu
r
ed MS
O
ffice parsing
commit
|
commitdiff
|
tree
2008-01-26
Jukka Lauri
Zi
t
ting
TIKA-118: Bouncy Castle binar
i
es requir
e
US exports
.
.
.
commit
|
commitdiff
|
tree
2008-01-25
Jukka Lauri Zitting
TIKA
-
96: Tika
C
LI
commit
|
commitdiff
|
tree
2008-01-22
Ju
k
ka Lauri Zitting
TIKA-97: Ti
k
a GUI
commit
|
commitdiff
|
tree
2008-01-22
Jukka L
a
uri Zitting
TIKA-97: Tika GUI
commit
|
commitdiff
|
tree
2008-01-22
Jukk
a
Lauri Zitting
T
I
KA-97: Tika GUI
commit
|
commitdiff
|
tree
2008-01-22
Jukka Lau
r
i Zitting
TIKA-
9
7
: Ti
k
a GUI
commit
|
commitdiff
|
tree
2008-01-21
Jukka Lauri Z
i
tt
i
n
g
TIKA-115:
T
ika package with all the d
e
p
ende
n
ci
e
s
commit
|
commitdiff
|
tree
2008-01-21
Jukka Lau
r
i Zitt
i
ng
TIKA-11
7
: Drop JDOM a
n
d J
a
x
e
n dep
e
ndencies
commit
|
commitdiff
|
tree
2008-01-21
Jukka
Lauri Zi
t
ting
TIKA
-
1
16
:
Streaming parser fo
r
Op
e
nDocum
e
nt files
commit
|
commitdiff
|
tree
2008-01-21
Jukka Lauri Zitting
TIKA
-
10
9
:
WordParser fails on som
e
Word fi
l
es
commit
|
commitdiff
|
tree
2008-01-20
Jukka Lauri Zitti
n
g
TIKA-105: Ex
c
el parser impl
e
mentation b
a
sed on
PO
I
.
.
.
commit
|
commitdiff
|
tree
2008-01-20
Jukka Lauri Zitting
TIKA-105: Excel parse
r
implementa
t
ion
ba
s
ed
on
POI
.
.
.
commit
|
commitdiff
|
tree
2008-01-20
Jukka Lauri Zitting
T
I
K
A-109: Wo
r
dPars
e
r fail
s
o
n some Word file
s
commit
|
commitdiff
|
tree
2007-12-31
Jukk
a
Lauri Zitting
pom
.
xml:
U
pdated trunk version to
0
.
2-SNAPSHOT
commit
|
commitdiff
|
tree
2007-12-26
J
ukk
a
Lauri
Z
i
t
t
ing
TIKA-111: Missing license hea
d
ers
commit
|
commitdiff
|
tree
2007-12-26
Jukka Lauri Zitting
TI
K
A-
1
10: Add KEYS
file for Tika
commit
|
commitdiff
|
tree
2007-12-21
Ju
k
ka Laur
i
Zitting
TIKA-105 - Exce
l
parser im
p
lementat
i
on based o
n
P
OI
.
.
.
commit
|
commitdiff
|
tree
2007-12-21
Juk
k
a Lauri Zi
t
ti
n
g
TIKA-106 - Remove dependency
o
n
Jakarta ORO
- use
JDK
.
.
.
commit
|
commitdiff
|
tree
2007-12-21
Jukka Lauri Zitting
TIKA-104 - Add util
i
ty metho
d
s to throw IOException
.
.
.
commit
|
commitdiff
|
tree
2007-12-21
Jukka Lauri Zitting
TIKA-10
7
- R
e
move use of asser
t
i
o
ns for argu
m
en
t
check
i
n
g
commit
|
commitdiff
|
tree
2007-11-25
Jukka Lau
r
i Zitti
n
g
T
IKA-102 - Parser implem
e
ntations
l
oading a large amount
.
.
.
commit
|
commitdiff
|
tree
2007-11-25
Jukka
L
auri Z
i
tting
T
IK
A
-
102 - P
a
rser imple
m
entatio
n
s
l
oading a large
a
mount
.
.
.
commit
|
commitdiff
|
tree
2007-11-20
Jukka
Lauri
Zitti
n
g
TIKA-91: Add pro
p
er
a
t
tribution fo
r
code from textmining
.
o
r
g
commit
|
commitdiff
|
tree
2007-11-13
Jukka Lauri Zitting
TIKA-100 -
S
tructured
P
DF pa
r
s
ing
commit
|
commitdiff
|
tree
2007-11-06
Jukka Lauri Zi
t
tin
g
TIKA-87
-
M
imeTypes should
allow
modification
of MI
M
E
.
.
.
commit
|
commitdiff
|
tree
2007-11-05
J
ukka L
a
uri Zit
t
ing
TIKA-87
-
MimeTyp
e
s
shoul
d
allo
w
mo
d
ific
a
t
ion of MIME
.
.
.
commit
|
commitdiff
|
tree
2007-11-04
Ju
k
ka
L
a
uri
Z
itting
TIK
A
-87 - Mim
e
Ty
p
e
s should allow modificati
o
n of MIME
.
.
.
commit
|
commitdiff
|
tree
2007-11-03
J
u
k
ka Lauri Zitting
TIKA-87 - MimeTypes
should allow modific
a
tion of
M
IME
.
.
.
commit
|
commitdiff
|
tree
2007-11-03
Juk
k
a Lauri Zitting
TIKA-87 - Mi
m
e
Types shou
l
d
allow modification of MIME
.
.
.
commit
|
commitdiff
|
tree
2007-10-23
Jukka La
u
ri Zitting
TIKA-87
- MimeTypes sh
o
uld
allow m
o
di
f
ication of MIME
.
.
.
commit
|
commitdiff
|
tree
2007-10-22
Jukka Lau
r
i
Z
itting
T
I
KA-85
- Add
g
lob patterns
f
rom the ASF
svn:e
o
l-style
.
.
.
commit
|
commitdiff
|
tree
2007-10-22
Jukka Lauri Zitting
TIKA-84 -
Add Mime
T
ypes
.
getMimeType(Inp
u
tS
t
ream)
commit
|
commitdiff
|
tree
2007-10-19
Ju
k
ka Lauri Zitting
TIKA-84
- Add M
i
meTypes
.
getMim
e
Type(InputStream)
commit
|
commitdiff
|
tree
2007-10-19
Jukka L
a
uri Zitting
TIKA-83 - Create a org
.
apac
h
e
.
t
i
ka
.
sax
p
a
ckage f
o
r
.
.
.
commit
|
commitdiff
|
tree
2007-10-18
J
uk
k
a Lau
r
i Zitt
i
ng
S
e
t
svn:eol-style to nat
i
ve
commit
|
commitdiff
|
tree
2007-10-18
Jukka Lauri
Zitting
Correct indenting (fou
r
spaces instead
of one as the
.
.
.
commit
|
commitdiff
|
tree
2007-10-16
Jukka La
u
ri
Zitting
TIKA-71 - R
e
mov
e
P
a
r
s
e
rConfi
g
a
n
d ParserFactory
commit
|
commitdiff
|
tree
2007-10-15
Jukka Lauri Zitting
Removed
an extra debu
g
print
commit
|
commitdiff
|
tree
2007-10-15
Jukka Lauri Zitting
TIKA-
7
0
-
Better MIME information for the O
p
en D
o
cume
n
t
.
.
.
commit
|
commitdiff
|
tree
2007-10-15
Jukka La
u
ri Z
i
tting
T
I
KA-70 - Bette
r
MIME info
r
mation for the Op
e
n
D
ocument
.
.
.
commit
|
commitdiff
|
tree
2007-10-15
J
ukka Lau
r
i Z
i
tting
T
I
KA
-
67
- Add an auto-dete
c
ting Parse
r
implementation
commit
|
commitdiff
|
tree
2007-10-15
Jukka Lauri Zi
t
ting
TI
K
A-68 - Add dummy p
a
rse
r
classes t
o
be
used as sent
i
nels
commit
|
commitdiff
|
tree
2007-10-14
Jukka Lauri
Z
itting
TIKA-66
- Use
J
ava 5 features in org
.
a
pache
.
tika
.
mime
commit
|
commitdiff
|
tree
2007-10-14
Jukka L
a
uri Zitting
TIKA
-
63 - Av
o
id
multi
p
le passes
over the input
s
tream
.
.
.
commit
|
commitdiff
|
tree
2007-10-14
Jukka Lauri Z
i
tting
TIKA-60 - R
e
name
Micr
o
soft parser
cla
s
ses
commit
|
commitdiff
|
tree
next