repo.or.cz / tika.git / search
TIKA-113: Metadata (such as title) should not be part of content
2008-04-10  Jukka Lauri Zitting  TIKA-113: Metadata (such as title) should not be part...
2008-04-08  Jukka Lauri Zitting  TIKA-138: Ignore HTML style and script content
2008-03-28  Jukka Lauri Zitting  TIKA-134: mvn package does not produce packages for...
2008-03-28  Jukka Lauri Zitting  TIKA-123: Structured MS Office parsing
2008-03-28  Jukka Lauri Zitting  TIKA-123: Structured MS Office parsing
2008-03-28  Jukka Lauri Zitting  TIKA-132: Refactor Excel extractor to parse per sheet...
2008-03-27  Jukka Lauri Zitting  Reformatted NOTICE to be less verbose
2008-03-27  Jukka Lauri Zitting  TIKA-97: Tika GUI
2008-03-26  Jukka Lauri Zitting  TIKA-132: Refactor Excel extractor to parse per sheet...
2008-03-26  Jukka Lauri Zitting  TIKA-132: Refactor Excel extractor to parse per sheet...
2008-03-26  Jukka Lauri Zitting  TIKA-132: Refactor Excel extractor to parse per sheet...
2008-03-26  Jukka Lauri Zitting  TIKA-132: Refactor Excel extractor to parse per sheet...
2008-03-26  Jukka Lauri Zitting  TIKA-132: Refactor Excel extractor to parse per sheet...
2008-03-26  Jukka Lauri Zitting  TIKA-132: Refactor Excel extractor to parse per sheet...
2008-03-26  Jukka Lauri Zitting  TIKA-132: Refactor Excel extractor to parse per sheet...
2008-03-26  Jukka Lauri Zitting  TIKA-132: Refactor Excel extractor to parse per sheet...
2008-03-26  Jukka Lauri Zitting  TIKA-132: Refactor Excel extractor to parse per sheet...
2008-03-26  Jukka Lauri Zitting  TIKA-132: Refactor Excel extractor to parse per sheet...
2008-03-26  Jukka Lauri Zitting  TIKA-97: Tika GUI
2008-03-26  Jukka Lauri Zitting  TIKA-133: TeeContentHandler constructor should use...
2008-03-19  Jukka Lauri Zitting  TIKA-128: HTML parser should produce XHTML SAX events
2008-03-19  Jukka Lauri Zitting  TIKA-131: Lazy XHTML prefix generation
2008-03-18  Jukka Lauri Zitting  TIKA-130: self-or-descendant axis does not match self...
2008-03-18  Jukka Lauri Zitting  TIKA-129: node() support for the streaming XPath utility
2008-03-09  Jukka Lauri Zitting  TIKA-127: Add support for Visio files
2008-03-09  Jukka Lauri Zitting  TIKA-126: Add Parser.parse(InputStream, Metadata) for...
2008-03-09  Jukka Lauri Zitting  TIKA-123: Structured MS Office parsing
2008-03-09  Jukka Lauri Zitting  TIKA-123: Structured MS Office parsing
2008-02-19  Jukka Lauri Zitting  TIKA-123: Structured MS Office parsing
2008-02-19  Jukka Lauri Zitting  TIKA-122: Use Commons IO 1.4
2008-02-18  Jukka Lauri Zitting  TIKA-123: Structured MS Office parsing
2008-02-18  Jukka Lauri Zitting  TIKA-123: Structured MS Office parsing
2008-02-18  Jukka Lauri Zitting  TIKA-123: Structured MS Office parsing
2008-02-18  Jukka Lauri Zitting  TIKA-103: Excel parsing ignores cell formating
2008-02-17  Jukka Lauri Zitting  TIKA-123: Structured MS Office parsing
2008-02-17  Jukka Lauri Zitting  TIKA-123: Structured MS Office parsing
2008-02-17  Jukka Lauri Zitting  TIKA-123: Structured MS Office parsing
2008-02-17  Jukka Lauri Zitting  TIKA-123: Structured MS Office parsing
2008-01-26  Jukka Lauri Zitting  TIKA-118: Bouncy Castle binaries require US exports...
2008-01-25  Jukka Lauri Zitting  TIKA-96: Tika CLI
2008-01-22  Jukka Lauri Zitting  TIKA-97: Tika GUI
2008-01-22  Jukka Lauri Zitting  TIKA-97: Tika GUI
2008-01-22  Jukka Lauri Zitting  TIKA-97: Tika GUI
2008-01-22  Jukka Lauri Zitting  TIKA-97: Tika GUI
2008-01-21  Jukka Lauri Zitting  TIKA-115: Tika package with all the dependencies
2008-01-21  Jukka Lauri Zitting  TIKA-117: Drop JDOM and Jaxen dependencies
2008-01-21  Jukka Lauri Zitting  TIKA-116: Streaming parser for OpenDocument files
2008-01-21  Jukka Lauri Zitting  TIKA-109: WordParser fails on some Word files
2008-01-20  Jukka Lauri Zitting  TIKA-105: Excel parser implementation based on POI...
2008-01-20  Jukka Lauri Zitting  TIKA-105: Excel parser implementation based on POI...
2008-01-20  Jukka Lauri Zitting  TIKA-109: WordParser fails on some Word files
2007-12-31  Jukka Lauri Zitting  pom.xml: Updated trunk version to 0.2-SNAPSHOT
2007-12-26  Jukka Lauri Zitting  TIKA-111: Missing license headers
2007-12-26  Jukka Lauri Zitting  TIKA-110: Add KEYS file for Tika
2007-12-21  Jukka Lauri Zitting  TIKA-105 - Excel parser implementation based on POI...
2007-12-21  Jukka Lauri Zitting  TIKA-106 - Remove dependency on Jakarta ORO - use JDK...
2007-12-21  Jukka Lauri Zitting  TIKA-104 - Add utility methods to throw IOException...
2007-12-21  Jukka Lauri Zitting  TIKA-107 - Remove use of assertions for argument checking
2007-11-25  Jukka Lauri Zitting  TIKA-102 - Parser implementations loading a large amount...
2007-11-25  Jukka Lauri Zitting  TIKA-102 - Parser implementations loading a large amount...
2007-11-20  Jukka Lauri Zitting  TIKA-91: Add proper attribution for code from textmining.org
2007-11-13  Jukka Lauri Zitting  TIKA-100 - Structured PDF parsing
2007-11-06  Jukka Lauri Zitting  TIKA-87 - MimeTypes should allow modification of MIME...
2007-11-05  Jukka Lauri Zitting  TIKA-87 - MimeTypes should allow modification of MIME...
2007-11-04  Jukka Lauri Zitting  TIKA-87 - MimeTypes should allow modification of MIME...
2007-11-03  Jukka Lauri Zitting  TIKA-87 - MimeTypes should allow modification of MIME...
2007-11-03  Jukka Lauri Zitting  TIKA-87 - MimeTypes should allow modification of MIME...
2007-10-23  Jukka Lauri Zitting  TIKA-87 - MimeTypes should allow modification of MIME...
2007-10-22  Jukka Lauri Zitting  TIKA-85 - Add glob patterns from the ASF svn:eol-style...
2007-10-22  Jukka Lauri Zitting  TIKA-84 - Add MimeTypes.getMimeType(InputStream)
2007-10-19  Jukka Lauri Zitting  TIKA-84 - Add MimeTypes.getMimeType(InputStream)
2007-10-19  Jukka Lauri Zitting  TIKA-83 - Create a org.apache.tika.sax package for...
2007-10-18  Jukka Lauri Zitting  Set svn:eol-style to native
2007-10-18  Jukka Lauri Zitting  Correct indenting (four spaces instead of one as the...
2007-10-16  Jukka Lauri Zitting  TIKA-71 - Remove ParserConfig and ParserFactory
2007-10-15  Jukka Lauri Zitting  Removed an extra debug print
2007-10-15  Jukka Lauri Zitting  TIKA-70 - Better MIME information for the Open Document...
2007-10-15  Jukka Lauri Zitting  TIKA-70 - Better MIME information for the Open Document...
2007-10-15  Jukka Lauri Zitting  TIKA-67 - Add an auto-detecting Parser implementation
2007-10-15  Jukka Lauri Zitting  TIKA-68 - Add dummy parser classes to be used as sentinels
2007-10-14  Jukka Lauri Zitting  TIKA-66 - Use Java 5 features in org.apache.tika.mime
2007-10-14  Jukka Lauri Zitting  TIKA-63 - Avoid multiple passes over the input stream...
2007-10-14  Jukka Lauri Zitting  TIKA-60 - Rename Microsoft parser classes
2007-10-14  Jukka Lauri Zitting  TIKA-60 - Rename Microsoft parser classes
2007-10-13  Jukka Lauri Zitting  TIKA-62 - Use TikaConfig.getDefaultConfig() instead...
2007-10-12  Jukka Lauri Zitting  TIKA-57 - Rename org.apache.tika.ms to org.apache.tika...
2007-10-12  Jukka Lauri Zitting  TIKA-53 - XHTML SAX events from parsers
2007-10-10  Jukka Lauri Zitting  TIKA-40 - Tika needs to support diverse character encodings
2007-10-08  Jukka Lauri Zitting  TIKA-41 - Resource files occur twice in jar file
2007-10-07  Jukka Lauri Zitting  TIKA-45 - RereadableInputStream needs to be able to...
2007-10-07  Jukka Lauri Zitting  TIKA-48 - Merge MS Extractors and Parsers
2007-10-07  Jukka Lauri Zitting  TIKA-46 - Use Metadata in Parser
2007-10-07  Jukka Lauri Zitting  TIKA-46 - Use Metadata in Parser
2007-10-07  Jukka Lauri Zitting  Set svn:eol-style to native
2007-10-07  Jukka Lauri Zitting  TIKA-46 - Use Metadata in Parser
2007-10-07  Jukka Lauri Zitting  TIKA-47 - Remove TikaLogger
2007-10-07  Jukka Lauri Zitting  TIKA-43 - Parser interface
2007-10-07  Jukka Lauri Zitting  TIKA-43 - Parser interface
2007-10-05  Jukka Lauri Zitting  TIKA-42 - Content class needs (String, String, String...
2007-10-05  Jukka Lauri Zitting  TIKA-44 - Spaces for indentation