repo.or.cz
/
tika.git
/
search
commit
grep
author
committer
pickaxe
?
search:
re
summary
|
log
|
graphiclog1
|
graphiclog2
|
commit
|
commitdiff
|
tree
|
refs
|
edit
|
fork
first
·
prev
·
next
TIKA-113: Metadata (such as title) should not be part of content
2008-04-10
Jukka Lau
r
i Zitting
T
I
KA-
1
13:
Metadata (such as
ti
t
le)
sho
u
ld not be part
.
.
.
commit
|
commitdiff
|
tree
2008-04-08
Jukk
a
Lau
r
i Z
i
t
ti
n
g
TIKA-138: Ignore H
T
M
L style
a
nd scrip
t
content
commit
|
commitdiff
|
tree
2008-03-28
Jukka Lauri Zitti
n
g
TIKA-13
4
: mvn package
d
oes
n
ot produce p
a
cka
g
es f
o
r
.
.
.
commit
|
commitdiff
|
tree
2008-03-28
Jukka Lauri Zitting
T
I
K
A
-
123: Struct
u
red MS
Office parsing
commit
|
commitdiff
|
tree
2008-03-28
Jukka Lau
r
i Zitting
T
IKA-
1
2
3
:
Structured
MS
Office parsing
commit
|
commitdiff
|
tree
2008-03-28
J
ukk
a
Lauri Zitting
TIKA-13
2
: Refactor Excel ex
t
r
a
c
tor to parse p
e
r shee
t
.
.
.
commit
|
commitdiff
|
tree
2008-03-27
Jukka
Lauri Z
i
tting
Refor
m
a
t
ted NOTICE to be less verbose
commit
|
commitdiff
|
tree
2008-03-27
Jukka Lauri Zit
t
ing
TIKA-
9
7: Tika GU
I
commit
|
commitdiff
|
tree
2008-03-26
Jukka
Lauri
Z
it
t
in
g
TIKA-132: Refa
c
tor Excel
e
x
tr
a
ctor to
p
arse per sheet
.
.
.
commit
|
commitdiff
|
tree
2008-03-26
Jukka Lauri Zitt
i
ng
TIKA-1
3
2
:
Refacto
r
E
xcel ext
r
a
c
to
r
t
o parse per sheet
.
.
.
commit
|
commitdiff
|
tree
2008-03-26
J
u
k
k
a
L
a
uri Zitting
TIKA-13
2
: Refact
o
r Excel ex
t
r
a
c
t
o
r to parse per sheet
.
.
.
commit
|
commitdiff
|
tree
2008-03-26
Jukka Lau
r
i Zittin
g
TIKA-13
2
: Refactor Excel extr
a
ctor to parse per sheet
.
.
.
commit
|
commitdiff
|
tree
2008-03-26
J
u
kka Lauri Zit
t
ing
TIKA-132: Refactor Excel extractor to par
s
e per sheet
.
.
.
commit
|
commitdiff
|
tree
2008-03-26
J
u
kka Lauri Zitti
n
g
TIKA-1
3
2:
Refactor Excel extrac
t
or to
parse per shee
t
.
.
.
commit
|
commitdiff
|
tree
2008-03-26
Jukka L
a
u
ri Zitting
TIKA-13
2
: Refact
o
r Ex
c
el extractor to parse p
e
r sheet
.
.
.
commit
|
commitdiff
|
tree
2008-03-26
Jukka
L
auri Zitting
T
I
KA-132: Refactor Excel extractor to parse per sheet
.
.
.
commit
|
commitdiff
|
tree
2008-03-26
Jukka Lauri Zitting
T
I
KA-132: Refactor Excel extractor to
p
a
rs
e
p
e
r sheet
.
.
.
commit
|
commitdiff
|
tree
2008-03-26
Jukk
a
La
u
r
i
Z
i
tting
T
I
KA-132: R
e
factor
Excel extractor to parse pe
r
sh
e
e
t
.
.
.
commit
|
commitdiff
|
tree
2008-03-26
Juk
k
a Lauri Z
i
t
ti
n
g
TIKA-97: Tika GUI
commit
|
commitdiff
|
tree
2008-03-26
Jukka Lauri Zitt
i
ng
T
I
KA-133
:
TeeC
o
nt
e
ntHa
n
dler
construct
o
r sho
u
ld use
.
.
.
commit
|
commitdiff
|
tree
2008-03-19
Jukka La
u
ri
Zitting
TIK
A
-128: HT
M
L parser
sh
o
u
ld pro
d
uce XH
T
ML SA
X
events
commit
|
commitdiff
|
tree
2008-03-19
Jukka L
a
uri Zitting
TIKA
-
131: Lazy XHTML pref
i
x generat
i
on
commit
|
commitdiff
|
tree
2008-03-18
Jukk
a
Laur
i
Zitting
TIKA-130: s
e
lf-or-desc
e
ndant
a
xis
does not match self
.
.
.
commit
|
commitdiff
|
tree
2008-03-18
Jukka Lauri Zitting
TIKA-129:
node() support for
the streaming
X
Path util
i
t
y
commit
|
commitdiff
|
tree
2008-03-09
J
u
k
ka
L
auri Zittin
g
TIKA-127: Add suppor
t
for Visio file
s
commit
|
commitdiff
|
tree
2008-03-09
Jukka
L
auri
Z
it
t
ing
TIKA-126: Add Par
s
er
.
parse(InputStream, Metad
a
t
a
)
f
or
.
.
.
commit
|
commitdiff
|
tree
2008-03-09
J
u
kka
Lauri Zitt
i
n
g
T
IKA-1
2
3: S
t
r
u
ctured MS Office parsing
commit
|
commitdiff
|
tree
2008-03-09
J
u
k
ka Lauri Zitt
i
ng
T
I
KA-123: Structured MS Office parsing
commit
|
commitdiff
|
tree
2008-02-19
J
u
k
ka Lauri
Zitting
TIKA-123: Str
u
ct
u
red MS Office parsing
commit
|
commitdiff
|
tree
2008-02-19
J
u
kka Lauri
Z
ittin
g
TIK
A
-122: Use Commons IO 1
.
4
commit
|
commitdiff
|
tree
2008-02-18
J
u
kka Lauri Zittin
g
TIKA-12
3
:
Structured MS
O
ffice pa
r
si
n
g
commit
|
commitdiff
|
tree
2008-02-18
Juk
k
a Lauri Zitt
i
ng
T
I
KA-123: Structured MS Office parsing
commit
|
commitdiff
|
tree
2008-02-18
Jukka
L
auri Z
i
tting
T
I
KA-
1
23: Structured
MS
Of
f
ice parsin
g
commit
|
commitdiff
|
tree
2008-02-18
Jukka
Lauri Zitting
TIKA-103
:
Ex
c
e
l
parsin
g
ignores cell formating
commit
|
commitdiff
|
tree
2008-02-17
Jukka Lauri Zit
t
ing
TIKA-123: Struc
t
ured MS Office p
a
rsin
g
commit
|
commitdiff
|
tree
2008-02-17
J
u
k
ka Lauri Zitting
TIKA
-
123:
S
tructured MS Offi
c
e
par
s
i
n
g
commit
|
commitdiff
|
tree
2008-02-17
Jukka Lauri Zittin
g
T
IKA-123: St
r
u
cture
d
MS Office parsing
commit
|
commitdiff
|
tree
2008-02-17
J
ukk
a
Lauri Zitting
T
I
KA-123
:
Structured MS Office parsing
commit
|
commitdiff
|
tree
2008-01-26
Jukk
a
Lauri Z
i
tting
T
I
KA-118: Bou
n
cy
C
astle binaries
require US ex
p
o
r
ts
.
.
.
commit
|
commitdiff
|
tree
2008-01-25
Jukka
La
u
ri Zitting
TIKA-96: Ti
k
a CLI
commit
|
commitdiff
|
tree
2008-01-22
Ju
k
k
a
Laur
i
Zit
t
in
g
TIKA-97: Tika GUI
commit
|
commitdiff
|
tree
2008-01-22
Jukka Lauri Zitting
TIKA-97: T
i
ka GUI
commit
|
commitdiff
|
tree
2008-01-22
Jukka
L
a
uri Zitting
TIKA-
9
7: Tika
G
UI
commit
|
commitdiff
|
tree
2008-01-22
Jukka Laur
i
Zitting
TIKA-9
7
: Tika GUI
commit
|
commitdiff
|
tree
2008-01-21
Jukka Lau
r
i
Z
itting
TIKA-115: T
i
ka package
w
ith all the dependenc
i
es
commit
|
commitdiff
|
tree
2008-01-21
Ju
k
ka Lauri Zitti
n
g
TIKA-1
1
7:
D
rop JDOM
a
nd
Jaxen depen
d
encie
s
commit
|
commitdiff
|
tree
2008-01-21
Jukk
a
Lauri Zi
t
ti
n
g
T
I
KA-11
6
: St
r
eaming pa
r
ser for OpenDocumen
t
files
commit
|
commitdiff
|
tree
2008-01-21
Jukka L
a
uri Zitting
TIKA-109: WordP
a
rser fails on some Word files
commit
|
commitdiff
|
tree
2008-01-20
Jukk
a
Laur
i
Zi
t
ting
TIKA-105: Excel parser impl
e
menta
t
i
on based on POI
.
.
.
commit
|
commitdiff
|
tree
2008-01-20
Jukka Lauri
Z
itting
TIKA-105: Excel pars
e
r
i
mplementation base
d
on PO
I
.
.
.
commit
|
commitdiff
|
tree
2008-01-20
Jukka Lauri Zitting
TI
K
A-10
9
: WordParser fails on some Word files
commit
|
commitdiff
|
tree
2007-12-31
Jukka L
a
u
ri Zitting
pom
.
xml: Updated trunk v
e
r
sion to 0
.
2-SNAPSHOT
commit
|
commitdiff
|
tree
2007-12-26
Juk
k
a
Lauri
Zitting
TIKA-
1
11:
M
issing license hea
d
e
r
s
commit
|
commitdiff
|
tree
2007-12-26
Jukka Lauri Zitti
n
g
TIKA-110: Ad
d
KEYS file f
o
r Tika
commit
|
commitdiff
|
tree
2007-12-21
Ju
k
ka
Lauri Zitting
TIKA
-
105 - Excel parser impl
e
mentatio
n
b
as
e
d
on POI
.
.
.
commit
|
commitdiff
|
tree
2007-12-21
Jukka
Laur
i
Zitting
T
IKA-10
6
- R
e
move dependency on Jakarta ORO - use JDK
.
.
.
commit
|
commitdiff
|
tree
2007-12-21
Jukk
a
Lauri
Zi
t
t
ing
TI
K
A-10
4
- Add utility methods to thro
w
IOE
x
ception
.
.
.
commit
|
commitdiff
|
tree
2007-12-21
Juk
k
a
Lauri Zitting
TIKA-107 - Rem
o
ve us
e
of ass
e
rtions f
o
r argumen
t
c
he
c
king
commit
|
commitdiff
|
tree
2007-11-25
Jukka Lauri Zi
t
tin
g
TIKA-
1
02 -
Par
s
e
r
implemen
t
ation
s
lo
a
ding a
l
arge am
o
un
t
.
.
.
commit
|
commitdiff
|
tree
2007-11-25
Ju
k
ka
L
aur
i
Zitting
TIKA-10
2
-
Pars
e
r
i
mplemen
t
a
tions loading a large amount
.
.
.
commit
|
commitdiff
|
tree
2007-11-20
Jukka Lau
r
i Zitting
TIKA
-
91: Add prop
e
r att
r
i
b
ution for code
from textmining
.
org
commit
|
commitdiff
|
tree
2007-11-13
J
uk
k
a Lauri Zi
t
ting
TIKA-100 - Structured PDF par
s
i
ng
commit
|
commitdiff
|
tree
2007-11-06
Jukka Lauri Zi
t
ting
TIKA
-
87 - MimeTypes shou
l
d allow modif
i
cation of MIME
.
.
.
commit
|
commitdiff
|
tree
2007-11-05
Jukka Laur
i
Z
it
t
ing
TIKA-87 - MimeTy
p
e
s
should allow
modification of MIME
.
.
.
commit
|
commitdiff
|
tree
2007-11-04
Juk
k
a Lauri Zitting
T
I
KA-
8
7
-
MimeTypes should allo
w
m
odi
f
i
c
ation of MIME
.
.
.
commit
|
commitdiff
|
tree
2007-11-03
Juk
k
a
L
a
uri Z
i
t
t
ing
TIK
A
-
8
7
-
MimeTypes shoul
d
allow mo
d
ification of M
I
ME
.
.
.
commit
|
commitdiff
|
tree
2007-11-03
J
ukka Lau
r
i
Zi
t
ting
TIKA
-
8
7
-
M
i
meTypes sh
o
u
l
d allow modification of
M
IME
.
.
.
commit
|
commitdiff
|
tree
2007-10-23
Jukk
a
Lauri Zitting
TIKA-87 - MimeTypes shou
l
d allow mo
d
ification of
MIME
.
.
.
commit
|
commitdiff
|
tree
2007-10-22
Jukka Laur
i
Z
i
tti
n
g
TIKA-
8
5
-
Add glob
patterns from the ASF s
v
n:eo
l
-style
.
.
.
commit
|
commitdiff
|
tree
2007-10-22
J
u
k
k
a L
a
u
ri Zit
t
ing
TIKA-84 -
A
d
d MimeTyp
e
s
.
getMim
e
T
y
pe(I
n
putStre
a
m)
commit
|
commitdiff
|
tree
2007-10-19
Jukka Lauri Zitti
n
g
TIKA-84 - Ad
d
Mi
m
e
T
ypes
.
ge
t
M
i
me
T
ype(InputStrea
m
)
commit
|
commitdiff
|
tree
2007-10-19
J
u
kka
L
auri
Z
it
t
ing
TIKA-83 - Crea
t
e a
o
rg
.
ap
a
che
.
tika
.
sax package for
.
.
.
commit
|
commitdiff
|
tree
2007-10-18
Juk
k
a Lauri Zitt
i
ng
Set svn
:
eol-style to nativ
e
commit
|
commitdiff
|
tree
2007-10-18
Jukka Lauri Zitting
C
o
rrect indenting (four
spaces instea
d
o
f
one
as the
.
.
.
commit
|
commitdiff
|
tree
2007-10-16
J
u
kka Lauri Zit
t
ing
TIKA-71 -
R
emov
e
Pars
e
rConfig a
n
d P
a
rs
e
rFa
c
to
r
y
commit
|
commitdiff
|
tree
2007-10-15
Jukka Laur
i
Zittin
g
Remo
v
ed an
e
xtra
d
ebug print
commit
|
commitdiff
|
tree
2007-10-15
Jukka Lauri Zitting
TIKA-70
-
Better MIME inf
o
rma
t
ion for the
Open Doc
u
ment
.
.
.
commit
|
commitdiff
|
tree
2007-10-15
Jukka Lauri Zitti
n
g
TIK
A
-
7
0 - Better
M
IM
E
informat
i
on
for
t
h
e
Op
e
n
D
o
cument
.
.
.
commit
|
commitdiff
|
tree
2007-10-15
Jukka Lauri Zitting
TIKA
-
67 - Add an auto-dete
c
ting
Parser imp
l
e
m
entati
o
n
commit
|
commitdiff
|
tree
2007-10-15
Ju
k
ka
Laur
i
Zi
t
ting
TIKA
-
68 - Add dummy p
a
rser classes to be u
s
ed as sentinels
commit
|
commitdiff
|
tree
2007-10-14
J
ukka L
a
uri Z
i
t
ting
T
I
KA-66 - Use
J
ava 5 f
e
atures in org
.
apac
h
e
.
tika
.
mime
commit
|
commitdiff
|
tree
2007-10-14
Jukk
a
Lauri Z
i
tting
TIKA-63
-
A
v
o
i
d
m
u
ltiple
p
asses
over the input stream
.
.
.
commit
|
commitdiff
|
tree
2007-10-14
Juk
k
a Lauri
Zittin
g
T
I
KA-60
-
Rename
M
icrosoft p
a
rse
r
classes
commit
|
commitdiff
|
tree
2007-10-14
J
u
kka
Lauri Zitting
T
I
KA-60
-
Renam
e
Mic
r
osoft parser classes
commit
|
commitdiff
|
tree
2007-10-13
Jukka Lauri Zitti
n
g
TIK
A
-62 - Use TikaConfig
.
get
D
efaultC
o
nf
i
g() instead
.
.
.
commit
|
commitdiff
|
tree
2007-10-12
Jukka L
a
uri Zitting
TIK
A
-
57 -
Rename
o
rg
.
a
pac
h
e
.
tika
.
ms to org
.
apache
.
tika
.
.
.
commit
|
commitdiff
|
tree
2007-10-12
Jukka
L
aur
i
Zitting
T
IKA-53 - XHTML SAX events from
p
arsers
commit
|
commitdiff
|
tree
2007-10-10
Jukka Lauri Zitting
TIKA-40 - Tika needs to suppor
t
divers
e
c
h
aracter enco
d
ings
commit
|
commitdiff
|
tree
2007-10-08
Jukka L
a
ur
i
Zitting
TIKA-41 - R
e
source files
o
ccur twi
c
e in jar fil
e
commit
|
commitdiff
|
tree
2007-10-07
Jukka Lauri
Z
itting
T
IKA-45 - RereadableInputStream needs to
b
e abl
e
to
.
.
.
commit
|
commitdiff
|
tree
2007-10-07
Jukka Lauri Zittin
g
T
I
KA
-
48
-
Merge
M
S
Extractors and Pa
r
sers
commit
|
commitdiff
|
tree
2007-10-07
Jukka Lauri
Z
i
t
ting
T
I
KA-4
6
- Us
e
Metad
a
ta in Parser
commit
|
commitdiff
|
tree
2007-10-07
Jukka
Lauri
Zitting
TIKA-46 - Use Metad
a
ta
in P
a
r
s
e
r
commit
|
commitdiff
|
tree
2007-10-07
Jukka L
a
u
r
i
Z
itting
Set svn:eol-style to na
t
ive
commit
|
commitdiff
|
tree
2007-10-07
Jukka Lauri Z
i
tting
T
I
KA-46 - Use Me
t
adata
in Parse
r
commit
|
commitdiff
|
tree
2007-10-07
Jukka Lau
r
i
Zitting
TIKA
-
47 - Remov
e
TikaLogger
commit
|
commitdiff
|
tree
2007-10-07
Jukka La
u
r
i
Zitting
TIKA-43 -
P
arser
i
nterface
commit
|
commitdiff
|
tree
2007-10-07
Jukka Lauri Zittin
g
TIKA-43 -
Parser inte
r
face
commit
|
commitdiff
|
tree
2007-10-05
Jukka Lau
r
i Zi
t
ting
TIKA-4
2
-
C
o
n
tent class
n
eeds (String, String, Str
i
ng
.
.
.
commit
|
commitdiff
|
tree
2007-10-05
Jukka Lauri Zitting
TIK
A
-44 - Spaces f
o
r in
d
e
n
tation
commit
|
commitdiff
|
tree
next