repo.or.cz
/
tika.git
/
search
commit
grep
author
committer
pickaxe
?
search:
re
summary
|
log
|
graphiclog1
|
graphiclog2
|
commit
|
commitdiff
|
tree
|
refs
|
edit
|
fork
first
·
prev
·
next
TIKA-139: Add a composite parser
2008-04-11
Jukka Lauri
Z
itti
n
g
TIKA-139: Add a co
m
p
o
site parse
r
commit
|
commitdiff
|
tree
2008-04-10
Ju
k
k
a Lauri Zitt
i
ng
Replaced tabs with spaces in tik
a
-mimetypes
.
xml
commit
|
commitdiff
|
tree
2008-04-10
Jukka Lauri Zitting
TIKA-113
:
Metadata (such as title) should
n
ot be part
.
.
.
commit
|
commitdiff
|
tree
2008-04-08
Jukka Lauri Zitt
i
ng
TIK
A
-138: Ign
o
re HTML style
a
nd scrip
t
content
commit
|
commitdiff
|
tree
2008-03-28
Jukka Lauri Zittin
g
T
I
KA-134:
mvn package does not produce
p
acka
g
es for
.
.
.
commit
|
commitdiff
|
tree
2008-03-28
Jukka La
u
ri
Z
i
tting
TIK
A
-123: Structured MS Office pars
i
ng
commit
|
commitdiff
|
tree
2008-03-28
Jukk
a
La
u
r
i Zitting
TIKA-123: Structured MS
O
f
fi
c
e
p
ar
s
ing
commit
|
commitdiff
|
tree
2008-03-28
Jukk
a
La
u
ri Z
i
tting
TIKA-132: R
e
factor Excel extr
a
ctor to parse per
sheet
.
.
.
commit
|
commitdiff
|
tree
2008-03-27
Jukka Lauri Zittin
g
Refo
r
m
a
tted N
O
TICE
t
o be les
s
v
erbo
s
e
commit
|
commitdiff
|
tree
2008-03-27
Ju
k
ka L
a
u
r
i Zitting
TIKA-97:
T
ika GUI
commit
|
commitdiff
|
tree
2008-03-26
Jukka Lauri
Zitt
i
ng
TIKA
-
132: Refactor Excel extra
c
tor to parse per sh
e
et
.
.
.
commit
|
commitdiff
|
tree
2008-03-26
Jukka Lauri Zitting
TIKA-13
2
:
R
e
factor Exc
e
l extract
o
r to parse per
s
heet
.
.
.
commit
|
commitdiff
|
tree
2008-03-26
J
ukka Lau
r
i
Z
itting
T
IKA-132: Refactor
Excel extractor to parse per sheet
.
.
.
commit
|
commitdiff
|
tree
2008-03-26
Jukka L
a
uri Z
i
tting
TIKA-13
2
:
Refact
o
r Excel
ex
t
r
a
c
t
o
r
t
o parse per she
e
t
.
.
.
commit
|
commitdiff
|
tree
2008-03-26
Jukka Lauri Zit
t
ing
T
IKA-132: Refactor Excel extractor
t
o pa
r
se
p
er sheet
.
.
.
commit
|
commitdiff
|
tree
2008-03-26
Jukka Lauri Z
i
tting
TIKA-132: Refa
c
tor
Ex
c
el extractor to
p
arse
p
er s
h
eet
.
.
.
commit
|
commitdiff
|
tree
2008-03-26
Jukka Lauri Zi
t
ting
TIKA-132: Refactor Exce
l
extractor to parse per
s
hee
t
.
.
.
commit
|
commitdiff
|
tree
2008-03-26
Jukk
a
Lauri Zitti
n
g
TIKA-132
:
Refa
c
tor Excel
e
x
t
ractor
t
o p
a
rse per
s
h
e
et
.
.
.
commit
|
commitdiff
|
tree
2008-03-26
Juk
k
a Lauri
Zitting
TIKA-132:
R
efactor E
x
c
el extract
o
r
to
p
arse
pe
r
sh
e
et
.
.
.
commit
|
commitdiff
|
tree
2008-03-26
Ju
k
ka Lauri Zitting
TIKA-132: Refac
t
or Excel extract
o
r to par
s
e per
s
heet
.
.
.
commit
|
commitdiff
|
tree
2008-03-26
Jukka Laur
i
Z
i
tting
TIKA-97: Tika
GUI
commit
|
commitdiff
|
tree
2008-03-26
J
u
kka Lauri
Zitting
TIKA
-
133: TeeCont
e
ntHandler constructor
sho
u
l
d use
.
.
.
commit
|
commitdiff
|
tree
2008-03-19
Jukka
L
aur
i
Zitting
TIKA-128: HTML parser
s
h
o
u
l
d
produce XHTML SAX events
commit
|
commitdiff
|
tree
2008-03-19
Jukka Lauri Zitting
TIKA
-
131: L
a
z
y XHTM
L
p
refix
g
eneratio
n
commit
|
commitdiff
|
tree
2008-03-18
Jukka Lauri Zitting
T
I
KA-130: sel
f
-or
-
desce
n
dant a
x
is doe
s
not match self
.
.
.
commit
|
commitdiff
|
tree
2008-03-18
Juk
k
a Lauri Zitting
TIKA-12
9
:
n
ode()
s
upport for t
h
e strea
m
ing XPat
h
utility
commit
|
commitdiff
|
tree
2008-03-09
J
ukka Lauri Zi
t
ting
TIKA-127: Add
s
upport for Visio file
s
commit
|
commitdiff
|
tree
2008-03-09
J
u
kk
a
La
u
ri Zitting
TIKA-126:
A
dd
P
arser
.
parse(Inp
u
tStream, Metad
a
ta) for
.
.
.
commit
|
commitdiff
|
tree
2008-03-09
Jukka
L
a
uri Zitt
i
ng
T
I
K
A
-123: Structure
d
M
S
Off
i
c
e parsi
n
g
commit
|
commitdiff
|
tree
2008-03-09
Jukka Lauri Zitt
i
ng
TIKA-123: Structure
d
MS Office par
s
in
g
commit
|
commitdiff
|
tree
2008-02-19
J
ukka
L
au
r
i Z
i
tting
TIKA-1
2
3: Struct
u
red MS Of
f
i
c
e
par
s
ing
commit
|
commitdiff
|
tree
2008-02-19
Jukka L
a
uri Zitting
TIKA-122: Use C
o
m
mon
s
I
O 1
.
4
commit
|
commitdiff
|
tree
2008-02-18
Jukk
a
Lauri Zitting
TIKA-123: Structured
MS Office
p
a
r
s
i
ng
commit
|
commitdiff
|
tree
2008-02-18
Jukk
a
L
auri Zitting
TIK
A
-123:
Stru
c
tur
e
d MS Off
i
ce parsing
commit
|
commitdiff
|
tree
2008-02-18
Jukka Lauri Zitting
TIKA-123: S
t
ructured
MS Offic
e
parsing
commit
|
commitdiff
|
tree
2008-02-18
Jukk
a
Lauri Zittin
g
TIKA-103:
Excel
p
arsing i
g
nores cell formating
commit
|
commitdiff
|
tree
2008-02-17
Jukka
L
auri
Z
itting
TIKA-123: Structur
e
d MS Office parsing
commit
|
commitdiff
|
tree
2008-02-17
Jukka Lauri Zitting
TI
K
A-123: Struct
u
red M
S
Office
p
a
rsing
commit
|
commitdiff
|
tree
2008-02-17
Jukka La
u
r
i Zitting
TIKA-12
3
: Structured MS Office
p
a
r
sing
commit
|
commitdiff
|
tree
2008-02-17
Jukka
Lau
r
i Zi
t
t
i
ng
TIKA-123: Structured MS Office pa
r
sing
commit
|
commitdiff
|
tree
2008-01-26
Jukka Lauri
Z
i
ttin
g
TIKA-118: Bouncy Castle binaries re
q
uire US e
x
p
orts
.
.
.
commit
|
commitdiff
|
tree
2008-01-25
J
ukka Lauri Zitt
i
ng
TIKA-9
6
: Ti
k
a
CLI
commit
|
commitdiff
|
tree
2008-01-22
Jukka La
u
ri Z
i
tting
TIKA-
9
7
:
Tika GUI
commit
|
commitdiff
|
tree
2008-01-22
J
u
kka Lauri Zitting
TIKA-
9
7: T
i
k
a GUI
commit
|
commitdiff
|
tree
2008-01-22
J
u
kka Lau
r
i Zitting
TIK
A
-97: T
i
ka GUI
commit
|
commitdiff
|
tree
2008-01-22
Ju
k
ka Lauri Zittin
g
T
I
K
A-97
:
Tika GUI
commit
|
commitdiff
|
tree
2008-01-21
Jukka Lauri Zitting
TIKA-115: Tika package with al
l
the dependenci
e
s
commit
|
commitdiff
|
tree
2008-01-21
J
ukka L
a
u
r
i Zitting
TIKA-117:
Drop J
D
OM and Jaxen
d
ependenci
e
s
commit
|
commitdiff
|
tree
2008-01-21
Juk
k
a Lauri
Z
itti
n
g
T
I
KA-116: Streaming parser f
o
r OpenDocument files
commit
|
commitdiff
|
tree
2008-01-21
Jukka Lauri Zitting
TIKA-109
:
W
o
rdParser fai
l
s on some Word files
commit
|
commitdiff
|
tree
2008-01-20
J
ukka Lauri Z
i
t
t
in
g
TIK
A
-
10
5
: Excel parser
implementation
b
ased on POI
.
.
.
commit
|
commitdiff
|
tree
2008-01-20
Juk
k
a La
u
ri Zitting
TI
K
A-105: E
x
cel parser imp
l
em
e
n
t
ation base
d
on POI
.
.
.
commit
|
commitdiff
|
tree
2008-01-20
Jukka Lauri Zitting
TIKA-109: W
o
rd
P
arser
f
ai
l
s
on some Word f
i
les
commit
|
commitdiff
|
tree
2007-12-31
Jukka
L
auri
Z
itting
pom
.
xml: Updat
e
d
trunk
ver
s
ion to
0
.
2-
S
NA
P
SHOT
commit
|
commitdiff
|
tree
2007-12-26
J
u
kka Lauri Zitting
T
I
K
A-111
:
Mi
s
sing li
c
ense headers
commit
|
commitdiff
|
tree
2007-12-26
Jukka
L
auri Z
i
tti
n
g
TIKA-110: Add K
E
YS fil
e
f
o
r
Tika
commit
|
commitdiff
|
tree
2007-12-21
J
u
kka L
a
u
r
i
Zitting
TIKA-10
5
-
E
x
cel parse
r
impleme
n
tation based on POI
.
.
.
commit
|
commitdiff
|
tree
2007-12-21
Jukka Lauri
Z
itting
T
I
K
A
-106 -
R
emo
v
e de
p
endency on Jakarta ORO -
u
se JDK
.
.
.
commit
|
commitdiff
|
tree
2007-12-21
Jukka Lauri
Z
itting
TIK
A
-104 - Add u
t
ility
methods
t
o throw IOException
.
.
.
commit
|
commitdiff
|
tree
2007-12-21
Jukka Lauri Zi
t
tin
g
T
I
KA-107 - Remove use of assertions fo
r
arg
u
ment
c
hecking
commit
|
commitdiff
|
tree
2007-11-25
Jukka Laur
i
Zitting
TIKA-
1
02
-
Par
s
er imple
m
ent
a
tions loading a large amount
.
.
.
commit
|
commitdiff
|
tree
2007-11-25
J
ukka L
a
uri Zitting
TIKA-102
-
Pa
r
ser implem
e
ntations
loading a larg
e
am
o
unt
.
.
.
commit
|
commitdiff
|
tree
2007-11-20
J
u
k
ka
Laur
i
Zitting
TIKA-
9
1: Ad
d
p
roper attrib
u
tion for code from tex
t
mining
.
org
commit
|
commitdiff
|
tree
2007-11-13
Jukka Laur
i
Zi
t
ting
T
I
KA-100 - Structured PDF
p
arsing
commit
|
commitdiff
|
tree
2007-11-06
Jukka La
u
ri
Zitting
TIKA-87
- MimeType
s
s
hould
allow mod
i
fica
t
ion of MI
M
E
.
.
.
commit
|
commitdiff
|
tree
2007-11-05
Jukka Lauri Zitting
TIKA-
8
7 - MimeTypes should allow
m
odi
f
i
c
ation
of MIME
.
.
.
commit
|
commitdiff
|
tree
2007-11-04
Jukka Lau
r
i Zitti
n
g
TIK
A
-87 -
MimeTypes
shou
l
d
a
l
l
o
w modificat
i
on of MIME
.
.
.
commit
|
commitdiff
|
tree
2007-11-03
Jukka Lauri Zitting
TIKA-
8
7 - M
i
m
eTypes s
h
ould allow modification of MIME
.
.
.
commit
|
commitdiff
|
tree
2007-11-03
Jukka
Lauri Zitting
TIKA-8
7
-
M
imeTyp
e
s s
h
ould allow modification of MIME
.
.
.
commit
|
commitdiff
|
tree
2007-10-23
Jukka
Lauri Zitting
TIKA
-
87
- MimeTypes
shou
l
d allow m
o
dificati
o
n of MIME
.
.
.
commit
|
commitdiff
|
tree
2007-10-22
J
ukka Lauri Zitting
TIKA-8
5
- Add glob
p
atterns
f
rom the
A
SF svn:eol-style
.
.
.
commit
|
commitdiff
|
tree
2007-10-22
Jukka Lauri Zitting
T
IKA-84 - Add
M
imeTypes
.
getMimeType(InputSt
r
eam)
commit
|
commitdiff
|
tree
2007-10-19
Jukka Lauri Zitti
n
g
TIKA-
8
4 -
A
dd Mi
m
eType
s
.
getMimeType(I
n
p
u
tS
t
r
e
am)
commit
|
commitdiff
|
tree
2007-10-19
Juk
k
a Lauri Zitt
i
ng
TIKA
-
83 - Create a org
.
apache
.
ti
k
a
.
sax package for
.
.
.
commit
|
commitdiff
|
tree
2007-10-18
Jukka Lauri Zitting
Se
t
svn:
e
ol
-
style
t
o
native
commit
|
commitdiff
|
tree
2007-10-18
Jukka
La
u
ri Zitting
Corre
c
t inden
t
i
n
g (four spaces inst
e
a
d of one as the
.
.
.
commit
|
commitdiff
|
tree
2007-10-16
Juk
k
a Lauri Zi
t
ting
TIKA-71 - Re
m
ove ParserCon
f
ig and Pa
r
serFact
o
ry
commit
|
commitdiff
|
tree
2007-10-15
Juk
k
a Lauri Zitting
Remov
e
d
a
n extra debug print
commit
|
commitdiff
|
tree
2007-10-15
Ju
k
k
a
Lauri Zi
t
tin
g
TIK
A
-70 - Better MIME informati
o
n for the
O
pen Document
.
.
.
commit
|
commitdiff
|
tree
2007-10-15
Jukka Lauri Zitting
TIKA-70 - Better M
I
M
E
i
n
format
i
on for the O
p
en Do
c
ument
.
.
.
commit
|
commitdiff
|
tree
2007-10-15
J
u
kka Laur
i
Zitting
TIKA-67 - Add an auto-
d
etecting Parser implem
e
nta
t
ion
commit
|
commitdiff
|
tree
2007-10-15
Jukka Lauri Zitti
n
g
TIKA-68 - Add
dummy parser
cla
s
ses to
b
e used
as
s
entinels
commit
|
commitdiff
|
tree
2007-10-14
J
u
k
k
a
Lauri Zitting
TIKA-
6
6
-
Use Java
5
f
e
a
tures in o
r
g
.
apache
.
t
i
ka
.
mime
commit
|
commitdiff
|
tree
2007-10-14
Jukka Lauri Zitting
TIKA
-
63 - Av
o
i
d
m
ultiple passes over the
i
nput stream
.
.
.
commit
|
commitdiff
|
tree
2007-10-14
Jukka
L
a
uri Zitting
TIKA-
6
0 - Ren
a
me Microso
f
t pars
e
r classes
commit
|
commitdiff
|
tree
2007-10-14
Jukka Lauri Zitt
i
ng
TIKA
-
60 - Rename Microsoft p
a
rs
e
r clas
s
es
commit
|
commitdiff
|
tree
2007-10-13
Ju
k
ka La
u
ri Zitting
TI
K
A-6
2
- Use TikaConfig
.
g
etD
e
faultConfig(
)
instea
d
.
.
.
commit
|
commitdiff
|
tree
2007-10-12
Jukk
a
Lauri Zitting
TIKA-57 -
R
e
name o
r
g
.
apac
h
e
.
t
i
ka
.
m
s
t
o o
r
g
.
apache
.
tika
.
.
.
commit
|
commitdiff
|
tree
2007-10-12
Jukk
a
Lauri Zi
t
ting
TIKA
-
53 - XH
T
ML
S
AX
e
vents from parsers
commit
|
commitdiff
|
tree
2007-10-10
J
ukka Lauri Zi
t
ting
TIKA-40 - T
i
ka needs
to support diverse characte
r
enco
d
i
ngs
commit
|
commitdiff
|
tree
2007-10-08
Jukka Lauri
Z
itting
T
IKA-41
-
R
esource files oc
c
u
r
twice in jar fi
l
e
commit
|
commitdiff
|
tree
2007-10-07
Jukka
Lauri Zitting
TIK
A
-45 - RereadableInputStream
n
e
eds
to be able to
.
.
.
commit
|
commitdiff
|
tree
2007-10-07
Juk
k
a Lauri Z
i
tting
TIKA
-
48 - Merge MS Extractor
s
and Parsers
commit
|
commitdiff
|
tree
2007-10-07
J
u
kka Lauri Z
i
tting
TIKA-46 - Use Metadata in Parser
commit
|
commitdiff
|
tree
2007-10-07
Ju
k
ka L
a
uri Zitting
TI
K
A-46 - Use Metadata in P
a
rser
commit
|
commitdiff
|
tree
2007-10-07
Jukka La
u
ri Zi
t
t
i
ng
S
et svn:eol
-
s
tyle to native
commit
|
commitdiff
|
tree
2007-10-07
Jukk
a
Lauri Zitting
TIK
A
-46 - Use
M
etada
t
a in Parser
commit
|
commitdiff
|
tree
2007-10-07
J
ukka Lauri
Z
itting
T
I
KA-4
7
- Remove TikaLogg
e
r
commit
|
commitdiff
|
tree
2007-10-07
Ju
k
k
a
Lauri Zit
t
ing
TI
K
A-43 -
P
arser i
n
terface
commit
|
commitdiff
|
tree
2007-10-07
Jukka Lauri Zit
t
ing
TI
K
A-43 - Par
s
er interface
commit
|
commitdiff
|
tree
next