repo.or.cz
/
tika.git
/
search
commit
grep
author
committer
pickaxe
?
search:
re
summary
|
log
|
graphiclog1
|
graphiclog2
|
commit
|
commitdiff
|
tree
|
refs
|
edit
|
fork
first
·
prev
·
next
TIKA-113: Metadata (such as title) should not be part of content
2008-04-10
Ju
k
ka Lauri Zi
t
ting
TIK
A
-113: Meta
d
ata (such
as title) should not be part
.
.
.
commit
|
commitdiff
|
tree
2008-04-08
Jukka Laur
i
Zitting
TI
K
A-
1
38: Ignore H
T
ML
s
tyle and
script conten
t
commit
|
commitdiff
|
tree
2008-03-28
Jukka Lauri Zitting
T
IKA-134: mvn p
a
ckage does not p
r
o
d
uc
e
packages for
.
.
.
commit
|
commitdiff
|
tree
2008-03-28
Jukka Laur
i
Zit
t
ing
TIKA
-
123
:
Struc
t
ur
e
d
M
S
Office pa
r
sing
commit
|
commitdiff
|
tree
2008-03-28
Juk
k
a Lau
r
i Zitt
i
ng
TIKA-1
2
3: S
t
ructured MS Off
i
ce parsing
commit
|
commitdiff
|
tree
2008-03-28
Jukka Lauri Zit
t
in
g
T
I
KA-132:
R
efactor
Excel extra
c
tor
to parse
per
s
h
e
et
.
.
.
commit
|
commitdiff
|
tree
2008-03-27
Jukka
L
auri Zitting
Reformatt
e
d
N
OTICE to be less ve
r
bose
commit
|
commitdiff
|
tree
2008-03-27
Jukka Lauri
Z
itting
TIKA
-
97: T
i
ka
G
UI
commit
|
commitdiff
|
tree
2008-03-26
Jukka L
a
uri
Z
itting
TI
K
A-132: Ref
a
ctor Excel extractor to pars
e
per
s
he
e
t
.
.
.
commit
|
commitdiff
|
tree
2008-03-26
Jukk
a
Lauri Zitti
n
g
TIKA-132: Refac
t
or Excel
e
xtract
o
r
to
pa
r
s
e
per sheet
.
.
.
commit
|
commitdiff
|
tree
2008-03-26
Jukka
L
auri Zitting
T
I
K
A
-
132:
R
efa
c
tor Excel ex
t
ractor to parse
p
er sheet
.
.
.
commit
|
commitdiff
|
tree
2008-03-26
Jukka Lauri Zitting
TIKA-132:
R
efactor Excel extractor to
parse per sheet
.
.
.
commit
|
commitdiff
|
tree
2008-03-26
Jukka Lauri Zitting
TIKA-1
3
2: Refact
o
r Excel extracto
r
to parse per sheet
.
.
.
commit
|
commitdiff
|
tree
2008-03-26
Jukka Lauri
Zitting
TIKA-132: Refactor E
x
cel extractor to parse per sheet
.
.
.
commit
|
commitdiff
|
tree
2008-03-26
Jukka Lauri
Zitting
TIK
A
-132: Refactor
E
x
cel extractor to parse per sheet
.
.
.
commit
|
commitdiff
|
tree
2008-03-26
Jukka Lauri Zitting
TIKA
-
132
:
Refactor Excel ex
t
ractor to
p
arse
p
er sheet
.
.
.
commit
|
commitdiff
|
tree
2008-03-26
Jukka Lauri Zitti
n
g
TIKA-132: Re
f
actor E
x
c
e
l
ext
r
actor to par
s
e
p
e
r sheet
.
.
.
commit
|
commitdiff
|
tree
2008-03-26
Juk
k
a Lauri Zitting
TIKA-1
3
2: Refac
t
or Excel e
x
tract
o
r t
o
parse per sheet
.
.
.
commit
|
commitdiff
|
tree
2008-03-26
Jukka Lauri Zit
t
ing
TI
K
A-9
7
: Tika GUI
commit
|
commitdiff
|
tree
2008-03-26
Jukka Lau
r
i Z
i
tti
n
g
TI
K
A-
1
33: TeeCo
n
tentHa
n
dl
e
r constructor
shoul
d
use
.
.
.
commit
|
commitdiff
|
tree
2008-03-19
Jukka
La
u
ri Zitting
T
I
KA
-
1
2
8: H
T
ML parser should produce
XHT
M
L SAX
events
commit
|
commitdiff
|
tree
2008-03-19
Juk
k
a
Lauri Zit
t
in
g
TIKA-131: Lazy XH
T
ML
p
refix
generation
commit
|
commitdiff
|
tree
2008-03-18
Jukk
a
Lauri Zitting
T
I
KA-130: self-or-desc
e
ndant axis does not match self
.
.
.
commit
|
commitdiff
|
tree
2008-03-18
Jukka La
u
ri Z
i
tting
T
IKA-129: n
o
de(
)
su
p
po
r
t for th
e
stream
i
ng XPath
utility
commit
|
commitdiff
|
tree
2008-03-09
Jukka Lauri Zittin
g
T
I
KA-127: Add
s
u
pport for Visio fil
e
s
commit
|
commitdiff
|
tree
2008-03-09
Ju
k
ka Lauri Zitting
T
I
KA-12
6
:
Add Parser
.
parse(InputStream, Met
a
data) fo
r
.
.
.
commit
|
commitdiff
|
tree
2008-03-09
Jukka L
a
uri
Z
i
t
t
i
ng
TIKA-123:
S
tr
u
ctured MS Office parsing
commit
|
commitdiff
|
tree
2008-03-09
Jukka La
u
r
i
Zitting
TIKA-123: St
r
uctured MS Of
f
ic
e
p
a
rs
i
ng
commit
|
commitdiff
|
tree
2008-02-19
Jukka Lauri Zittin
g
TIKA-123: Stru
c
tur
e
d MS Off
i
ce parsing
commit
|
commitdiff
|
tree
2008-02-19
Jukka Lauri Zitting
TI
K
A-122:
U
se Commons
IO 1
.
4
commit
|
commitdiff
|
tree
2008-02-18
Juk
k
a Lauri Zitting
T
I
KA-123: Structured MS Office pa
r
sin
g
commit
|
commitdiff
|
tree
2008-02-18
Jukka
Lauri Zitting
TIKA-123
:
S
truc
t
ured MS Off
i
c
e parsing
commit
|
commitdiff
|
tree
2008-02-18
Jukka Lauri Zit
t
ing
TIK
A
-123: Struc
t
ured MS Offi
c
e pars
i
ng
commit
|
commitdiff
|
tree
2008-02-18
J
ukka Lauri Zitting
T
I
KA-
1
03: Excel parsing ignore
s
cel
l
formating
commit
|
commitdiff
|
tree
2008-02-17
Ju
k
ka L
a
ur
i
Zitting
TIKA-123: Structured MS Office pa
r
sing
commit
|
commitdiff
|
tree
2008-02-17
Juk
k
a La
u
ri Zitting
TIK
A
-123:
Str
u
ctured
M
S
Office parsing
commit
|
commitdiff
|
tree
2008-02-17
Ju
k
ka Lauri Zitting
TIKA-123: Structured
MS Office pars
i
ng
commit
|
commitdiff
|
tree
2008-02-17
Jukka
L
auri
Z
itting
TIKA-123: St
r
uctured MS Office pa
r
sing
commit
|
commitdiff
|
tree
2008-01-26
J
ukk
a
Lauri
Zitting
TIKA-118:
Bounc
y
Castle binaries require US exports
.
.
.
commit
|
commitdiff
|
tree
2008-01-25
Jukka
L
auri
Zitt
i
ng
TIKA-96: Tika
C
LI
commit
|
commitdiff
|
tree
2008-01-22
Jukka La
u
ri Zi
t
tin
g
TI
K
A-97: Tika GUI
commit
|
commitdiff
|
tree
2008-01-22
J
uk
k
a Lauri Zitting
TI
K
A-97: Ti
k
a GUI
commit
|
commitdiff
|
tree
2008-01-22
Jukka L
a
uri Zitt
i
ng
TIKA-
9
7: Tik
a
G
U
I
commit
|
commitdiff
|
tree
2008-01-22
Jukka Lauri Zittin
g
TIKA-97: Tika GUI
commit
|
commitdiff
|
tree
2008-01-21
J
u
kka Lauri
Zit
t
ing
T
I
KA-115: T
i
ka package wit
h
all the dependencies
commit
|
commitdiff
|
tree
2008-01-21
Jukka Lauri Zit
t
ing
TIKA-117:
D
rop JDOM
a
n
d
Jaxe
n
depen
d
encies
commit
|
commitdiff
|
tree
2008-01-21
Jukka Lau
r
i Zi
t
ting
TI
K
A
-116: Streaming p
a
rser for Ope
n
D
ocument files
commit
|
commitdiff
|
tree
2008-01-21
Jukka Lauri Zitting
TIKA-109: Wo
r
dParser
f
ai
l
s on some Word files
commit
|
commitdiff
|
tree
2008-01-20
Jukka La
u
ri Z
i
tting
TIKA-10
5
: Excel parse
r
implement
a
ti
o
n based on POI
.
.
.
commit
|
commitdiff
|
tree
2008-01-20
Jukka
L
aur
i
Z
ittin
g
TIKA
-
10
5
: Excel parser
implementation base
d
on POI
.
.
.
commit
|
commitdiff
|
tree
2008-01-20
Jukka La
u
r
i
Z
itti
n
g
TIKA-109: Wo
r
dParser fai
l
s on s
o
m
e Word file
s
commit
|
commitdiff
|
tree
2007-12-31
Jukka Lau
r
i Zitting
pom
.
xml:
Up
d
a
ted tr
u
nk vers
i
o
n
to
0
.
2-SNAPSHOT
commit
|
commitdiff
|
tree
2007-12-26
Juk
k
a Lau
r
i Zitting
TIKA-
1
11: Missing
l
icense
hea
d
ers
commit
|
commitdiff
|
tree
2007-12-26
Jukk
a
L
a
uri Zitting
TIKA-110: Add KE
Y
S
f
ile
fo
r
Tika
commit
|
commitdiff
|
tree
2007-12-21
J
u
kka Lauri Zitting
TIKA-105
-
Excel par
s
er implementation b
a
sed on
P
O
I
.
.
.
commit
|
commitdiff
|
tree
2007-12-21
Jukka
Lauri Zitting
TIKA-106
- Remove dependen
c
y
on Jaka
r
ta ORO
-
u
se J
D
K
.
.
.
commit
|
commitdiff
|
tree
2007-12-21
Jukk
a
La
u
ri Zit
t
ing
TIKA-104 - Ad
d
utility me
t
hods to th
r
ow IOExce
p
tion
.
.
.
commit
|
commitdiff
|
tree
2007-12-21
Jukka Lauri
Zitting
T
IKA-1
0
7 -
R
emove use of asse
r
tio
n
s fo
r
argument
c
hecking
commit
|
commitdiff
|
tree
2007-11-25
J
uk
k
a Lauri Zitting
TIKA-102
- Pa
r
ser imp
l
eme
n
t
a
tions load
i
ng
a large
a
mount
.
.
.
commit
|
commitdiff
|
tree
2007-11-25
J
ukka
Lau
r
i Zitt
i
ng
TIK
A
-102 - Parser
i
mplementations load
i
ng a large a
m
o
unt
.
.
.
commit
|
commitdiff
|
tree
2007-11-20
Jukka Lauri Zitting
TIKA-91: Add
proper a
t
tribu
t
ion for
c
o
de from textmining
.
org
commit
|
commitdiff
|
tree
2007-11-13
Ju
k
ka Lauri Zitting
TIKA-100
- Structured PDF parsing
commit
|
commitdiff
|
tree
2007-11-06
Jukka Lauri Z
i
tting
TIKA-
8
7 - MimeTypes should
allow mo
d
ification o
f
M
I
ME
.
.
.
commit
|
commitdiff
|
tree
2007-11-05
Ju
k
ka Lauri Z
i
tt
i
ng
TIK
A
-87 - MimeT
y
pes
s
hould allow modificatio
n
of MIME
.
.
.
commit
|
commitdiff
|
tree
2007-11-04
Jukka L
a
uri Zittin
g
TIKA-
8
7 -
MimeTypes should al
l
o
w
modification o
f
MIME
.
.
.
commit
|
commitdiff
|
tree
2007-11-03
J
ukka Lauri Zi
t
ting
TIK
A
-87 - MimeTyp
e
s shoul
d
a
l
low mod
i
fication of MIME
.
.
.
commit
|
commitdiff
|
tree
2007-11-03
J
ukka L
a
uri Zitti
n
g
TIKA-87
-
MimeTy
p
es should allow
m
odification of M
I
ME
.
.
.
commit
|
commitdiff
|
tree
2007-10-23
J
u
kka Lauri Zit
t
ing
TIKA-87 - MimeTypes shoul
d
allow
m
odification of
M
IME
.
.
.
commit
|
commitdiff
|
tree
2007-10-22
Jukka Lauri Zitting
T
I
KA-85 - Add glob pat
t
erns f
r
om
t
he ASF sv
n
:eol-style
.
.
.
commit
|
commitdiff
|
tree
2007-10-22
Jukka Lauri
Z
itting
T
I
KA-84 - Ad
d
M
i
m
e
T
y
pes
.
getM
i
meType(I
n
put
S
tr
e
am)
commit
|
commitdiff
|
tree
2007-10-19
Ju
k
ka Laur
i
Zitting
TIKA-
8
4 - Ad
d
MimeT
y
pes
.
getM
i
meType(InputSt
r
e
am)
commit
|
commitdiff
|
tree
2007-10-19
Jukka Lauri Zitting
TIK
A
-83 - Create a org
.
apache
.
tika
.
sa
x
package
f
or
.
.
.
commit
|
commitdiff
|
tree
2007-10-18
Ju
k
k
a
Lauri Zitti
n
g
S
et s
v
n:eol-style to na
t
ive
commit
|
commitdiff
|
tree
2007-10-18
Jukka Lau
r
i Zitt
i
ng
Correct indenting (fo
u
r
s
paces instead of o
n
e a
s
the
.
.
.
commit
|
commitdiff
|
tree
2007-10-16
Ju
k
ka Lauri Zi
t
t
i
ng
TIKA-71
- Remove ParserCon
f
ig
a
nd ParserFactory
commit
|
commitdiff
|
tree
2007-10-15
Jukka Lau
r
i Zit
t
i
ng
R
emoved an extra
debug
p
r
i
nt
commit
|
commitdiff
|
tree
2007-10-15
Jukka Lauri Zitti
n
g
TIKA-70 - Better M
I
ME
i
nformat
i
on for
t
h
e
O
p
e
n
Document
.
.
.
commit
|
commitdiff
|
tree
2007-10-15
Jukka Lau
r
i Z
i
tting
TIKA-7
0
- Better MIME i
n
formation for the Open Document
.
.
.
commit
|
commitdiff
|
tree
2007-10-15
Jukka Lauri Zitting
TIKA
-
67 - Add an auto-detecting Parser im
p
lementation
commit
|
commitdiff
|
tree
2007-10-15
Jukka
Lau
r
i
Zitting
TIKA-68
- Add dummy p
a
r
s
er c
l
asses to be
used as
s
entinels
commit
|
commitdiff
|
tree
2007-10-14
Jukk
a
L
a
uri Zitting
TIKA
-
6
6
- Use Java
5
features in org
.
apache
.
tik
a
.
mime
commit
|
commitdiff
|
tree
2007-10-14
Jukka Laur
i
Zitting
TIKA-63
- Avoi
d
multiple
p
asses
o
ver t
h
e
i
n
put s
t
re
a
m
.
.
.
commit
|
commitdiff
|
tree
2007-10-14
Jukka Laur
i
Zitt
i
n
g
TIKA-60 - R
e
name
Micros
o
ft p
a
rser clas
s
e
s
commit
|
commitdiff
|
tree
2007-10-14
Jukka Lauri
Zitting
TIKA-60 - Rename Micro
s
oft
p
arser cla
s
ses
commit
|
commitdiff
|
tree
2007-10-13
Jukk
a
Lauri Zitt
i
ng
TIKA-6
2
-
Use TikaConfig
.
g
etDefaultConfig()
instead
.
.
.
commit
|
commitdiff
|
tree
2007-10-12
Jukk
a
Lauri
Zitting
TIKA-57
- Rename
o
rg
.
apache
.
t
ika
.
ms to or
g
.
apa
c
he
.
t
i
ka
.
.
.
commit
|
commitdiff
|
tree
2007-10-12
Jukka La
u
r
i
Zitting
TIKA-53 - XHTML SAX events f
r
om
p
arsers
commit
|
commitdiff
|
tree
2007-10-10
Jukka
Lauri Zi
t
ting
TIKA-40 -
Tika needs
t
o support div
e
rse c
h
a
rac
t
er en
c
oding
s
commit
|
commitdiff
|
tree
2007-10-08
Jukka Lauri
Z
i
tting
T
I
KA
-
41 - Resourc
e
files occur twic
e
in jar file
commit
|
commitdiff
|
tree
2007-10-07
Jukka Lauri Zitting
TIKA-45 - Re
r
e
a
dableInput
S
tre
a
m needs to
b
e able
t
o
.
.
.
commit
|
commitdiff
|
tree
2007-10-07
Ju
k
k
a
Lauri Zitting
TIKA-48 - Me
r
ge MS Extra
c
tors an
d
Parsers
commit
|
commitdiff
|
tree
2007-10-07
J
u
kka
L
a
uri Z
i
tting
T
IKA-
4
6 -
U
se Metadata in Parser
commit
|
commitdiff
|
tree
2007-10-07
Jukka L
a
uri Zitt
i
ng
TIKA-46 -
U
se
M
etadata in Parse
r
commit
|
commitdiff
|
tree
2007-10-07
Ju
k
ka Lauri Z
i
tting
Set s
v
n:eol-style to nati
v
e
commit
|
commitdiff
|
tree
2007-10-07
Jukka Lau
r
i Zittin
g
T
I
KA-
4
6 - Use Metadata in Par
s
er
commit
|
commitdiff
|
tree
2007-10-07
J
u
kka
Lauri Zitting
TIKA-47 - Remove TikaLogge
r
commit
|
commitdiff
|
tree
2007-10-07
Jukka Lauri Zit
t
ing
TI
K
A
-43
-
Par
s
er int
e
rface
commit
|
commitdiff
|
tree
2007-10-07
Jukka Lauri Z
i
ttin
g
TIKA-43 - Pa
r
ser interf
a
ce
commit
|
commitdiff
|
tree
2007-10-05
J
u
k
ka Lauri Zitting
TIKA-42 - Conte
n
t class ne
e
ds (String,
Str
i
ng,
S
t
ring
.
.
.
commit
|
commitdiff
|
tree
2007-10-05
Jukka L
a
uri Zitting
TIKA-
4
4 - Spaces for indent
a
tion
commit
|
commitdiff
|
tree
next