repo.or.cz
/
tika.git
/
search
commit
grep
author
committer
pickaxe
?
search:
re
summary
|
log
|
graphiclog1
|
graphiclog2
|
commit
|
commitdiff
|
tree
|
refs
|
edit
|
fork
first
·
prev
·
next
Modified svn:ignore to cover things like ".checkstyle".
2008-06-06
J
u
kka
L
auri Zi
t
ting
Mo
d
ified
s
v
n:igno
r
e
t
o
c
over things
l
ike "
.
checkstyle"
.
commit
|
commitdiff
|
tree
2008-06-06
Jukka Lauri Zit
t
ing
TIKA-143:
A
dd Parsi
n
g
Reade
r
commit
|
commitdiff
|
tree
2008-05-06
Jukka La
u
ri Zit
t
ing
Simplif
i
ed log4j configuration for unit
t
es
t
s
commit
|
commitdiff
|
tree
2008-05-06
Jukka L
a
uri Z
i
tting
TIKA-92: Im
a
ge met
a
data extraction
commit
|
commitdiff
|
tree
2008-05-05
Jukka Lauri Zitting
T
IKA-87: MimeT
y
pes s
h
ould allow
modification of MIM
E
.
.
.
commit
|
commitdiff
|
tree
2008-04-11
J
u
kka La
u
r
i Zitting
TIK
A
-139:
Add a co
m
posite parser
commit
|
commitdiff
|
tree
2008-04-10
Jukka
L
a
uri Zitti
n
g
Re
p
laced t
a
bs
w
ith spaces in tika-mime
t
ype
s
.
x
m
l
commit
|
commitdiff
|
tree
2008-04-10
Jukka Lau
r
i
Z
itt
i
ng
T
IKA-113: Metadata (su
c
h as title)
sh
o
uld not b
e
pa
r
t
.
.
.
commit
|
commitdiff
|
tree
2008-04-08
Jukka La
u
r
i
Zitti
n
g
TIKA-138: Igno
r
e
HT
M
L
st
y
l
e
and
s
cript content
commit
|
commitdiff
|
tree
2008-03-28
Jukka Lauri Zitting
TIKA
-
1
34: m
v
n packag
e
do
e
s not
p
ro
d
uce
p
ackages for
.
.
.
commit
|
commitdiff
|
tree
2008-03-28
Jukka La
u
ri Zitting
TIKA-123: Structu
r
e
d MS Office parsing
commit
|
commitdiff
|
tree
2008-03-28
J
u
k
ka Lauri Zitti
n
g
TIKA-12
3
: St
r
uctured M
S
Office parsing
commit
|
commitdiff
|
tree
2008-03-28
Jukka
L
auri Zitting
T
IKA-132: Refa
c
tor Excel e
x
t
r
actor t
o
parse per she
e
t
.
.
.
commit
|
commitdiff
|
tree
2008-03-27
Ju
k
ka Lauri Zitting
R
eformatted
N
OTICE to
be less
v
erbose
commit
|
commitdiff
|
tree
2008-03-27
J
u
kka Lauri Zitt
i
ng
TI
K
A-97: Tika GU
I
commit
|
commitdiff
|
tree
2008-03-26
J
ukka
L
aur
i
Zi
t
ting
TIKA-1
3
2: Refactor Excel ext
r
actor to
p
a
rse per sheet
.
.
.
commit
|
commitdiff
|
tree
2008-03-26
J
u
kka Lauri Zitting
TIKA
-
132: Refactor Excel extract
o
r to
parse per sheet
.
.
.
commit
|
commitdiff
|
tree
2008-03-26
Jukka
L
auri Zitting
TIKA-1
3
2: Refacto
r
Excel ext
r
ac
t
or to pa
r
se pe
r
sheet
.
.
.
commit
|
commitdiff
|
tree
2008-03-26
Jukka Lauri Zi
t
t
ing
TIKA-132:
Refactor Excel extracto
r
to parse per sheet
.
.
.
commit
|
commitdiff
|
tree
2008-03-26
Jukk
a
Lauri Zittin
g
TIKA-132: Refactor Exce
l
extractor to parse pe
r
sheet
.
.
.
commit
|
commitdiff
|
tree
2008-03-26
J
u
kka Lauri Zitting
TIKA-132
:
Refactor Excel e
x
tractor
to p
a
rse per she
e
t
.
.
.
commit
|
commitdiff
|
tree
2008-03-26
Jukka L
a
u
r
i Zitting
TIKA-132: Refactor Excel
e
x
tract
o
r to p
a
rse
p
e
r sheet
.
.
.
commit
|
commitdiff
|
tree
2008-03-26
Jukka Lau
r
i Zitting
TIKA-132: Refac
t
or Excel
extractor
t
o par
s
e
p
e
r
shee
t
.
.
.
commit
|
commitdiff
|
tree
2008-03-26
Jukka Lauri Zitting
T
I
K
A-132:
Refactor Excel extractor to par
s
e per s
h
ee
t
.
.
.
commit
|
commitdiff
|
tree
2008-03-26
Jukka
Lauri Z
i
tting
TIKA-132: Refac
t
or Exc
e
l ext
r
a
c
tor
t
o parse p
e
r s
h
eet
.
.
.
commit
|
commitdiff
|
tree
2008-03-26
Jukka La
u
ri Zitting
TIKA-97:
Tika GUI
commit
|
commitdiff
|
tree
2008-03-26
J
u
kka Lauri Zi
t
ting
TI
K
A
-
133: TeeContentHandler constru
c
tor s
h
ould
u
se
.
.
.
commit
|
commitdiff
|
tree
2008-03-19
Jukka Lauri Z
i
tting
TIKA-128: HTML
p
arser
should produce XHTML SAX even
t
s
commit
|
commitdiff
|
tree
2008-03-19
Juk
k
a
Lauri Zitt
i
ng
TIKA-131: Lazy XHTML prefix gen
e
ration
commit
|
commitdiff
|
tree
2008-03-18
Jukka Lauri
Zit
t
ing
TIKA-130: self-o
r
-descendant a
x
is do
e
s
not match self
.
.
.
commit
|
commitdiff
|
tree
2008-03-18
Jukka Lauri Zitting
T
I
KA-129: node(
)
suppor
t
f
o
r the s
t
reami
n
g XPath utility
commit
|
commitdiff
|
tree
2008-03-09
Jukka Lauri Z
i
tting
TIK
A
-1
2
7: Add s
u
p
p
ort for
V
isio files
commit
|
commitdiff
|
tree
2008-03-09
Jukka Lauri Zitting
TIKA-126:
Ad
d
Parser
.
p
arse(Inp
u
tS
t
ream, Metadata) for
.
.
.
commit
|
commitdiff
|
tree
2008-03-09
Jukka Lauri
Z
ittin
g
TI
K
A-123:
Structured
M
S
Office p
a
rsing
commit
|
commitdiff
|
tree
2008-03-09
Jukka
L
auri Zittin
g
TIKA-123:
Struc
t
ur
e
d M
S
Office p
a
r
sing
commit
|
commitdiff
|
tree
2008-02-19
Jukka L
a
ur
i
Zitti
n
g
TIKA-123: Structu
r
ed MS
O
ffice p
a
rsing
commit
|
commitdiff
|
tree
2008-02-19
Jukka L
a
u
r
i Zitting
T
IKA-
1
22: Use Commons IO 1
.
4
commit
|
commitdiff
|
tree
2008-02-18
Jukka L
a
uri Z
i
tting
TIKA-1
2
3: S
t
ructured
MS Office par
s
i
n
g
commit
|
commitdiff
|
tree
2008-02-18
J
u
kka Lauri
Z
itting
TIKA
-
123: Structur
e
d MS Offi
c
e pars
i
ng
commit
|
commitdiff
|
tree
2008-02-18
Jukka Lauri Zitting
TIKA-1
2
3:
S
t
r
u
c
t
ur
e
d MS
O
f
fice p
a
rsin
g
commit
|
commitdiff
|
tree
2008-02-18
Jukka Lauri
Z
i
t
t
i
n
g
T
IKA-103
:
E
xce
l
parsing ignores cell formating
commit
|
commitdiff
|
tree
2008-02-17
Jukka
Lauri Zitti
n
g
TIK
A
-123: Structured MS Office
parsing
commit
|
commitdiff
|
tree
2008-02-17
Jukka Lauri Zitting
TIKA-1
2
3: S
t
ructured
M
S Office pars
i
ng
commit
|
commitdiff
|
tree
2008-02-17
J
u
kka Lauri Z
i
tting
TIKA-123: Str
u
ct
u
r
e
d
M
S
Office pa
r
sing
commit
|
commitdiff
|
tree
2008-02-17
Ju
k
ka Lauri Zittin
g
TIKA-123: Str
u
ctured MS O
f
fice pa
r
sin
g
commit
|
commitdiff
|
tree
2008-01-26
Jukka La
u
r
i Zitt
i
ng
TIKA-118: Bouncy
C
astle b
i
n
a
ries requir
e
US exports
.
.
.
commit
|
commitdiff
|
tree
2008-01-25
Jukk
a
Lauri Zitting
TIKA-96: Tika CLI
commit
|
commitdiff
|
tree
2008-01-22
Jukka La
u
ri Zitting
TIKA-97: Tika
GUI
commit
|
commitdiff
|
tree
2008-01-22
Juk
k
a
La
u
ri Zitting
TIKA-97: Tika
GUI
commit
|
commitdiff
|
tree
2008-01-22
Jukka
Lauri Zit
t
ing
T
I
KA-97: Ti
k
a GU
I
commit
|
commitdiff
|
tree
2008-01-22
Jukka
Lauri Zitting
TIKA-97: T
i
ka
G
UI
commit
|
commitdiff
|
tree
2008-01-21
Jukka Lauri Zitting
TIKA-1
1
5: Tika pa
c
kage with all
t
he d
e
p
endencies
commit
|
commitdiff
|
tree
2008-01-21
Jukka
L
auri Zi
t
ting
TI
K
A-1
1
7
: D
r
o
p
JDOM and
J
axen
dependen
c
ies
commit
|
commitdiff
|
tree
2008-01-21
Jukka
L
auri
Z
it
t
in
g
TIKA-116:
S
treaming
parser for OpenDocument f
i
les
commit
|
commitdiff
|
tree
2008-01-21
Ju
k
ka L
a
u
r
i Zitting
TIKA
-
10
9
:
WordP
a
rser f
a
ils on some Wo
r
d fi
l
es
commit
|
commitdiff
|
tree
2008-01-20
Jukka Lauri Zitting
TIKA-105: Excel pars
e
r impleme
n
t
a
tion based on
POI
.
.
.
commit
|
commitdiff
|
tree
2008-01-20
Jukka
Lauri Zitting
TI
K
A
-105: Exc
e
l parser i
m
pleme
n
ta
t
io
n
based on POI
.
.
.
commit
|
commitdiff
|
tree
2008-01-20
Jukka Lauri
Z
i
tting
T
I
KA-109: Wor
d
Parser fails on some Word fi
l
es
commit
|
commitdiff
|
tree
2007-12-31
Ju
k
ka Lauri Zitting
pom
.
xml
:
Updated trunk ve
r
sion to 0
.
2-SNAPSHOT
commit
|
commitdiff
|
tree
2007-12-26
Jukka Lauri Zitting
TIKA-1
1
1: Missin
g
license
h
eaders
commit
|
commitdiff
|
tree
2007-12-26
Jukka La
u
ri Z
i
t
t
i
n
g
TIKA-110: Add
K
EYS fil
e
for
T
i
k
a
commit
|
commitdiff
|
tree
2007-12-21
Jukka Lauri Zitting
TI
K
A-105
-
Excel parser imp
l
ementati
o
n
based on P
O
I
.
.
.
commit
|
commitdiff
|
tree
2007-12-21
Juk
k
a Lauri Zi
t
ting
TIKA-106 - R
e
m
o
ve d
e
pendency
on
J
akarta ORO - use JDK
.
.
.
commit
|
commitdiff
|
tree
2007-12-21
Ju
k
ka Lauri Zitti
n
g
TIKA-104 - Add
u
ti
l
ity me
t
hods
to th
r
ow
IOException
.
.
.
commit
|
commitdiff
|
tree
2007-12-21
Juk
k
a La
u
r
i
Z
itting
TIKA-107
- Remove use
o
f assertions
f
o
r arg
u
ment checking
commit
|
commitdiff
|
tree
2007-11-25
Juk
k
a
Lau
r
i Zitting
TIK
A
-
1
02 - Parser implem
e
nta
t
ions loading a
l
arge a
m
ount
.
.
.
commit
|
commitdiff
|
tree
2007-11-25
Jukka
L
aur
i
Zitting
T
I
KA-1
0
2 - Pa
r
ser implemen
t
a
t
ions
loading
a
large amount
.
.
.
commit
|
commitdiff
|
tree
2007-11-20
Jukka Lauri Z
i
tti
n
g
T
I
K
A
-91: Add proper attribution for code from textmi
n
ing
.
o
r
g
commit
|
commitdiff
|
tree
2007-11-13
Jukka Lauri
Z
i
t
tin
g
TI
K
A-100 -
S
t
ructu
r
ed P
D
F parsi
n
g
commit
|
commitdiff
|
tree
2007-11-06
Jukka La
u
r
i Zitting
TI
K
A-87
-
MimeTypes
s
hould
allow mo
d
if
i
c
atio
n
of MIME
.
.
.
commit
|
commitdiff
|
tree
2007-11-05
Jukka Lauri Zitting
TIKA-
8
7 - Mi
m
eTyp
e
s shoul
d
a
llow modific
a
tion
o
f
MIME
.
.
.
commit
|
commitdiff
|
tree
2007-11-04
Juk
k
a Lauri Zit
t
ing
TIKA-87
-
Mim
e
Type
s
should allow
modif
i
catio
n
of MIME
.
.
.
commit
|
commitdiff
|
tree
2007-11-03
Jukka Lauri Zi
t
ting
TIKA-
8
7
-
MimeType
s
should allow modif
i
c
a
tion o
f
MIME
.
.
.
commit
|
commitdiff
|
tree
2007-11-03
Jukka Lauri Zitting
TIK
A
-8
7
- Mime
T
ypes
should allow modif
i
cation of MIME
.
.
.
commit
|
commitdiff
|
tree
2007-10-23
J
u
kka Lauri Zi
t
ting
TIKA
-
8
7
- MimeTy
p
es should a
l
low
m
odif
i
cation of MIM
E
.
.
.
commit
|
commitdiff
|
tree
2007-10-22
Jukka Lauri Zittin
g
TIK
A
-
8
5
- Add glob patterns from t
h
e ASF svn:eol-style
.
.
.
commit
|
commitdiff
|
tree
2007-10-22
Juk
k
a Lauri
Zitti
n
g
T
I
K
A-84
-
A
dd MimeTypes
.
g
e
t
M
imeType(InputStream)
commit
|
commitdiff
|
tree
2007-10-19
Jukka Lauri Zitting
T
I
KA-84 - A
d
d M
i
meTyp
e
s
.
g
etMim
e
Type(
I
nputStr
e
am)
commit
|
commitdiff
|
tree
2007-10-19
J
u
kka Lau
r
i Zi
t
ting
TIKA-8
3
- Cre
a
te a org
.
apa
c
he
.
tika
.
sax packag
e
f
or
.
.
.
commit
|
commitdiff
|
tree
2007-10-18
J
u
kka L
a
uri Zi
t
ting
Set
s
vn:eol
-
style
t
o
na
t
i
v
e
commit
|
commitdiff
|
tree
2007-10-18
Jukka Lauri Zitting
Correct indenting (fo
u
r
space
s
instead of one as the
.
.
.
commit
|
commitdiff
|
tree
2007-10-16
Jukka
Lauri
Zitting
TI
K
A-
7
1
-
R
e
move Par
s
erConf
i
g and
P
a
rserFa
c
tory
commit
|
commitdiff
|
tree
2007-10-15
Jukka
L
auri Zi
t
ting
R
emove
d
an extra debug print
commit
|
commitdiff
|
tree
2007-10-15
Jukka Lauri Zitting
TIKA-70 - Better MIME information for the Open Document
.
.
.
commit
|
commitdiff
|
tree
2007-10-15
Jukka Lauri Z
i
tti
n
g
TIKA-70 - Better
MIME inf
o
r
m
ation
f
or the Open Document
.
.
.
commit
|
commitdiff
|
tree
2007-10-15
Ju
k
ka La
u
ri Zi
t
t
i
ng
T
IKA-6
7
-
Add an auto-de
t
ecting Parser implementation
commit
|
commitdiff
|
tree
2007-10-15
Jukka Lauri Z
i
t
t
ing
T
I
KA
-
68 - Add d
u
mmy
p
ar
s
er classes to be used a
s
s
e
ntinels
commit
|
commitdiff
|
tree
2007-10-14
J
ukka
L
a
u
ri Zi
t
ti
n
g
TIKA-66 -
U
s
e Java 5
f
eatures
in org
.
apach
e
.
ti
k
a
.
mime
commit
|
commitdiff
|
tree
2007-10-14
J
u
k
ka Lauri
Zitting
TIKA-63 -
Avoid
m
ult
i
ple
p
asses
o
ver t
h
e input stream
.
.
.
commit
|
commitdiff
|
tree
2007-10-14
Jukka La
u
ri Zitting
T
I
KA-60 - Re
n
am
e
Micro
s
oft
p
a
r
ser clas
s
es
commit
|
commitdiff
|
tree
2007-10-14
Jukka Lauri Zit
t
ing
T
I
KA-
6
0
- Ren
a
me
M
icrosoft pa
r
ser classe
s
commit
|
commitdiff
|
tree
2007-10-13
J
u
kk
a
Lauri Zitti
n
g
TIK
A
-62
-
U
se
T
ikaConfig
.
getDefaultConfig
(
) inste
a
d
.
.
.
commit
|
commitdiff
|
tree
2007-10-12
Ju
k
ka Lauri Zitt
i
ng
TI
K
A-57 - R
e
name org
.
apac
h
e
.
tika
.
ms to org
.
apache
.
t
ika
.
.
.
commit
|
commitdiff
|
tree
2007-10-12
Jukka Lauri Zitting
TIKA-53 - XHTM
L
SA
X
events
f
rom pa
r
sers
commit
|
commitdiff
|
tree
2007-10-10
Juk
k
a Lauri Zittin
g
TIKA-4
0
- Ti
k
a
needs to support dive
r
s
e
c
ha
r
acter enco
d
in
g
s
commit
|
commitdiff
|
tree
2007-10-08
J
u
kka
Lauri Zitting
TIKA
-
41
- Re
s
ource files
o
cc
u
r
twice in jar file
commit
|
commitdiff
|
tree
2007-10-07
Jukka Lauri Zitting
TIK
A
-
45
- Rer
e
adableInputStream
n
eeds to be able to
.
.
.
commit
|
commitdiff
|
tree
2007-10-07
Jukka
L
auri Zitting
TIKA-48 -
M
erge M
S
Extractors
and Parsers
commit
|
commitdiff
|
tree
2007-10-07
J
ukka Lauri Zitt
i
ng
TIKA
-
4
6
- Use Me
t
ad
a
ta
in Pa
r
ser
commit
|
commitdiff
|
tree
2007-10-07
Jukka Lauri Zitting
T
IKA-46 - Us
e
Metada
t
a in Parser
commit
|
commitdiff
|
tree
next