repo.or.cz
/
tika.git
/
search
commit
grep
author
committer
pickaxe
?
search:
re
summary
|
log
|
graphiclog1
|
graphiclog2
|
commit
|
commitdiff
|
tree
|
refs
|
edit
|
fork
first
·
prev
·
next
TIKA-113: Metadata (such as title) should not be part of content
2008-04-10
J
u
k
k
a
Lauri Zitt
i
ng
TIK
A
-113
:
Metadata
(
s
u
ch as t
i
tle)
s
h
ould not b
e
part
.
.
.
commit
|
commitdiff
|
tree
2008-04-08
Ju
k
ka Laur
i
Zitting
TIKA-
1
3
8: Ign
o
re HT
M
L sty
l
e and scrip
t
content
commit
|
commitdiff
|
tree
2008-03-28
Jukk
a
Laur
i
Zitting
TIKA-134: mvn package
does not produce packag
e
s for
.
.
.
commit
|
commitdiff
|
tree
2008-03-28
Jukka
L
auri Zitting
TIKA-123: Structured MS Off
i
c
e
pars
i
ng
commit
|
commitdiff
|
tree
2008-03-28
Jukka La
u
ri Zi
t
ti
n
g
T
IKA-123: Struct
u
r
ed M
S
Off
i
ce
parsing
commit
|
commitdiff
|
tree
2008-03-28
Jukka Lauri Z
i
tting
T
IKA-132: R
e
f
actor Exc
e
l extractor to par
s
e per
she
e
t
.
.
.
commit
|
commitdiff
|
tree
2008-03-27
Jukka Lauri Zi
t
ting
Re
f
ormatted NOTICE to
be
l
ess verbose
commit
|
commitdiff
|
tree
2008-03-27
Jukka Lau
r
i Zit
t
ing
TIKA-97: Tika GUI
commit
|
commitdiff
|
tree
2008-03-26
J
ukka Lauri
Z
itting
TIKA
-
1
32: Refactor E
x
cel ext
r
actor to parse per
sheet
.
.
.
commit
|
commitdiff
|
tree
2008-03-26
Ju
k
ka
L
auri
Zitting
TIKA-132:
Refactor
E
xcel extractor to parse
per sheet
.
.
.
commit
|
commitdiff
|
tree
2008-03-26
Jukka Lauri Zitting
TIKA
-
132:
R
efactor Excel extractor
t
o
pa
r
s
e per
she
e
t
.
.
.
commit
|
commitdiff
|
tree
2008-03-26
Jukka Lauri Zitting
TIKA
-
132: R
e
fa
c
t
or
Excel extractor to parse per sheet
.
.
.
commit
|
commitdiff
|
tree
2008-03-26
J
u
kka Lauri Zi
t
ting
T
I
K
A
-132: Refa
c
to
r
Excel extractor to p
a
rse per s
h
eet
.
.
.
commit
|
commitdiff
|
tree
2008-03-26
Jukka Lauri
Zitting
TIKA-132: Refactor Excel e
x
tractor t
o
pa
r
se
p
er s
h
eet
.
.
.
commit
|
commitdiff
|
tree
2008-03-26
Jukka Lauri Zit
t
ing
T
IKA-132: Refactor E
x
cel extractor to parse per sheet
.
.
.
commit
|
commitdiff
|
tree
2008-03-26
Jukka Lauri Zitting
TIKA-132: Refac
t
or Excel extractor
to parse per
s
h
eet
.
.
.
commit
|
commitdiff
|
tree
2008-03-26
J
ukka Lauri Zitting
TIKA-132: Re
f
a
ctor
E
x
c
e
l
extractor to
par
s
e per sheet
.
.
.
commit
|
commitdiff
|
tree
2008-03-26
Jukka Lauri Zitting
T
IKA-132:
R
efacto
r
Excel
e
xtr
a
ctor to
p
arse per
sheet
.
.
.
commit
|
commitdiff
|
tree
2008-03-26
J
ukka Lauri Zitting
T
IKA-
9
7: Tika GU
I
commit
|
commitdiff
|
tree
2008-03-26
Jukka Lauri Zitting
TIKA-133:
TeeConte
n
tHan
d
ler cons
t
ructor
s
hould use
.
.
.
commit
|
commitdiff
|
tree
2008-03-19
Jukka La
u
ri Zitting
TIKA-128: HTML
par
s
e
r
sho
u
ld pro
d
uce XHTML SAX event
s
commit
|
commitdiff
|
tree
2008-03-19
Jukk
a
Lauri
Z
it
t
ing
TIKA-131: Laz
y
XHTM
L
pr
e
fix
gen
e
r
a
tion
commit
|
commitdiff
|
tree
2008-03-18
Jukka Lauri
Z
itting
TIKA-1
3
0: se
l
f-o
r
-d
e
sce
n
d
an
t
axis
d
o
es
not match self
.
.
.
commit
|
commitdiff
|
tree
2008-03-18
J
uk
k
a La
u
ri Zittin
g
TIKA-129:
n
ode() suppor
t
f
o
r the streaming XPath utility
commit
|
commitdiff
|
tree
2008-03-09
Jukka Lauri Zitting
TIKA-127: Add support for Visio files
commit
|
commitdiff
|
tree
2008-03-09
J
ukka Laur
i
Zit
t
i
n
g
TIKA-126: A
d
d Parser
.
p
arse(InputStream, Metadata) fo
r
.
.
.
commit
|
commitdiff
|
tree
2008-03-09
Jukka Lauri
Z
itting
T
I
KA-123: Structur
e
d
MS Office parsing
commit
|
commitdiff
|
tree
2008-03-09
Jukka La
u
r
i Z
i
tti
n
g
TI
K
A-12
3
:
Str
u
cture
d
MS O
f
fice pars
i
ng
commit
|
commitdiff
|
tree
2008-02-19
Jukka L
a
uri
Zitt
i
ng
TIK
A
-123: St
r
uctured MS Office parsing
commit
|
commitdiff
|
tree
2008-02-19
Jukka L
a
uri Zitting
TIKA-122: Use
Co
m
mons IO
1
.
4
commit
|
commitdiff
|
tree
2008-02-18
Jukka Lauri Zitti
n
g
T
I
KA
-
123:
S
t
ru
c
tured MS Office pars
i
ng
commit
|
commitdiff
|
tree
2008-02-18
Ju
k
k
a Lauri Zit
t
ing
TIKA-123: Structured MS Office parsing
commit
|
commitdiff
|
tree
2008-02-18
Juk
k
a Lauri Zi
t
ting
TIKA-123: Struct
u
red
MS Off
i
ce parsi
n
g
commit
|
commitdiff
|
tree
2008-02-18
Ju
k
k
a
Lauri
Zi
t
ti
n
g
T
IKA-10
3
: Excel parsing ig
n
or
e
s ce
l
l fo
r
mating
commit
|
commitdiff
|
tree
2008-02-17
Jukka
Lauri Zitting
TIKA-123: St
r
uctured MS O
f
fice par
s
ing
commit
|
commitdiff
|
tree
2008-02-17
Juk
k
a
Lauri Zitt
i
ng
TIKA
-
1
2
3: Structured MS Offic
e
pars
i
ng
commit
|
commitdiff
|
tree
2008-02-17
Jukka Lauri Zitting
T
IKA
-
123: Structured
M
S Office p
a
rs
i
ng
commit
|
commitdiff
|
tree
2008-02-17
Jukka Lauri Zitting
TI
K
A-123: Stru
c
t
u
red MS O
f
fi
c
e parsing
commit
|
commitdiff
|
tree
2008-01-26
J
u
kka Lauri Zi
t
t
i
n
g
TIKA-118: Bounc
y
Castle binaries require US
e
xpo
r
ts
.
.
.
commit
|
commitdiff
|
tree
2008-01-25
Jukka
L
a
u
ri Zitting
TIKA-9
6
: Tika CLI
commit
|
commitdiff
|
tree
2008-01-22
Jukka Laur
i
Zi
t
ting
TI
K
A-97: Tika GUI
commit
|
commitdiff
|
tree
2008-01-22
Jukka
L
auri Zi
t
t
ing
TIKA-9
7
: Tika GUI
commit
|
commitdiff
|
tree
2008-01-22
Jukka Lauri Zitting
TIKA
-
97: Tika
GUI
commit
|
commitdiff
|
tree
2008-01-22
Jukka Laur
i
Z
i
tt
i
ng
TIKA-97:
Tika GUI
commit
|
commitdiff
|
tree
2008-01-21
Jukka Lauri Zitting
T
IKA-115: Tika p
a
c
kag
e
with all the
d
ependencies
commit
|
commitdiff
|
tree
2008-01-21
Jukka Lauri Zitt
i
ng
TIKA
-
117: Drop JDOM and Jaxen dependencies
commit
|
commitdiff
|
tree
2008-01-21
Jukka Lauri Zitting
TIKA-116: Streaming par
s
e
r
for Op
e
nDocum
e
nt files
commit
|
commitdiff
|
tree
2008-01-21
J
u
kka Lau
r
i Z
i
tting
TIKA-109: WordParse
r
fails on some
Word files
commit
|
commitdiff
|
tree
2008-01-20
Ju
k
ka La
u
ri Zitting
TIKA-
1
05:
Ex
c
el parser implementa
t
i
o
n based o
n
PO
I
.
.
.
commit
|
commitdiff
|
tree
2008-01-20
Jukka Laur
i
Z
i
t
t
ing
T
IK
A
-10
5
: Excel par
s
er implemen
t
ation bas
e
d on P
O
I
.
.
.
commit
|
commitdiff
|
tree
2008-01-20
Jukka Lauri
Zitting
T
I
KA-109: Wo
r
dParser
fails on
s
om
e
W
o
rd
f
iles
commit
|
commitdiff
|
tree
2007-12-31
Jukka Lauri Zit
t
ing
pom
.
x
m
l:
Updated trunk versi
o
n to
0
.
2-SNAPSHOT
commit
|
commitdiff
|
tree
2007-12-26
Jukka L
a
ur
i
Zitting
TI
K
A-111:
M
i
ssing lice
n
se headers
commit
|
commitdiff
|
tree
2007-12-26
Jukka L
a
ur
i
Zitting
TIKA-11
0
: Add KEYS
f
il
e
f
o
r Tik
a
commit
|
commitdiff
|
tree
2007-12-21
J
ukka Lauri Zitt
i
ng
TIKA-105 - Excel
parser implem
e
ntation base
d
on POI
.
.
.
commit
|
commitdiff
|
tree
2007-12-21
Jukka
Lauri Zitting
TIKA-106 -
Remove dependency on Jakarta
O
RO - use JDK
.
.
.
commit
|
commitdiff
|
tree
2007-12-21
Jukka Lauri Zitt
i
ng
TI
K
A-104 - Add utility met
h
ods
t
o throw IOExcep
t
ion
.
.
.
commit
|
commitdiff
|
tree
2007-12-21
Jukka Lauri Zittin
g
TIKA-1
0
7 - Remove use of assert
i
ons for argument checking
commit
|
commitdiff
|
tree
2007-11-25
Jukk
a
Lauri Zitting
TI
K
A-102 - Pa
r
se
r
implementations loading a la
r
ge amount
.
.
.
commit
|
commitdiff
|
tree
2007-11-25
Jukka Lauri Zitting
TIKA-102 - Parser imp
l
ementa
t
io
n
s lo
a
ding a larg
e
a
m
oun
t
.
.
.
commit
|
commitdiff
|
tree
2007-11-20
J
u
kka Lauri Zitting
TIKA-91: Add
proper attri
b
u
t
i
on
for code
f
rom
t
e
x
t
mining
.
org
commit
|
commitdiff
|
tree
2007-11-13
J
uk
k
a La
u
ri Zittin
g
T
IK
A
-100
-
Structur
e
d PDF parsing
commit
|
commitdiff
|
tree
2007-11-06
J
u
k
ka Lauri Zitt
i
ng
TIKA-8
7
-
M
imeTypes should allow modification
o
f
MIME
.
.
.
commit
|
commitdiff
|
tree
2007-11-05
J
u
kka Lauri
Zit
t
ing
TIKA-87 - MimeTypes sh
o
uld allow modifi
c
a
t
i
on of MIM
E
.
.
.
commit
|
commitdiff
|
tree
2007-11-04
Jukka
L
auri
Zitting
TIKA-87 - MimeTypes should allow mod
i
fi
c
ation
o
f MIME
.
.
.
commit
|
commitdiff
|
tree
2007-11-03
Jukka Lauri Zitt
i
ng
TI
K
A-87 - MimeTypes sh
o
uld allow
m
odifi
c
at
i
on
o
f MIME
.
.
.
commit
|
commitdiff
|
tree
2007-11-03
Jukka
L
auri Zitting
T
I
KA-87 - Mi
m
eT
y
pes should allow modification of MIME
.
.
.
commit
|
commitdiff
|
tree
2007-10-23
Jukka Lauri
Zit
t
ing
T
I
KA-87 - Mi
m
eTypes s
h
ould a
l
lo
w
m
o
dification
of MIM
E
.
.
.
commit
|
commitdiff
|
tree
2007-10-22
Jukka Lauri
Zitting
TIKA-85 - Add glob patterns from t
h
e ASF svn:eol-style
.
.
.
commit
|
commitdiff
|
tree
2007-10-22
Jukka Lauri Zitting
TIKA-84 - Add MimeTy
p
es
.
getMimeType(In
p
utStream)
commit
|
commitdiff
|
tree
2007-10-19
J
ukka Lauri Zitting
TI
K
A-84 - Add MimeTypes
.
getMimeType(Input
S
tream
)
commit
|
commitdiff
|
tree
2007-10-19
Jukka Lauri Zittin
g
TI
K
A-83 -
C
rea
t
e a or
g
.
apach
e
.
tik
a
.
sax pack
a
ge
f
or
.
.
.
commit
|
commitdiff
|
tree
2007-10-18
Jukka
L
auri Zit
t
i
ng
Se
t
svn:eol-style to native
commit
|
commitdiff
|
tree
2007-10-18
Jukka Lauri Zitti
n
g
Corre
c
t
i
n
d
e
n
t
i
ng (four s
p
a
c
es inst
e
ad of on
e
as the
.
.
.
commit
|
commitdiff
|
tree
2007-10-16
J
u
k
k
a L
a
u
ri Zitti
n
g
TIKA-71 - Remove Parse
r
Confi
g
and Parse
r
Factory
commit
|
commitdiff
|
tree
2007-10-15
J
ukka Lauri Zitt
i
ng
R
emove
d
an extra deb
u
g
print
commit
|
commitdiff
|
tree
2007-10-15
J
ukka
L
a
uri Zit
t
ing
TIKA-70 - Better
MIME i
n
fo
r
mation for
the
O
pen Document
.
.
.
commit
|
commitdiff
|
tree
2007-10-15
Juk
k
a
Lauri
Zitt
i
ng
T
I
KA-7
0
- Better MIME in
f
ormation f
o
r th
e
O
pen
D
ocume
n
t
.
.
.
commit
|
commitdiff
|
tree
2007-10-15
J
ukka Lauri
Zitti
n
g
TI
K
A-67 - Ad
d
an auto-det
e
c
t
ing Parser implem
e
ntation
commit
|
commitdiff
|
tree
2007-10-15
Jukka Lauri Zitting
T
I
KA-68 - Add dummy parser cl
a
sses
t
o be use
d
as sentinels
commit
|
commitdiff
|
tree
2007-10-14
Jukka L
a
uri Zitting
TIKA-66 - Use Java 5 features
in
o
rg
.
apache
.
tika
.
m
i
me
commit
|
commitdiff
|
tree
2007-10-14
Jukka Lauri Z
i
tti
n
g
TIKA-63 - A
v
oid mu
l
tiple passes ov
e
r
th
e
inpu
t
s
t
ream
.
.
.
commit
|
commitdiff
|
tree
2007-10-14
Jukka Lauri Zitti
n
g
TIKA-60 - Rename
Microsof
t
par
s
er
c
l
ass
e
s
commit
|
commitdiff
|
tree
2007-10-14
Jukka Lau
r
i
Z
it
t
ing
T
IKA-60 - Rename M
i
crosoft
p
ar
s
er c
l
as
s
e
s
commit
|
commitdiff
|
tree
2007-10-13
Jukka Lauri
Zitting
TIKA-62 - Use TikaConfig
.
g
etDefaultConfig()
inste
a
d
.
.
.
commit
|
commitdiff
|
tree
2007-10-12
Jukka Lauri Zitting
TIKA-57 - R
e
name or
g
.
apac
h
e
.
t
ika
.
ms to org
.
apache
.
tika
.
.
.
commit
|
commitdiff
|
tree
2007-10-12
J
u
k
ka Lauri Zi
t
ting
TIKA
-
53
-
XHTM
L
SAX events from
parsers
commit
|
commitdiff
|
tree
2007-10-10
Jukka Lauri
Zitting
T
I
K
A-4
0
- Tik
a
ne
e
ds to
sup
p
ort diverse character encodi
n
g
s
commit
|
commitdiff
|
tree
2007-10-08
Jukka Lauri Zitting
T
I
KA-41
- Resource files occur twice in
j
ar fil
e
commit
|
commitdiff
|
tree
2007-10-07
Jukka La
u
ri
Zitting
TIKA-45
-
R
eread
a
bleInputStream needs to be able to
.
.
.
commit
|
commitdiff
|
tree
2007-10-07
J
u
k
ka L
a
uri Zitting
TIKA-48 - Merg
e
MS Extracto
r
s and Parsers
commit
|
commitdiff
|
tree
2007-10-07
Jukka Lauri Zitti
n
g
TIK
A
-46 - Use Metada
t
a in Parser
commit
|
commitdiff
|
tree
2007-10-07
J
uk
k
a
Lauri Zitting
TIKA-46 - Use
Metadata in Parser
commit
|
commitdiff
|
tree
2007-10-07
Jukka Lauri Zitting
Set svn:e
o
l-style to nat
i
ve
commit
|
commitdiff
|
tree
2007-10-07
Jukka Lauri Zitting
T
I
KA-46 - Use M
e
tadata in Pa
r
ser
commit
|
commitdiff
|
tree
2007-10-07
Jukk
a
L
auri Zitting
TIKA-47 - Re
m
ove TikaLogger
commit
|
commitdiff
|
tree
2007-10-07
Jukka
L
auri
Zitting
TIKA-43
- Parser interfa
c
e
commit
|
commitdiff
|
tree
2007-10-07
Jukka Lauri
Z
itt
i
ng
TIK
A
-43 - Parser in
t
erface
commit
|
commitdiff
|
tree
2007-10-05
Jukka Lauri Zitt
i
ng
TI
K
A
-42 -
C
ontent class
nee
d
s
(
String
,
Stri
n
g,
S
trin
g
.
.
.
commit
|
commitdiff
|
tree
2007-10-05
J
u
kka Laur
i
Z
i
tting
TIKA-44 - Spaces fo
r
indentation
commit
|
commitdiff
|
tree
next