repo.or.cz / tika.git / search
TIKA-139: Add a composite parser
2008-04-11
Jukka Lauri Zitting
TIKA-139: Add a composite parser
2008-04-10
Jukka Lauri Zitting
Replaced tabs with spaces in tika-mimetypes.xml
2008-04-10
Jukka Lauri Zitting
TIKA-113: Metadata (such as title) should not be part...
2008-04-08
Jukka Lauri Zitting
TIKA-138: Ignore HTML style and script content
2008-03-28
Jukka Lauri Zitting
TIKA-134: mvn package does not produce packages for...
2008-03-28
Jukka Lauri Zitting
TIKA-123: Structured MS Office parsing
2008-03-28
Jukka Lauri Zitting
TIKA-123: Structured MS Office parsing
2008-03-28
Jukka Lauri Zitting
TIKA-132: Refactor Excel extractor to parse per sheet...
2008-03-27
Jukka Lauri Zitting
Reformatted NOTICE to be less verbose
2008-03-27
Jukka Lauri Zitting
TIKA-97: Tika GUI
2008-03-26
Jukka Lauri Zitting
TIKA-132: Refactor Excel extractor to parse per sheet...
2008-03-26
Jukka Lauri Zitting
TIKA-132: Refactor Excel extractor to parse per sheet...
2008-03-26
Jukka Lauri Zitting
TIKA-132: Refactor Excel extractor to parse per sheet...
2008-03-26
Jukka Lauri Zitting
TIKA-132: Refactor Excel extractor to parse per sheet...
2008-03-26
Jukka Lauri Zitting
TIKA-132: Refactor Excel extractor to parse per sheet...
2008-03-26
Jukka Lauri Zitting
TIKA-132: Refactor Excel extractor to parse per sheet...
2008-03-26
Jukka Lauri Zitting
TIKA-132: Refactor Excel extractor to parse per sheet...
2008-03-26
Jukka Lauri Zitting
TIKA-132: Refactor Excel extractor to parse per sheet...
2008-03-26
Jukka Lauri Zitting
TIKA-132: Refactor Excel extractor to parse per sheet...
2008-03-26
Jukka Lauri Zitting
TIKA-132: Refactor Excel extractor to parse per sheet...
2008-03-26
Jukka Lauri Zitting
TIKA-97: Tika GUI
2008-03-26
Jukka Lauri Zitting
TIKA-133: TeeContentHandler constructor should use...
2008-03-19
Jukka Lauri Zitting
TIKA-128: HTML parser should produce XHTML SAX events
2008-03-19
Jukka Lauri Zitting
TIKA-131: Lazy XHTML prefix generation
2008-03-18
Jukka Lauri Zitting
TIKA-130: self-or-descendant axis does not match self...
2008-03-18
Jukka Lauri Zitting
TIKA-129: node() support for the streaming XPath utility
2008-03-09
Jukka Lauri Zitting
TIKA-127: Add support for Visio files
2008-03-09
Jukka Lauri Zitting
TIKA-126: Add Parser.parse(InputStream, Metadata) for...
2008-03-09
Jukka Lauri Zitting
TIKA-123: Structured MS Office parsing
2008-03-09
Jukka Lauri Zitting
TIKA-123: Structured MS Office parsing
2008-02-19
Jukka Lauri Zitting
TIKA-123: Structured MS Office parsing
2008-02-19
Jukka Lauri Zitting
TIKA-122: Use Commons IO 1.4
2008-02-18
Jukka Lauri Zitting
TIKA-123: Structured MS Office parsing
2008-02-18
Jukka Lauri Zitting
TIKA-123: Structured MS Office parsing
2008-02-18
Jukka Lauri Zitting
TIKA-123: Structured MS Office parsing
2008-02-18
Jukka Lauri Zitting
TIKA-103: Excel parsing ignores cell formating
2008-02-17
Jukka Lauri Zitting
TIKA-123: Structured MS Office parsing
2008-02-17
Jukka Lauri Zitting
TIKA-123: Structured MS Office parsing
2008-02-17
Jukka Lauri Zitting
TIKA-123: Structured MS Office parsing
2008-02-17
Jukka Lauri Zitting
TIKA-123: Structured MS Office parsing
2008-01-26
Jukka Lauri Zitting
TIKA-118: Bouncy Castle binaries require US exports...
2008-01-25
Jukka Lauri Zitting
TIKA-96: Tika CLI
2008-01-22
Jukka Lauri Zitting
TIKA-97: Tika GUI
2008-01-22
Jukka Lauri Zitting
TIKA-97: Tika GUI
2008-01-22
Jukka Lauri Zitting
TIKA-97: Tika GUI
2008-01-22
Jukka Lauri Zitting
TIKA-97: Tika GUI
2008-01-21
Jukka Lauri Zitting
TIKA-115: Tika package with all the dependencies
2008-01-21
Jukka Lauri Zitting
TIKA-117: Drop JDOM and Jaxen dependencies
2008-01-21
Jukka Lauri Zitting
TIKA-116: Streaming parser for OpenDocument files
2008-01-21
Jukka Lauri Zitting
TIKA-109: WordParser fails on some Word files
2008-01-20
Jukka Lauri Zitting
TIKA-105: Excel parser implementation based on POI...
2008-01-20
Jukka Lauri Zitting
TIKA-105: Excel parser implementation based on POI...
2008-01-20
Jukka Lauri Zitting
TIKA-109: WordParser fails on some Word files
2007-12-31
Jukka Lauri Zitting
pom.xml: Updated trunk version to 0.2-SNAPSHOT
2007-12-26
Jukka Lauri Zitting
TIKA-111: Missing license headers
2007-12-26
Jukka Lauri Zitting
TIKA-110: Add KEYS file for Tika
2007-12-21
Jukka Lauri Zitting
TIKA-105 - Excel parser implementation based on POI...
2007-12-21
Jukka Lauri Zitting
TIKA-106 - Remove dependency on Jakarta ORO - use JDK...
2007-12-21
Jukka Lauri Zitting
TIKA-104 - Add utility methods to throw IOException...
2007-12-21
Jukka Lauri Zitting
TIKA-107 - Remove use of assertions for argument checking
2007-11-25
Jukka Lauri Zitting
TIKA-102 - Parser implementations loading a large amount...
2007-11-25
Jukka Lauri Zitting
TIKA-102 - Parser implementations loading a large amount...
2007-11-20
Jukka Lauri Zitting
TIKA-91: Add proper attribution for code from textmining.org
2007-11-13
Jukka Lauri Zitting
TIKA-100 - Structured PDF parsing
2007-11-06
Jukka Lauri Zitting
TIKA-87 - MimeTypes should allow modification of MIME...
2007-11-05
Jukka Lauri Zitting
TIKA-87 - MimeTypes should allow modification of MIME...
2007-11-04
Jukka Lauri Zitting
TIKA-87 - MimeTypes should allow modification of MIME...
2007-11-03
Jukka Lauri Zitting
TIKA-87 - MimeTypes should allow modification of MIME...
2007-11-03
Jukka Lauri Zitting
TIKA-87 - MimeTypes should allow modification of MIME...
2007-10-23
Jukka Lauri Zitting
TIKA-87 - MimeTypes should allow modification of MIME...
2007-10-22
Jukka Lauri Zitting
TIKA-85 - Add glob patterns from the ASF svn:eol-style...
2007-10-22
Jukka Lauri Zitting
TIKA-84 - Add MimeTypes.getMimeType(InputStream)
2007-10-19
Jukka Lauri Zitting
TIKA-84 - Add MimeTypes.getMimeType(InputStream)
2007-10-19
Jukka Lauri Zitting
TIKA-83 - Create a org.apache.tika.sax package for...
2007-10-18
Jukka Lauri Zitting
Set svn:eol-style to native
2007-10-18
Jukka Lauri Zitting
Correct indenting (four spaces instead of one as the...
2007-10-16
Jukka Lauri Zitting
TIKA-71 - Remove ParserConfig and ParserFactory
2007-10-15
Jukka Lauri Zitting
Removed an extra debug print
2007-10-15
Jukka Lauri Zitting
TIKA-70 - Better MIME information for the Open Document...
2007-10-15
Jukka Lauri Zitting
TIKA-70 - Better MIME information for the Open Document...
2007-10-15
Jukka Lauri Zitting
TIKA-67 - Add an auto-detecting Parser implementation
2007-10-15
Jukka Lauri Zitting
TIKA-68 - Add dummy parser classes to be used as sentinels
2007-10-14
Jukka Lauri Zitting
TIKA-66 - Use Java 5 features in org.apache.tika.mime
2007-10-14
Jukka Lauri Zitting
TIKA-63 - Avoid multiple passes over the input stream...
2007-10-14
Jukka Lauri Zitting
TIKA-60 - Rename Microsoft parser classes
2007-10-14
Jukka Lauri Zitting
TIKA-60 - Rename Microsoft parser classes
2007-10-13
Jukka Lauri Zitting
TIKA-62 - Use TikaConfig.getDefaultConfig() instead...
2007-10-12
Jukka Lauri Zitting
TIKA-57 - Rename org.apache.tika.ms to org.apache.tika...
2007-10-12
Jukka Lauri Zitting
TIKA-53 - XHTML SAX events from parsers
2007-10-10
Jukka Lauri Zitting
TIKA-40 - Tika needs to support diverse character encodings
2007-10-08
Jukka Lauri Zitting
TIKA-41 - Resource files occur twice in jar file
2007-10-07
Jukka Lauri Zitting
TIKA-45 - RereadableInputStream needs to be able to...
2007-10-07
Jukka Lauri Zitting
TIKA-48 - Merge MS Extractors and Parsers
2007-10-07
Jukka Lauri Zitting
TIKA-46 - Use Metadata in Parser
2007-10-07
Jukka Lauri Zitting
TIKA-46 - Use Metadata in Parser
2007-10-07
Jukka Lauri Zitting
Set svn:eol-style to native
2007-10-07
Jukka Lauri Zitting
TIKA-46 - Use Metadata in Parser
2007-10-07
Jukka Lauri Zitting
TIKA-47 - Remove TikaLogger
2007-10-07
Jukka Lauri Zitting
TIKA-43 - Parser interface
2007-10-07
Jukka Lauri Zitting
TIKA-43 - Parser interface