repo.or.cz / tika.git / search
TIKA-99: Support external parser programs
2008-07-09  Jukka Lauri Zitting  TIKA-54: Outlook msg parser
2008-07-01  Jukka Lauri Zitting  TIKA-146: Upgrade to POI 3.1
2008-07-01  Jukka Lauri Zitting  TIKA-146: Upgrade to POI 3.1
2008-06-18  Jukka Lauri Zitting  TIKA-145: Separate NOTICEs and LICENSEs for binary...
2008-06-18  Jukka Lauri Zitting  TIKA-144: Upgrade nekohtml dependency
2008-06-06  Jukka Lauri Zitting  TIKA-118: Bouncycastle binaries requires US exports...
2008-06-06  Jukka Lauri Zitting  typo
2008-06-06  Jukka Lauri Zitting  TIKA-115: Tika package with all the dependencies
2008-06-06  Jukka Lauri Zitting  TIKA-115: Tika package with all the dependencies
2008-06-06  Jukka Lauri Zitting  Modified svn:ignore to cover things like ".checkstyle".
2008-06-06  Jukka Lauri Zitting  TIKA-143: Add ParsingReader
2008-05-06  Jukka Lauri Zitting  Simplified log4j configuration for unit tests
2008-05-06  Jukka Lauri Zitting  TIKA-92: Image metadata extraction
2008-05-05  Jukka Lauri Zitting  TIKA-87: MimeTypes should allow modification of MIME...
2008-04-11  Jukka Lauri Zitting  TIKA-139: Add a composite parser
2008-04-10  Jukka Lauri Zitting  Replaced tabs with spaces in tika-mimetypes.xml
2008-04-10  Jukka Lauri Zitting  TIKA-113: Metadata (such as title) should not be part...
2008-04-08  Jukka Lauri Zitting  TIKA-138: Ignore HTML style and script content
2008-03-28  Jukka Lauri Zitting  TIKA-134: mvn package does not produce packages for...
2008-03-28  Jukka Lauri Zitting  TIKA-123: Structured MS Office parsing
2008-03-28  Jukka Lauri Zitting  TIKA-123: Structured MS Office parsing
2008-03-28  Jukka Lauri Zitting  TIKA-132: Refactor Excel extractor to parse per sheet...
2008-03-27  Jukka Lauri Zitting  Reformatted NOTICE to be less verbose
2008-03-27  Jukka Lauri Zitting  TIKA-97: Tika GUI
2008-03-26  Jukka Lauri Zitting  TIKA-132: Refactor Excel extractor to parse per sheet...
2008-03-26  Jukka Lauri Zitting  TIKA-132: Refactor Excel extractor to parse per sheet...
2008-03-26  Jukka Lauri Zitting  TIKA-132: Refactor Excel extractor to parse per sheet...
2008-03-26  Jukka Lauri Zitting  TIKA-132: Refactor Excel extractor to parse per sheet...
2008-03-26  Jukka Lauri Zitting  TIKA-132: Refactor Excel extractor to parse per sheet...
2008-03-26  Jukka Lauri Zitting  TIKA-132: Refactor Excel extractor to parse per sheet...
2008-03-26  Jukka Lauri Zitting  TIKA-132: Refactor Excel extractor to parse per sheet...
2008-03-26  Jukka Lauri Zitting  TIKA-132: Refactor Excel extractor to parse per sheet...
2008-03-26  Jukka Lauri Zitting  TIKA-132: Refactor Excel extractor to parse per sheet...
2008-03-26  Jukka Lauri Zitting  TIKA-132: Refactor Excel extractor to parse per sheet...
2008-03-26  Jukka Lauri Zitting  TIKA-97: Tika GUI
2008-03-26  Jukka Lauri Zitting  TIKA-133: TeeContentHandler constructor should use...
2008-03-19  Jukka Lauri Zitting  TIKA-128: HTML parser should produce XHTML SAX events
2008-03-19  Jukka Lauri Zitting  TIKA-131: Lazy XHTML prefix generation
2008-03-18  Jukka Lauri Zitting  TIKA-130: self-or-descendant axis does not match self...
2008-03-18  Jukka Lauri Zitting  TIKA-129: node() support for the streaming XPath utility
2008-03-09  Jukka Lauri Zitting  TIKA-127: Add support for Visio files
2008-03-09  Jukka Lauri Zitting  TIKA-126: Add Parser.parse(InputStream, Metadata) for...
2008-03-09  Jukka Lauri Zitting  TIKA-123: Structured MS Office parsing
2008-03-09  Jukka Lauri Zitting  TIKA-123: Structured MS Office parsing
2008-02-19  Jukka Lauri Zitting  TIKA-123: Structured MS Office parsing
2008-02-19  Jukka Lauri Zitting  TIKA-122: Use Commons IO 1.4
2008-02-18  Jukka Lauri Zitting  TIKA-123: Structured MS Office parsing
2008-02-18  Jukka Lauri Zitting  TIKA-123: Structured MS Office parsing
2008-02-18  Jukka Lauri Zitting  TIKA-123: Structured MS Office parsing
2008-02-18  Jukka Lauri Zitting  TIKA-103: Excel parsing ignores cell formating
2008-02-17  Jukka Lauri Zitting  TIKA-123: Structured MS Office parsing
2008-02-17  Jukka Lauri Zitting  TIKA-123: Structured MS Office parsing
2008-02-17  Jukka Lauri Zitting  TIKA-123: Structured MS Office parsing
2008-02-17  Jukka Lauri Zitting  TIKA-123: Structured MS Office parsing
2008-01-26  Jukka Lauri Zitting  TIKA-118: Bouncy Castle binaries require US exports...
2008-01-25  Jukka Lauri Zitting  TIKA-96: Tika CLI
2008-01-22  Jukka Lauri Zitting  TIKA-97: Tika GUI
2008-01-22  Jukka Lauri Zitting  TIKA-97: Tika GUI
2008-01-22  Jukka Lauri Zitting  TIKA-97: Tika GUI
2008-01-22  Jukka Lauri Zitting  TIKA-97: Tika GUI
2008-01-21  Jukka Lauri Zitting  TIKA-115: Tika package with all the dependencies
2008-01-21  Jukka Lauri Zitting  TIKA-117: Drop JDOM and Jaxen dependencies
2008-01-21  Jukka Lauri Zitting  TIKA-116: Streaming parser for OpenDocument files
2008-01-21  Jukka Lauri Zitting  TIKA-109: WordParser fails on some Word files
2008-01-20  Jukka Lauri Zitting  TIKA-105: Excel parser implementation based on POI...
2008-01-20  Jukka Lauri Zitting  TIKA-105: Excel parser implementation based on POI...
2008-01-20  Jukka Lauri Zitting  TIKA-109: WordParser fails on some Word files
2007-12-31  Jukka Lauri Zitting  pom.xml: Updated trunk version to 0.2-SNAPSHOT
2007-12-26  Jukka Lauri Zitting  TIKA-111: Missing license headers
2007-12-26  Jukka Lauri Zitting  TIKA-110: Add KEYS file for Tika
2007-12-21  Jukka Lauri Zitting  TIKA-105 - Excel parser implementation based on POI...
2007-12-21  Jukka Lauri Zitting  TIKA-106 - Remove dependency on Jakarta ORO - use JDK...
2007-12-21  Jukka Lauri Zitting  TIKA-104 - Add utility methods to throw IOException...
2007-12-21  Jukka Lauri Zitting  TIKA-107 - Remove use of assertions for argument checking
2007-11-25  Jukka Lauri Zitting  TIKA-102 - Parser implementations loading a large amount...
2007-11-25  Jukka Lauri Zitting  TIKA-102 - Parser implementations loading a large amount...
2007-11-20  Jukka Lauri Zitting  TIKA-91: Add proper attribution for code from textmining.org
2007-11-13  Jukka Lauri Zitting  TIKA-100 - Structured PDF parsing
2007-11-06  Jukka Lauri Zitting  TIKA-87 - MimeTypes should allow modification of MIME...
2007-11-05  Jukka Lauri Zitting  TIKA-87 - MimeTypes should allow modification of MIME...
2007-11-04  Jukka Lauri Zitting  TIKA-87 - MimeTypes should allow modification of MIME...
2007-11-03  Jukka Lauri Zitting  TIKA-87 - MimeTypes should allow modification of MIME...
2007-11-03  Jukka Lauri Zitting  TIKA-87 - MimeTypes should allow modification of MIME...
2007-10-23  Jukka Lauri Zitting  TIKA-87 - MimeTypes should allow modification of MIME...
2007-10-22  Jukka Lauri Zitting  TIKA-85 - Add glob patterns from the ASF svn:eol-style...
2007-10-22  Jukka Lauri Zitting  TIKA-84 - Add MimeTypes.getMimeType(InputStream)
2007-10-19  Jukka Lauri Zitting  TIKA-84 - Add MimeTypes.getMimeType(InputStream)
2007-10-19  Jukka Lauri Zitting  TIKA-83 - Create a org.apache.tika.sax package for...
2007-10-18  Jukka Lauri Zitting  Set svn:eol-style to native
2007-10-18  Jukka Lauri Zitting  Correct indenting (four spaces instead of one as the...
2007-10-16  Jukka Lauri Zitting  TIKA-71 - Remove ParserConfig and ParserFactory
2007-10-15  Jukka Lauri Zitting  Removed an extra debug print
2007-10-15  Jukka Lauri Zitting  TIKA-70 - Better MIME information for the Open Document...
2007-10-15  Jukka Lauri Zitting  TIKA-70 - Better MIME information for the Open Document...
2007-10-15  Jukka Lauri Zitting  TIKA-67 - Add an auto-detecting Parser implementation
2007-10-15  Jukka Lauri Zitting  TIKA-68 - Add dummy parser classes to be used as sentinels
2007-10-14  Jukka Lauri Zitting  TIKA-66 - Use Java 5 features in org.apache.tika.mime
2007-10-14  Jukka Lauri Zitting  TIKA-63 - Avoid multiple passes over the input stream...
2007-10-14  Jukka Lauri Zitting  TIKA-60 - Rename Microsoft parser classes
2007-10-14  Jukka Lauri Zitting  TIKA-60 - Rename Microsoft parser classes