repo.or.cz
/
tika.git
/
search
commit
grep
author
committer
pickaxe
?
search:
re
summary
|
log
|
graphiclog1
|
graphiclog2
|
commit
|
commitdiff
|
tree
|
refs
|
edit
|
fork
first
·
prev
·
next
TIKA-146: Upgrade to POI 3.1
2008-07-01
Ju
k
ka Lauri Zitting
T
IKA-1
4
6:
U
pgrade to PO
I
3
.
1
commit
|
commitdiff
|
tree
2008-06-18
Jukka Lauri Zitt
i
ng
TIK
A
-
1
45: S
e
parat
e
NOTICEs and LICENSEs for bina
r
y
.
.
.
commit
|
commitdiff
|
tree
2008-06-18
Jukka
L
a
u
ri Z
i
tti
n
g
TIKA-14
4
:
Upgrade nek
o
html depen
d
e
n
cy
commit
|
commitdiff
|
tree
2008-06-06
Jukka La
u
ri Zitting
TIK
A
-118
:
Bouncyca
s
tle binaries requires US export
s
.
.
.
commit
|
commitdiff
|
tree
2008-06-06
Jukka
Lauri Z
i
tting
typo
commit
|
commitdiff
|
tree
2008-06-06
Ju
k
ka Lauri Z
i
tting
TIKA-115: Tika package with
a
ll the dependen
c
ies
commit
|
commitdiff
|
tree
2008-06-06
J
u
kka Lauri Zitti
n
g
TIKA-115: T
i
ka pac
k
age
w
i
th a
l
l th
e
dependencies
commit
|
commitdiff
|
tree
2008-06-06
Jukka Lauri Zitting
Modif
i
e
d
svn:ignore to cover
things
l
ike
"
.
chec
k
style"
.
commit
|
commitdiff
|
tree
2008-06-06
J
u
kk
a
Lauri Zitting
TIKA-143
:
Add
P
a
r
singRead
e
r
commit
|
commitdiff
|
tree
2008-05-06
Jukk
a
Lauri Zitting
Si
m
p
lified log4j configu
r
ation for
u
n
it t
e
sts
commit
|
commitdiff
|
tree
2008-05-06
Jukka L
a
uri Zitting
TIKA
-
92: Image metadata e
x
traction
commit
|
commitdiff
|
tree
2008-05-05
Jukk
a
Lauri Zit
t
ing
TIKA-87: M
i
meTypes sh
o
ul
d
all
o
w modification o
f
MIME
.
.
.
commit
|
commitdiff
|
tree
2008-04-11
J
u
kka Lauri Zitting
T
I
KA-139: Add a
composite parser
commit
|
commitdiff
|
tree
2008-04-10
Jukka L
a
uri
Zitting
Replaced tabs with space
s
in tika-mimetypes
.
xm
l
commit
|
commitdiff
|
tree
2008-04-10
Jukka
Lauri Zitting
TIKA-113
:
Me
t
ada
t
a (such as title)
s
hould
n
ot
b
e part
.
.
.
commit
|
commitdiff
|
tree
2008-04-08
Ju
k
ka Lauri Zit
t
i
n
g
TI
K
A-138
:
Ig
n
ore HTML style
and script
content
commit
|
commitdiff
|
tree
2008-03-28
J
ukka Lauri Z
i
tting
TIKA-134: m
v
n
pa
c
kage does not pro
d
uc
e
pac
k
ages for
.
.
.
commit
|
commitdiff
|
tree
2008-03-28
Ju
k
ka La
u
ri Zit
t
ing
TIKA-1
2
3: Structure
d
MS
Offic
e
par
s
ing
commit
|
commitdiff
|
tree
2008-03-28
Jukka Lauri Zitt
i
ng
TIKA-1
2
3: Structured MS
O
f
f
ice parsing
commit
|
commitdiff
|
tree
2008-03-28
Jukka Lauri Zitting
TIKA-132:
Ref
a
ctor Exce
l
extractor to parse
pe
r
sh
e
et
.
.
.
commit
|
commitdiff
|
tree
2008-03-27
J
uk
k
a Lauri
Zittin
g
Refor
m
att
e
d NOTICE to be less verbose
commit
|
commitdiff
|
tree
2008-03-27
Jukka Lauri Zi
t
ting
T
I
K
A
-97:
Tika
GUI
commit
|
commitdiff
|
tree
2008-03-26
Juk
k
a Lauri Zitting
TIKA-132: Ref
a
ctor Excel extra
c
tor to parse
p
er shee
t
.
.
.
commit
|
commitdiff
|
tree
2008-03-26
J
u
kka Lauri Zitting
TIKA-132
:
Refactor Excel e
x
t
ractor to
p
arse per sheet
.
.
.
commit
|
commitdiff
|
tree
2008-03-26
Jukka Lau
r
i Zit
t
ing
TIKA-132: Ref
a
ctor Exce
l
extract
o
r to parse
p
er s
h
eet
.
.
.
commit
|
commitdiff
|
tree
2008-03-26
Jukka Laur
i
Zi
t
ting
TIKA-132:
Ref
a
ctor Excel ex
t
ractor
to parse per sheet
.
.
.
commit
|
commitdiff
|
tree
2008-03-26
Juk
k
a
La
u
ri Zitt
i
ng
T
I
KA-1
3
2
: Refactor Excel extractor to parse per sheet
.
.
.
commit
|
commitdiff
|
tree
2008-03-26
J
ukk
a
Lauri
Zitting
TIKA-132: Refactor Excel extract
o
r to pa
r
se per sheet
.
.
.
commit
|
commitdiff
|
tree
2008-03-26
Jukka
L
auri Zitti
n
g
TIKA-
1
32: Refactor Ex
c
el
e
xtractor to
p
arse per
s
heet
.
.
.
commit
|
commitdiff
|
tree
2008-03-26
Jukka Lauri
Z
itti
n
g
T
I
KA-132: Refactor Excel extractor to
p
a
r
se
p
er
sheet
.
.
.
commit
|
commitdiff
|
tree
2008-03-26
Jukka
Lauri Z
i
tting
TIKA-1
3
2: Ref
a
c
tor Excel e
x
tractor t
o
parse per
sheet
.
.
.
commit
|
commitdiff
|
tree
2008-03-26
Jukk
a
La
u
ri
Z
i
tting
TI
K
A-1
3
2
:
R
efacto
r
Excel extracto
r
to pars
e
per sh
e
et
.
.
.
commit
|
commitdiff
|
tree
2008-03-26
Juk
k
a Lauri Z
i
ttin
g
TI
K
A
-
97: Tika
GUI
commit
|
commitdiff
|
tree
2008-03-26
J
u
kka L
a
u
ri
Z
itti
n
g
TIKA-133: TeeContentHandler con
s
tructor
should
u
se
.
.
.
commit
|
commitdiff
|
tree
2008-03-19
J
ukka La
u
ri Z
i
tting
TI
K
A-128: HTML parser should
p
roduce XHTML S
A
X events
commit
|
commitdiff
|
tree
2008-03-19
Jukka Lauri Z
i
tting
TI
K
A-
1
31: Lazy X
H
TML pre
f
ix gene
r
a
tion
commit
|
commitdiff
|
tree
2008-03-18
Ju
k
ka Lauri Zitting
T
IKA-130: s
e
l
f
-or-descendant
a
xis do
e
s not mat
c
h self
.
.
.
commit
|
commitdiff
|
tree
2008-03-18
Juk
k
a
Lauri Zitting
TIKA-129: node() s
u
p
port fo
r
th
e
streami
n
g
XPath uti
l
ity
commit
|
commitdiff
|
tree
2008-03-09
Jukka Lauri Zitting
TIKA-1
2
7: Add su
p
port for Visio files
commit
|
commitdiff
|
tree
2008-03-09
Jukka Lau
r
i
Z
itting
TIKA-12
6
:
Add Parser
.
parse(InputS
t
re
a
m, Metada
t
a)
f
o
r
.
.
.
commit
|
commitdiff
|
tree
2008-03-09
Ju
k
ka Lauri Zi
t
ting
TIKA-123
:
Structured MS Offi
c
e
parsin
g
commit
|
commitdiff
|
tree
2008-03-09
Jukk
a
Laur
i
Z
i
t
t
ing
TIKA-123: Str
u
ctured MS Off
i
c
e
p
a
rsing
commit
|
commitdiff
|
tree
2008-02-19
Jukka L
a
uri Zitti
n
g
TIKA-12
3
: Struct
u
red MS Offic
e
parsing
commit
|
commitdiff
|
tree
2008-02-19
Jukka La
u
ri
Z
itting
TIKA-122
:
Use Commo
n
s IO 1
.
4
commit
|
commitdiff
|
tree
2008-02-18
Ju
k
ka Lauri
Z
itting
T
I
KA-123: Structure
d
M
S
Office parsing
commit
|
commitdiff
|
tree
2008-02-18
J
u
kka Lauri Zitting
TIKA
-
123: Struc
t
ure
d
MS Of
f
ice parsing
commit
|
commitdiff
|
tree
2008-02-18
Jukka Lauri Zitting
TIKA-123: S
t
ruct
u
red MS Office
parsing
commit
|
commitdiff
|
tree
2008-02-18
J
u
kka
Lauri Zitting
TIKA-103:
Excel parsing ignor
e
s
cell for
m
a
t
i
ng
commit
|
commitdiff
|
tree
2008-02-17
Jukka Lau
r
i
Zit
t
ing
TI
K
A
-
123: Stru
c
t
u
red MS Office parsing
commit
|
commitdiff
|
tree
2008-02-17
Ju
k
ka Lauri
Z
ittin
g
TIKA-1
2
3: Str
u
ctur
e
d MS Office parsing
commit
|
commitdiff
|
tree
2008-02-17
Jukka Laur
i
Zitting
TIKA-123: Struct
u
red MS Office parsing
commit
|
commitdiff
|
tree
2008-02-17
Jukka La
u
ri Zi
t
ti
n
g
TIKA-123: Structured MS Offi
c
e
parsing
commit
|
commitdiff
|
tree
2008-01-26
J
u
kka L
a
uri Zitting
TIKA-118: Bouncy Cast
l
e b
i
naries
require US expo
r
t
s
.
.
.
commit
|
commitdiff
|
tree
2008-01-25
Jukka Lauri Z
i
tti
n
g
TIKA-96: Ti
k
a CLI
commit
|
commitdiff
|
tree
2008-01-22
Jukka Lau
r
i Z
i
tting
TIKA
-
97: Tika GUI
commit
|
commitdiff
|
tree
2008-01-22
Jukka Lauri Zitt
i
ng
TIKA-97:
Tika GUI
commit
|
commitdiff
|
tree
2008-01-22
J
u
kka
L
a
u
ri Zitting
T
IKA-97
:
Tika
GUI
commit
|
commitdiff
|
tree
2008-01-22
Jukka
L
auri Zitting
TIK
A
-
97:
T
ika G
U
I
commit
|
commitdiff
|
tree
2008-01-21
Jukka Lauri Zitt
i
ng
T
IKA-115: Tika package with all t
h
e dependencies
commit
|
commitdiff
|
tree
2008-01-21
Ju
k
ka Lauri Zitti
n
g
TIKA-117: Dr
o
p JDOM and Jaxe
n
d
e
p
e
ndencies
commit
|
commitdiff
|
tree
2008-01-21
Jukka Lauri
Z
itt
i
ng
TIK
A
-116
:
Strea
m
ing pa
r
ser for OpenDocument fil
e
s
commit
|
commitdiff
|
tree
2008-01-21
Ju
k
ka L
a
uri
Zitt
i
n
g
TIKA-109
:
WordP
a
rse
r
fa
i
ls o
n
some Word files
commit
|
commitdiff
|
tree
2008-01-20
Jukka Lauri Zitting
T
I
K
A-105: Excel parser imp
l
e
mentation
b
ased on
PO
I
.
.
.
commit
|
commitdiff
|
tree
2008-01-20
J
u
kka La
u
ri
Z
i
t
ting
TIKA-105: Excel parser imple
m
entation based
o
n POI
.
.
.
commit
|
commitdiff
|
tree
2008-01-20
Juk
k
a Lauri Z
i
tting
TIK
A
-109: Wor
d
P
a
rser f
a
ils
on some Word files
commit
|
commitdiff
|
tree
2007-12-31
Jukka Lauri Zitt
i
ng
pom
.
xml: Updated tru
n
k v
e
rsi
o
n to 0
.
2-SNAPSHOT
commit
|
commitdiff
|
tree
2007-12-26
Ju
k
ka
L
auri Zitting
TIKA-111: Missing
licens
e
headers
commit
|
commitdiff
|
tree
2007-12-26
Jukk
a
Laur
i
Zi
t
t
ing
T
I
KA-110: Add
K
EYS file
f
or Tika
commit
|
commitdiff
|
tree
2007-12-21
Jukka
L
a
u
ri Zi
t
ting
TIKA-105 - Excel parser implementation
b
a
sed
o
n
P
OI
.
.
.
commit
|
commitdiff
|
tree
2007-12-21
J
u
kka Lauri Zitting
TI
K
A-106 -
Remov
e
dependency
on Jakarta O
R
O
-
use JDK
.
.
.
commit
|
commitdiff
|
tree
2007-12-21
Jukka
L
au
r
i Zitting
TIKA-
1
04
- Add ut
i
li
t
y methods to t
h
row IOException
.
.
.
commit
|
commitdiff
|
tree
2007-12-21
Jukka L
a
u
ri Zi
t
ti
n
g
TIKA
-
107
-
Remov
e
us
e
of assertions
for argu
m
ent checking
commit
|
commitdiff
|
tree
2007-11-25
J
ukka Lauri Zitting
TIKA-102 -
P
arser implementations
loadin
g
a large amoun
t
.
.
.
commit
|
commitdiff
|
tree
2007-11-25
Ju
k
ka
Lauri Zit
t
ing
TIKA-
1
02 - Par
s
er implemen
t
ations
l
oading a la
r
g
e
amount
.
.
.
commit
|
commitdiff
|
tree
2007-11-20
Jukka Laur
i
Z
i
tting
TI
K
A-9
1
: A
d
d
p
rope
r
attribution for code from textmin
i
ng
.
or
g
commit
|
commitdiff
|
tree
2007-11-13
Jukka
L
a
u
ri
Z
itt
i
n
g
TIKA-100 - Structured
P
D
F parsing
commit
|
commitdiff
|
tree
2007-11-06
Jukka L
a
uri
Z
itt
i
n
g
TIKA
-
87 - Mim
e
Types sh
o
u
ld al
l
ow modificatio
n
of MIME
.
.
.
commit
|
commitdiff
|
tree
2007-11-05
J
u
kka Lauri Zitting
TIKA-87 -
MimeTypes should al
l
o
w
mod
i
fi
c
a
t
ion
o
f
MIME
.
.
.
commit
|
commitdiff
|
tree
2007-11-04
Jukka Lauri Zittin
g
TI
K
A-87 - MimeTypes s
h
ould allow modifi
c
ation of MIME
.
.
.
commit
|
commitdiff
|
tree
2007-11-03
Jukka Lauri Zitting
TIKA-8
7
-
M
im
e
Types sh
o
uld allow
m
odi
f
ication of MIME
.
.
.
commit
|
commitdiff
|
tree
2007-11-03
Jukka L
a
uri
Z
i
t
ting
TIKA-87 - MimeTypes should allow m
o
dification of MIME
.
.
.
commit
|
commitdiff
|
tree
2007-10-23
Jukk
a
Lauri
Z
itting
TIKA-
8
7
- Mi
m
eTypes should allo
w
m
odification of MIME
.
.
.
commit
|
commitdiff
|
tree
2007-10-22
J
ukk
a
Lauri Zitting
T
I
KA-
8
5 - Add glob patter
n
s from t
h
e
A
SF
s
v
n
:eo
l
-style
.
.
.
commit
|
commitdiff
|
tree
2007-10-22
J
ukka
L
a
uri Zitting
TIK
A
-84 -
A
dd M
i
meTypes
.
getMimeType(InputStream)
commit
|
commitdiff
|
tree
2007-10-19
Juk
k
a
L
a
uri Zitting
TIKA-84 - Add M
i
meTypes
.
getMi
m
eType(InputS
t
ream)
commit
|
commitdiff
|
tree
2007-10-19
Jukka Lauri Zitting
TIKA-83 - Create a org
.
apa
c
he
.
tika
.
sax pac
k
age for
.
.
.
commit
|
commitdiff
|
tree
2007-10-18
Jukka
L
auri Zitting
Set svn
:
eo
l
-s
t
yle to native
commit
|
commitdiff
|
tree
2007-10-18
Jukka La
u
ri Zitting
Correct indenting (four spac
e
s instead
of one as t
h
e
.
.
.
commit
|
commitdiff
|
tree
2007-10-16
J
ukk
a
Lauri Z
i
tt
i
ng
TIKA-71 -
R
emove Pars
e
rCo
n
f
i
g
a
nd ParserFac
t
ory
commit
|
commitdiff
|
tree
2007-10-15
Jukka Lauri
Zitti
n
g
Removed
a
n
e
xt
r
a
d
e
b
u
g
p
r
int
commit
|
commitdiff
|
tree
2007-10-15
Jukka
Laur
i
Zi
t
ti
n
g
TIKA-70 - Better MIME inf
o
r
m
ation
for
t
he
Open Do
c
u
me
n
t
.
.
.
commit
|
commitdiff
|
tree
2007-10-15
J
u
kka Lauri Zitting
T
I
KA-70
-
Better MIME information
f
or t
h
e
O
p
en Document
.
.
.
commit
|
commitdiff
|
tree
2007-10-15
Jukka
Lau
r
i Zitting
TIKA-67 - Add an a
u
t
o-
d
ete
c
tin
g
Par
s
er implementa
t
ion
commit
|
commitdiff
|
tree
2007-10-15
J
u
kka Lauri Zitti
n
g
TIKA-
6
8 - Add dummy parser cl
a
sses to be used as sentinels
commit
|
commitdiff
|
tree
2007-10-14
Ju
k
ka La
u
ri Zitting
TIKA-66
- Use
Java 5 feature
s
in or
g
.
a
p
ache
.
tika
.
mime
commit
|
commitdiff
|
tree
2007-10-14
J
u
kka Laur
i
Zitting
T
IKA-6
3
-
A
void multiple passes ov
e
r
the inp
u
t stream
.
.
.
commit
|
commitdiff
|
tree
2007-10-14
J
u
kka
L
auri Zitting
T
IKA-60 - Renam
e
Microsoft parse
r
clas
s
es
commit
|
commitdiff
|
tree
2007-10-14
Juk
k
a
L
auri Z
i
tting
TIKA-6
0
-
Rename Micr
o
soft parse
r
c
lasses
commit
|
commitdiff
|
tree
2007-10-13
Ju
k
ka Lauri Zitting
TIKA-62 - Us
e
TikaConfig
.
getDe
f
aultConfig()
ins
t
ead
.
.
.
commit
|
commitdiff
|
tree
2007-10-12
Jukka L
a
u
ri Zitting
TIKA-57 - Renam
e
org
.
apache
.
tika
.
ms to org
.
apache
.
t
i
ka
.
.
.
commit
|
commitdiff
|
tree
next