repo.or.cz
/
tika.git
/
search
commit
grep
author
committer
pickaxe
?
search:
re
summary
|
log
|
graphiclog1
|
graphiclog2
|
commit
|
commitdiff
|
tree
|
refs
|
edit
|
fork
first
·
prev
·
next
TIKA-122: Use Commons IO 1.4
2008-02-19
J
u
kka
L
auri Zitting
TI
K
A-122: Use C
o
m
m
ons
IO 1
.
4
commit
|
commitdiff
|
tree
2008-02-18
Jukka Lauri Zitt
i
ng
TIKA-1
2
3
: Struc
t
ured
MS Offic
e
par
s
ing
commit
|
commitdiff
|
tree
2008-02-18
J
ukka Lau
r
i Zitting
TIKA-123: Structured MS Office pa
r
si
n
g
commit
|
commitdiff
|
tree
2008-02-18
Jukka Laur
i
Z
i
tting
TIKA-123:
Stru
c
tured MS Office
p
a
rsing
commit
|
commitdiff
|
tree
2008-02-18
Jukka Laur
i
Zit
t
i
ng
TIKA-103: Excel parsing
i
gnores c
e
ll formating
commit
|
commitdiff
|
tree
2008-02-17
Jukka Lauri
Zit
t
ing
TIKA-123: Structured MS
O
f
fice p
a
rsing
commit
|
commitdiff
|
tree
2008-02-17
Jukka
L
auri Zit
t
ing
TIKA-123: Structure
d
MS Office parsin
g
commit
|
commitdiff
|
tree
2008-02-17
Jukka Lauri Zittin
g
T
I
KA-12
3
: Structured MS O
f
fice parsin
g
commit
|
commitdiff
|
tree
2008-02-17
Jukka Lauri Zitting
TIKA-123
:
S
t
ruc
t
ure
d
M
S Office p
a
rsing
commit
|
commitdiff
|
tree
2008-01-26
Jukka Lauri Zitti
n
g
TIKA-118: Bouncy Castle binaries re
q
u
ire
U
S e
x
po
r
t
s
.
.
.
commit
|
commitdiff
|
tree
2008-01-25
J
ukka La
u
ri Z
i
tting
T
I
KA
-
96: Tika CLI
commit
|
commitdiff
|
tree
2008-01-22
Jukka Lauri Zittin
g
TIKA-97
:
Tika GUI
commit
|
commitdiff
|
tree
2008-01-22
Ju
k
ka Laur
i
Z
i
tti
n
g
TIKA-97:
T
ika GUI
commit
|
commitdiff
|
tree
2008-01-22
Juk
k
a Lau
r
i Zitt
i
n
g
T
I
KA-97: Tika
G
UI
commit
|
commitdiff
|
tree
2008-01-22
J
u
k
k
a Lauri Zitting
T
I
KA-97:
T
ika GUI
commit
|
commitdiff
|
tree
2008-01-21
Jukka Lauri Zitting
TIK
A
-115: T
i
ka packag
e
with all the
depende
n
cies
commit
|
commitdiff
|
tree
2008-01-21
J
ukka La
u
ri
Z
i
t
ting
TIKA-
1
17:
D
rop JD
O
M and Jaxen dependencies
commit
|
commitdiff
|
tree
2008-01-21
Jukka L
a
u
ri Z
i
tting
TIK
A
-116: Str
e
aming parser for OpenDocument files
commit
|
commitdiff
|
tree
2008-01-21
Jukk
a
La
u
ri Zitting
TIKA-109: WordPar
s
er fai
l
s
on some Word fi
l
es
commit
|
commitdiff
|
tree
2008-01-20
Jukka Lauri Zitting
T
I
KA-105: Excel parser implem
e
ntation based on POI
.
.
.
commit
|
commitdiff
|
tree
2008-01-20
Juk
k
a Lauri Z
i
t
ting
TIKA-105: Exc
e
l pa
r
ser
implementation b
a
sed on POI
.
.
.
commit
|
commitdiff
|
tree
2008-01-20
Jukk
a
Lauri Zi
t
t
ing
TIKA-109: WordParser fai
l
s on some Word files
commit
|
commitdiff
|
tree
2007-12-31
Jukka
Lau
r
i Zit
t
ing
pom
.
xml: Updated trunk ve
r
s
ion to 0
.
2-SN
A
PSH
O
T
commit
|
commitdiff
|
tree
2007-12-26
Ju
k
ka
L
auri Zit
t
ing
TI
K
A-111: Missing
lice
n
se header
s
commit
|
commitdiff
|
tree
2007-12-26
Jukka Lauri Zitting
TIKA
-
110: Add KEYS f
i
l
e for
Tika
commit
|
commitdiff
|
tree
2007-12-21
Jukka
Lauri Zi
t
ting
T
IKA-
1
0
5
- Excel parser
i
m
plementation
based on POI
.
.
.
commit
|
commitdiff
|
tree
2007-12-21
J
ukka
L
a
ur
i
Zitting
TIKA-10
6
- Remove depe
n
de
n
c
y
o
n Jakart
a
ORO -
use JDK
.
.
.
commit
|
commitdiff
|
tree
2007-12-21
Jukka La
u
r
i
Z
it
t
i
ng
TIKA-104 - Add utilit
y
meth
o
ds to
thro
w
IOEx
c
eptio
n
.
.
.
commit
|
commitdiff
|
tree
2007-12-21
Jukka
L
auri Zitting
T
I
KA-107
-
Remov
e
us
e
of assertions f
o
r a
r
gum
e
nt checking
commit
|
commitdiff
|
tree
2007-11-25
Jukka Lauri Zitt
i
ng
TIKA-102 - Parse
r
imple
m
entations
l
oading a l
a
r
g
e amount
.
.
.
commit
|
commitdiff
|
tree
2007-11-25
Jukka La
u
ri Zit
t
ing
TIKA-1
0
2
- Parser implementations loading a larg
e
amount
.
.
.
commit
|
commitdiff
|
tree
2007-11-20
Juk
k
a Laur
i
Zitting
TIKA-
9
1:
A
dd
p
r
o
per
attribu
t
ion for code from textmining
.
org
commit
|
commitdiff
|
tree
2007-11-13
Jukka
L
auri Zitting
TIKA-100 - St
r
uctured
PD
F
parsing
commit
|
commitdiff
|
tree
2007-11-06
Jukk
a
Lauri Zit
t
ing
TIKA-87 - MimeTypes s
h
oul
d
allow mod
i
f
i
cation
o
f
MIME
.
.
.
commit
|
commitdiff
|
tree
2007-11-05
Juk
k
a Laur
i
Zitting
TIKA-87 - MimeTypes shou
l
d al
l
o
w modification of MI
M
E
.
.
.
commit
|
commitdiff
|
tree
2007-11-04
Jukka Lauri Zitting
TIKA-87
- MimeTypes should allow modification of MIME
.
.
.
commit
|
commitdiff
|
tree
2007-11-03
Jukk
a
L
aur
i
Zitti
n
g
TIKA-87 - MimeTypes
sh
o
uld all
o
w modi
f
ication of
MIME
.
.
.
commit
|
commitdiff
|
tree
2007-11-03
Jukka L
a
uri Zitt
i
ng
TIKA-87 - MimeTypes should allow
modification of MIME
.
.
.
commit
|
commitdiff
|
tree
2007-10-23
J
u
kka Lauri Z
i
tting
TIKA-87 - MimeTypes should
a
llow mod
i
fic
a
t
i
on of M
I
ME
.
.
.
commit
|
commitdiff
|
tree
2007-10-22
Jukka Lauri
Zitting
TIKA-85 - Add glob
p
a
tterns
f
r
o
m
t
he ASF svn:eol-style
.
.
.
commit
|
commitdiff
|
tree
2007-10-22
Jukka
L
a
ur
i
Zi
t
t
ing
T
IKA-84 - Add MimeTy
p
e
s
.
g
etM
i
meType(InputSt
r
eam)
commit
|
commitdiff
|
tree
2007-10-19
J
u
kka Lauri
Z
i
tt
i
n
g
TIKA-8
4
- Add MimeTy
p
e
s
.
getMime
T
y
p
e(I
n
p
u
tStre
a
m
)
commit
|
commitdiff
|
tree
2007-10-19
Jukka Lauri Zitting
TI
K
A-83 - Cr
e
a
te a
o
r
g
.
apach
e
.
tik
a
.
sax package for
.
.
.
commit
|
commitdiff
|
tree
2007-10-18
Jukka Lauri Zitting
Set svn:eol-style to native
commit
|
commitdiff
|
tree
2007-10-18
Jukk
a
Lau
r
i Zitt
i
ng
C
o
rrect indenting (f
o
ur spac
e
s instea
d
of on
e
as the
.
.
.
commit
|
commitdiff
|
tree
2007-10-16
Jukka Lauri
Z
itti
n
g
TIKA-71 - Remove Par
s
erConfi
g
and
ParserFactory
commit
|
commitdiff
|
tree
2007-10-15
Jukk
a
Lauri Zitting
Removed
a
n
extra debug print
commit
|
commitdiff
|
tree
2007-10-15
Jukka Lauri Zitting
T
I
KA-
7
0 - Better MIME information for the
Open Document
.
.
.
commit
|
commitdiff
|
tree
2007-10-15
J
u
k
ka Lau
r
i Zitting
TIKA-70 - Bette
r
M
IM
E
info
r
m
ation f
o
r the
Open
D
o
cument
.
.
.
commit
|
commitdiff
|
tree
2007-10-15
Jukka
Lauri
Z
i
t
ting
T
IKA-67 - Add
an
a
uto
-
d
etecting P
a
rser implem
e
n
t
ation
commit
|
commitdiff
|
tree
2007-10-15
Jukka Lauri Zi
t
ting
TIKA-68 -
Add dumm
y
parser c
l
asses
t
o be use
d
as sentinels
commit
|
commitdiff
|
tree
2007-10-14
J
ukk
a
L
auri Zitting
TIKA-
6
6
- Us
e
J
ava 5 features in org
.
apac
h
e
.
t
i
ka
.
mime
commit
|
commitdiff
|
tree
2007-10-14
Jukka Lauri Zitting
TIKA-63 -
Avo
i
d multiple
passe
s
over the input st
r
eam
.
.
.
commit
|
commitdiff
|
tree
2007-10-14
Jukka Lau
r
i
Zitt
i
n
g
TIKA-60 - Rename
Microsoft parser classes
commit
|
commitdiff
|
tree
2007-10-14
Jukka Laur
i
Z
i
t
ting
T
I
KA-6
0
- Rename Mi
c
rosoft
parse
r
classes
commit
|
commitdiff
|
tree
2007-10-13
Jukka Laur
i
Zitting
TIKA-62 - Use Tika
C
o
nfig
.
getD
e
fault
C
onfig() instead
.
.
.
commit
|
commitdiff
|
tree
2007-10-12
Jukka La
u
ri Zitting
TIKA-57 - Rename org
.
apa
c
h
e
.
tik
a
.
ms to o
r
g
.
apache
.
tika
.
.
.
commit
|
commitdiff
|
tree
2007-10-12
Jukka
Lauri Zitting
TIKA-53 - XHTM
L
S
AX events from
parsers
commit
|
commitdiff
|
tree
2007-10-10
Jukka
Lauri
Z
itting
TIKA-40
-
T
i
ka nee
d
s to support d
i
verse character e
n
co
d
ings
commit
|
commitdiff
|
tree
2007-10-08
Jukka Lau
r
i Zitting
TIKA
-
41 -
R
esource files occur twice in jar file
commit
|
commitdiff
|
tree
2007-10-07
Jukka Lauri Zittin
g
TIKA-45 - Re
r
eadab
l
e
I
nput
S
tre
a
m
needs to be abl
e
to
.
.
.
commit
|
commitdiff
|
tree
2007-10-07
J
ukka Lauri Zitting
TIKA-
4
8
- M
e
rge MS E
x
t
r
a
ct
o
rs
a
n
d
Parsers
commit
|
commitdiff
|
tree
2007-10-07
J
u
kka Lauri Zitt
i
ng
TIKA-46 - Use Metadat
a
in Pa
r
ser
commit
|
commitdiff
|
tree
2007-10-07
Ju
k
ka Lauri Zitt
i
ng
TIKA-46 - Use Met
a
data in
P
ar
s
er
commit
|
commitdiff
|
tree
2007-10-07
Jukka La
u
ri Zitting
S
e
t svn:eol-style to nat
i
ve
commit
|
commitdiff
|
tree
2007-10-07
Jukka Lauri Zitti
n
g
TIK
A
-46
- Use Metadata in P
a
rser
commit
|
commitdiff
|
tree
2007-10-07
Jukk
a
La
u
ri Zitting
TIKA-47 -
R
emove TikaLog
g
er
commit
|
commitdiff
|
tree
2007-10-07
Jukka Lauri
Zitting
T
IK
A
-43 - Pa
r
se
r
i
nterface
commit
|
commitdiff
|
tree
2007-10-07
Jukka Lauri Zittin
g
TI
K
A-43
-
Parser
i
n
t
e
r
face
commit
|
commitdiff
|
tree
2007-10-05
Jukka La
u
ri Zitting
T
IKA-42 - Content class n
e
eds (
S
tring, Stri
n
g,
S
trin
g
.
.
.
commit
|
commitdiff
|
tree
2007-10-05
J
u
kka Lauri Zit
t
ing
TI
K
A-44 - Spaces for indentatio
n
commit
|
commitdiff
|
tree
2007-10-01
Jukka Laur
i
Zitting
TIKA-33 -
Stateless p
a
rser
s
commit
|
commitdiff
|
tree
2007-09-25
Jukka Lauri Zitting
TIK
A
-
3
1 - protected Parser
.
parse(InputStream st
r
eam
.
.
.
commit
|
commitdiff
|
tree
2007-09-25
Jukka Lauri Zittin
g
typo
commit
|
commitdiff
|
tree
2007-09-25
Juk
k
a Lauri Zitting
TIKA-26 -
U
se Map<S
t
r
ing, Content>
i
n
ste
a
d
of List
.
.
.
commit
|
commitdiff
|
tree
2007-09-25
Jukka Lauri Zitting
TIKA-
2
6 -
I
mplemented Parser
.
ge
t
StrConte
n
t() in the
.
.
.
commit
|
commitdiff
|
tree
2007-09-24
Jukka
La
u
ri Zitting
TI
K
A-
2
6 - Implemented Parser
.
getContent(
S
t
r
i
n
g) in
.
.
.
commit
|
commitdiff
|
tree
2007-09-24
Juk
k
a Lauri Zitting
TIKA-
3
0
- Added
utility c
o
nstruc
t
ors to TikaConfig
commit
|
commitdiff
|
tree
2007-09-24
Jukka Lauri Zitting
TIKA-27
-
Repla
c
ed more "
l
i
us" references w
i
th "tika
"
commit
|
commitdiff
|
tree
2007-09-24
Jukka
L
auri
Zitting
TI
K
A
-17 - Rename all
"
Luis"
class
e
s
to be "T
i
ka" cla
s
s
es
commit
|
commitdiff
|
tree
2007-09-24
Jukka Lauri Zitting
TIK
A
-2
1
- Simplified c
o
n
f
i
g
ur
a
tion code
commit
|
commitdiff
|
tree
2007-09-23
J
ukka Lau
r
i Zitting
TIKA-25
- Removed hardcoded
r
eference to C:\oo
.
xml
.
.
.
commit
|
commitdiff
|
tree
2007-09-21
J
u
kka L
a
uri Zitting
TIKA-12 - De
c
ouple Pars
e
r from
P
arserConfig
commit
|
commitdiff
|
tree
2007-09-17
Jukka Lauri Zitting
T
I
KA-15
:
Applied patch from Keith
B
ennett
.
commit
|
commitdiff
|
tree
2007-09-13
J
u
kk
a
Lauri Zitting
TIKA
-
1
2
: Adde
d
MimeT
y
pesUtils te
s
t case
c
ont
r
ibut
e
d
.
.
.
commit
|
commitdiff
|
tree
2007-09-13
Jukka Lauri Zittin
g
TI
K
A-12:
Support MIME
typ
e
detection based on a URL
.
.
.
commit
|
commitdiff
|
tree
2007-08-17
Jukka Lauri Zitting
TIKA-8
:
Replaced the
jmimeinfo
depen
d
e
n
c
y
with a tri
v
i
a
l
.
.
.
commit
|
commitdiff
|
tree
2007-08-17
J
uk
k
a
Laur
i
Z
i
tting
TIKA-7:
Added miss
i
ng dep
e
ndencies to POM
.
commit
|
commitdiff
|
tree
2007-08-17
Jukka Lauri
Z
ittin
g
pom
.
xml
:
R
e
place
d
t
a
b
s wi
t
h
spaces
,
fixed indentat
i
on
.
commit
|
commitdiff
|
tree
2007-08-17
Jukka Lauri Zitting
TIKA-
7
: Added the
L
i
us Lit
e
code from Rida
.
E
xter
n
al
.
.
.
commit
|
commitdiff
|
tree
2007-03-31
Jukka Lauri
Zit
t
ing
TIKA-4:
A
dd
e
d brief Maven
b
uil
d
instructions and some
.
.
.
commit
|
commitdiff
|
tree
2007-03-31
Jukka Lauri Zitting
TI
K
A-2: Th
e
site
i
s deployed to th
e
i
n
cubator
/
t
ik
a
.
.
.
commit
|
commitdiff
|
tree
2007-03-31
Jukka Lauri
Z
itting
TIKA-2: Basi
c
web
s
i
te based on Ma
v
en
2
.
commit
|
commitdiff
|
tree
2007-03-31
Jukka
L
a
u
ri Zitting
TIKA-4: Ignore Eclipse project f
i
les
.
commit
|
commitdiff
|
tree
2007-03-31
Jukka
L
auri Zittin
g
TI
K
A-4: Basic Maven 2 POM and source tr
e
e
f
or Tika
.
commit
|
commitdiff
|
tree
2007-03-31
J
ukka Lauri Zitting
TIKA-1: Standard README
,
NOTICE, and LICENSE files
.
commit
|
commitdiff
|
tree
2007-03-31
Jukk
a
L
a
u
r
i
Zitt
i
ng
TIKA-1: Stan
d
a
rd {trunk,branch
e
s,tags}
s
etup
commit
|
commitdiff
|
tree