repo.or.cz
/
tika.git
/
search
commit
grep
author
committer
pickaxe
?
search:
re
summary
|
log
|
graphiclog1
|
graphiclog2
|
commit
|
commitdiff
|
tree
|
refs
|
edit
|
fork
first
·
prev
·
next
TIKA-126: Add Parser.parse(InputStream, Metadata) for metadata extraction
2008-03-09
J
u
kka Lauri
Z
i
t
t
ing
TIKA-126: Add Parser
.
parse(InputStream, Metadata)
f
o
r
.
.
.
commit
|
commitdiff
|
tree
2008-03-09
J
ukka Lau
r
i Zitting
TIKA-
1
23: Structu
r
ed MS Offi
c
e parsing
commit
|
commitdiff
|
tree
2008-03-09
Ju
k
ka Laur
i
Zitting
TI
K
A
-
123: Structured MS Office parsing
commit
|
commitdiff
|
tree
2008-02-19
Jukka L
a
uri Zitting
T
IKA-123:
Str
u
ctured MS Office parsi
n
g
commit
|
commitdiff
|
tree
2008-02-19
Jukka
L
a
uri Zitting
TIKA-122: Use Commons IO 1
.
4
commit
|
commitdiff
|
tree
2008-02-18
Jukka Lauri Zitting
T
IK
A
-1
2
3: Str
u
cture
d
MS Off
i
c
e parsing
commit
|
commitdiff
|
tree
2008-02-18
Jukka Lauri
Z
itt
i
ng
TIKA-12
3
:
S
t
ructured MS
Off
i
ce parsing
commit
|
commitdiff
|
tree
2008-02-18
Jukka Lauri
Zi
t
ti
n
g
TIKA-123: Structured
M
S
Office p
a
rsing
commit
|
commitdiff
|
tree
2008-02-18
Jukka Lauri Zitt
i
n
g
T
I
KA-103
:
E
x
cel parsi
n
g
ign
o
res cell formati
n
g
commit
|
commitdiff
|
tree
2008-02-17
Jukka
L
auri Zitting
TIKA-123: St
r
uctured MS
O
ffice parsing
commit
|
commitdiff
|
tree
2008-02-17
Juk
k
a Lauri
Z
itti
n
g
TIKA-123:
Struc
t
ured
MS Of
f
i
c
e
p
ar
s
ing
commit
|
commitdiff
|
tree
2008-02-17
Jukka Lauri
Z
ittin
g
TIKA-123: Structured M
S
Office parsing
commit
|
commitdiff
|
tree
2008-02-17
Jukka Lauri
Zitting
TIKA-1
2
3: St
r
uctured MS
Office parsing
commit
|
commitdiff
|
tree
2008-01-26
Jukka Lauri Zitt
i
ng
T
IKA-118: Bouncy Ca
s
tle bina
r
ies require US exports
.
.
.
commit
|
commitdiff
|
tree
2008-01-25
J
u
kk
a
L
au
r
i
Z
itting
T
I
KA
-
96:
T
i
k
a CLI
commit
|
commitdiff
|
tree
2008-01-22
Jukka
L
auri Zitting
TIKA-97:
T
i
k
a GUI
commit
|
commitdiff
|
tree
2008-01-22
J
u
kka
Lauri Zitting
TIKA-97: T
i
k
a
G
U
I
commit
|
commitdiff
|
tree
2008-01-22
Jukka Lauri Zitti
n
g
TIKA-97: Tika GUI
commit
|
commitdiff
|
tree
2008-01-22
Juk
k
a Lauri Zittin
g
TIKA-97:
T
ika GUI
commit
|
commitdiff
|
tree
2008-01-21
Jukk
a
La
u
r
i Zitti
n
g
TI
K
A-115: Tika pack
a
g
e w
i
th all the depen
d
encies
commit
|
commitdiff
|
tree
2008-01-21
Jukka Lauri Zitti
n
g
TIK
A
-117: Drop JDO
M
and
Ja
x
en d
e
pendencies
commit
|
commitdiff
|
tree
2008-01-21
Jukka Lauri Zit
t
i
n
g
TIKA-116: Str
e
aming parse
r
f
or OpenDocument files
commit
|
commitdiff
|
tree
2008-01-21
Ju
k
ka Lauri Zitting
TIKA
-
109: WordParse
r
fa
i
l
s
o
n
s
ome Word file
s
commit
|
commitdiff
|
tree
2008-01-20
Jukka
Lauri Zitt
i
n
g
T
I
KA-105: E
x
c
el pars
e
r
i
m
p
leme
n
tatio
n
b
ased on
POI
.
.
.
commit
|
commitdiff
|
tree
2008-01-20
Jukk
a
Lauri Zitting
T
I
KA-105: Excel parse
r
imple
m
entation based o
n
P
OI
.
.
.
commit
|
commitdiff
|
tree
2008-01-20
Jukk
a
Lauri Zitting
TIKA-109: WordParse
r
fails on some Word files
commit
|
commitdiff
|
tree
2007-12-31
Jukka L
a
uri
Z
itti
n
g
pom
.
xm
l
: Updat
e
d trunk version to 0
.
2-SNAPSHO
T
commit
|
commitdiff
|
tree
2007-12-26
Jukka Lauri Zitti
n
g
TIKA-111: Missing l
i
c
ense headers
commit
|
commitdiff
|
tree
2007-12-26
J
u
kka Lau
r
i Zitting
TIK
A
-110:
Add KEY
S
file for
Tika
commit
|
commitdiff
|
tree
2007-12-21
J
u
kka Laur
i
Zitting
T
I
K
A-105 - Excel pa
r
s
er implem
e
nt
a
tion based
o
n P
O
I
.
.
.
commit
|
commitdiff
|
tree
2007-12-21
Jukk
a
Lauri
Z
itting
TIK
A
-
1
0
6
- R
e
mo
v
e dependency on J
a
karta
ORO - use JDK
.
.
.
commit
|
commitdiff
|
tree
2007-12-21
Jukka Lauri Zitti
n
g
TIKA-
1
04
-
Add utility met
h
ods to throw IOException
.
.
.
commit
|
commitdiff
|
tree
2007-12-21
J
u
k
ka Lauri Zitting
TIKA-1
0
7 - Remo
v
e use of asserti
o
ns for
a
r
gument ch
e
cking
commit
|
commitdiff
|
tree
2007-11-25
Jukka Lauri Zitting
TIKA-102 - Parser imp
l
ementations
l
o
ading a lar
g
e amount
.
.
.
commit
|
commitdiff
|
tree
2007-11-25
Jukka Laur
i
Zittin
g
TI
K
A-102 - Parse
r
implementations l
o
ading a large
a
mount
.
.
.
commit
|
commitdiff
|
tree
2007-11-20
Jukka Lauri Zitting
TIKA-91: A
d
d proper a
t
t
r
i
buti
o
n for code
from
t
extm
i
ning
.
org
commit
|
commitdiff
|
tree
2007-11-13
Jukka La
u
ri Zitt
i
ng
TIKA
-
100 - Stru
c
tu
r
ed PDF
p
arsing
commit
|
commitdiff
|
tree
2007-11-06
Jukka Lau
r
i Zitt
i
ng
TI
K
A-87
-
Mi
m
eTyp
e
s should all
o
w modific
a
tion of MIME
.
.
.
commit
|
commitdiff
|
tree
2007-11-05
Jukka La
u
ri Zitting
TIKA-87 - MimeTypes should
a
llow modificatio
n
of MIME
.
.
.
commit
|
commitdiff
|
tree
2007-11-04
Jukka Lauri Zi
t
ting
TIK
A
-87 - Mi
m
eTypes should
a
llow m
o
dification of
MIME
.
.
.
commit
|
commitdiff
|
tree
2007-11-03
Jukka
L
aur
i
Zitting
TIKA-87
-
MimeTypes s
h
ould
a
llo
w
modification
o
f MIME
.
.
.
commit
|
commitdiff
|
tree
2007-11-03
Ju
k
ka
Lauri Zi
t
ting
TIKA-87 - MimeTypes s
h
ould allow
m
odifica
t
ion of MIME
.
.
.
commit
|
commitdiff
|
tree
2007-10-23
Juk
k
a
L
auri Zit
t
i
n
g
TIKA-87 - Mim
e
Types
s
hould allo
w
modification of
MIME
.
.
.
commit
|
commitdiff
|
tree
2007-10-22
Jukka Lauri Zitting
TIKA-
8
5 - Add glob
patterns f
r
om th
e
A
S
F svn:eol-st
y
le
.
.
.
commit
|
commitdiff
|
tree
2007-10-22
Jukka Lauri Zi
t
ting
TIK
A
-84 - Add MimeTypes
.
getMimeType(InputStream)
commit
|
commitdiff
|
tree
2007-10-19
Jukka Lauri Zi
t
ting
TIKA-84 - Add MimeType
s
.
ge
t
Mi
m
eTyp
e
(I
n
putStream)
commit
|
commitdiff
|
tree
2007-10-19
Jukka Lauri Zitting
TIKA-
8
3
- Create a org
.
apache
.
tik
a
.
sax
p
ackage fo
r
.
.
.
commit
|
commitdiff
|
tree
2007-10-18
Jukka Lauri
Z
itt
i
n
g
Set
svn:eol
-
st
y
le
t
o
n
ativ
e
commit
|
commitdiff
|
tree
2007-10-18
J
ukk
a
L
a
uri
Zitting
Correct
i
nde
n
ting (
f
our spaces instead of one as
the
.
.
.
commit
|
commitdiff
|
tree
2007-10-16
Jukka Laur
i
Z
i
tting
T
I
KA-71
- Remove
Pa
r
serC
o
nfig and ParserFactory
commit
|
commitdiff
|
tree
2007-10-15
Ju
k
ka Lauri Zit
t
ing
Removed an extra debug print
commit
|
commitdiff
|
tree
2007-10-15
Jukka Lauri Zitting
TIKA-70
- Better MIME infor
m
ation
for
t
h
e Open Document
.
.
.
commit
|
commitdiff
|
tree
2007-10-15
J
u
kka Lauri
Zitting
TIKA-70 - Better MI
M
E information for the Open Document
.
.
.
commit
|
commitdiff
|
tree
2007-10-15
J
u
kk
a
Laur
i
Zittin
g
TI
K
A
-6
7
- Add an
a
uto-de
t
ecting Parser
imp
l
ementation
commit
|
commitdiff
|
tree
2007-10-15
Jukka Lauri Zitting
T
I
KA-68 - Add dummy pars
e
r c
l
asses to be used as s
e
ntinel
s
commit
|
commitdiff
|
tree
2007-10-14
J
u
kka Lauri Zitting
TIKA-66
- Use Java 5 f
e
atures in org
.
apache
.
tika
.
mi
m
e
commit
|
commitdiff
|
tree
2007-10-14
J
uk
k
a L
a
u
ri Zitting
TIKA-6
3
- Avoid
m
ulti
p
le
p
a
sses over
t
he in
p
ut s
t
ream
.
.
.
commit
|
commitdiff
|
tree
2007-10-14
Jukka Lauri Zittin
g
T
IKA-60 - Rena
m
e
M
icrosoft
parser cl
a
sses
commit
|
commitdiff
|
tree
2007-10-14
Jukka Lauri Zitting
T
I
KA-60 - Rename Microsoft parser classes
commit
|
commitdiff
|
tree
2007-10-13
Jukka Lauri
Zitt
i
n
g
T
IKA-62 - Use
T
ikaCon
f
ig
.
getDefaul
t
Config
(
) ins
t
ead
.
.
.
commit
|
commitdiff
|
tree
2007-10-12
Jukka
L
aur
i
Zitting
TIKA-
5
7 - Re
n
ame org
.
apache
.
t
i
ka
.
ms to org
.
apac
h
e
.
tika
.
.
.
commit
|
commitdiff
|
tree
2007-10-12
Jukka Lauri Zit
t
ing
TIKA-53 - XHTML SAX events from p
a
rse
r
s
commit
|
commitdiff
|
tree
2007-10-10
Jukka Lauri Zitting
T
I
KA-40
- Tika ne
e
ds t
o
support
d
iver
s
e character
encodings
commit
|
commitdiff
|
tree
2007-10-08
Ju
k
ka Lauri Zit
t
ing
TI
K
A-41 - Resource files occur
t
w
ic
e
i
n
jar f
i
le
commit
|
commitdiff
|
tree
2007-10-07
Jukka L
a
u
ri Zittin
g
TIKA-45 - Re
r
eadableI
n
p
u
tStream needs to be able to
.
.
.
commit
|
commitdiff
|
tree
2007-10-07
Jukka Lau
r
i Zittin
g
TIKA-48 -
Merge MS
E
xtracto
r
s
a
nd Pa
r
sers
commit
|
commitdiff
|
tree
2007-10-07
Ju
k
ka Lauri
Z
itting
TIKA-46 - Use Metadata in Parser
commit
|
commitdiff
|
tree
2007-10-07
Jukka Lauri Z
i
t
t
ing
TIKA-46 - Use
M
etadata i
n
Parser
commit
|
commitdiff
|
tree
2007-10-07
J
u
kka Lauri Zitting
S
e
t
sv
n
:eol-sty
l
e to nati
v
e
commit
|
commitdiff
|
tree
2007-10-07
Jukka Lauri Z
i
tt
i
ng
TI
K
A-46
-
Us
e
M
e
tadata
i
n Par
s
er
commit
|
commitdiff
|
tree
2007-10-07
Jukk
a
Lauri Zitting
TIKA-47 - Remove TikaLogger
commit
|
commitdiff
|
tree
2007-10-07
J
u
kka
Lauri Zitt
i
ng
TIKA-43 -
P
arser
interface
commit
|
commitdiff
|
tree
2007-10-07
J
u
kka Lauri Zitting
TIK
A
-43 - Parser inter
f
ace
commit
|
commitdiff
|
tree
2007-10-05
Jukka L
a
uri Zitting
TIKA-4
2
- Content cl
a
ss nee
d
s (Str
i
ng,
S
tring, Strin
g
.
.
.
commit
|
commitdiff
|
tree
2007-10-05
J
ukk
a
Lauri
Z
itting
T
I
KA-44 - Spaces for indentation
commit
|
commitdiff
|
tree
2007-10-01
J
ukka Lauri Zitting
TIKA-
3
3
- Stat
e
less
parsers
commit
|
commitdiff
|
tree
2007-09-25
Jukka
Lauri Zitting
TIK
A
-
3
1
- protected Parser
.
parse(InputSt
r
eam
st
r
eam
.
.
.
commit
|
commitdiff
|
tree
2007-09-25
Jukka Lauri
Zitting
typo
commit
|
commitdiff
|
tree
2007-09-25
J
u
kka Lauri Z
i
tting
T
IK
A
-26
-
Use Map<String,
Content> instead of List
.
.
.
commit
|
commitdiff
|
tree
2007-09-25
Jukka Lauri Zitting
TIKA-26
- Implemented Parser
.
g
etSt
r
Conten
t
() in the
.
.
.
commit
|
commitdiff
|
tree
2007-09-24
Juk
k
a Laur
i
Zitting
TIK
A
-26 -
Im
p
lemented Parser
.
getCon
t
ent(S
t
r
ing) in
.
.
.
commit
|
commitdiff
|
tree
2007-09-24
J
u
kk
a
La
u
ri
Z
itting
TIKA
-
30 - Ad
d
ed utility const
r
u
ctors to TikaConfig
commit
|
commitdiff
|
tree
2007-09-24
J
u
kka Lauri Zitting
TIKA-27 - Replaced more "li
u
s" references
with
"tika"
commit
|
commitdiff
|
tree
2007-09-24
Jukka Lauri Zitt
i
ng
TI
K
A-17 - Re
n
a
m
e all "
L
uis" classes to be "Tika" classes
commit
|
commitdiff
|
tree
2007-09-24
Jukka L
a
u
r
i Zitti
n
g
T
IKA-21 - Simplified
c
onfiguration
c
ode
commit
|
commitdiff
|
tree
2007-09-23
Jukka Laur
i
Zitting
TI
K
A
-
25 - Re
m
ove
d
h
a
rdcoded
r
e
ference t
o
C:
\
oo
.
xml
.
.
.
commit
|
commitdiff
|
tree
2007-09-21
Jukka Lauri Zitting
T
I
K
A
-12 -
Decouple P
a
rser f
r
om
P
arserConfig
commit
|
commitdiff
|
tree
2007-09-17
Jukka
L
auri
Z
i
tting
TIKA-15: A
p
plied patc
h
f
rom Keith Be
n
nett
.
commit
|
commitdiff
|
tree
2007-09-13
J
ukka Lauri Zitting
TIKA-12: Added MimeTypesUtils test case co
n
tributed
.
.
.
commit
|
commitdiff
|
tree
2007-09-13
Jukka Lau
r
i Zitt
i
n
g
TIKA-1
2
: Support MIME type
d
e
t
e
ction b
a
se
d
on a URL
.
.
.
commit
|
commitdiff
|
tree
2007-08-17
Jukka Lauri
Zi
t
ting
TIKA-8: Replac
e
d the jmimeinfo dependency wi
t
h a
t
rivial
.
.
.
commit
|
commitdiff
|
tree
2007-08-17
J
ukka
L
a
u
r
i Z
i
tting
TIKA-7: Added
missi
n
g d
e
pe
n
de
n
c
ies
t
o POM
.
commit
|
commitdiff
|
tree
2007-08-17
Juk
k
a Lauri Zitting
pom
.
xml
:
Replac
e
d ta
b
s with spaces,
f
ixe
d
inden
t
a
tion
.
commit
|
commitdiff
|
tree
2007-08-17
Juk
k
a L
a
uri Zitting
T
IKA-7: Added the Li
u
s Lite co
d
e from Rida
.
External
.
.
.
commit
|
commitdiff
|
tree
2007-03-31
Jukka
Lauri Z
i
ttin
g
TIKA-4: Adde
d
brief Maven build inst
r
u
c
tion
s
and some
.
.
.
commit
|
commitdiff
|
tree
2007-03-31
Ju
k
ka Lauri Zitting
TIKA-2: The sit
e
i
s
depl
o
y
ed to t
h
e incubat
o
r/tik
a
.
.
.
commit
|
commitdiff
|
tree
2007-03-31
Jukka Lauri Zitting
TIKA-2: Basic web site based on
Maven
2
.
commit
|
commitdiff
|
tree
2007-03-31
Jukka
L
a
uri Zitting
T
I
KA-4: Ignore Eclipse project files
.
commit
|
commitdiff
|
tree
2007-03-31
J
ukka La
u
ri Zitting
TIKA-4: Basic Maven 2 POM and
s
our
c
e
tree f
o
r Tika
.
commit
|
commitdiff
|
tree
2007-03-31
Juk
k
a L
a
ur
i
Zi
t
ti
n
g
TIKA-
1
: St
a
ndard RE
A
DME,
N
OTICE, and
L
ICENSE files
.
commit
|
commitdiff
|
tree
next