TIKA-123: Structured MS Office parsing