Skip to content

jlalouf-eviden/lutece-tech-plugin-parser

 
 

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

4 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

lutece-tech-plugin-parser

This plugin exposes service for parsing different types of documents stream (pdf, office, openDocument, ooXML, pkg...). It uses tika and pdfbox projects.

Example:

IParserFactory factoryParser = SpringContextService.getBean( IParserFactory.BEAN_FILE_PARSER_FACTORY );
Optional parser = factoryParser.getParser( "application/pdf" );
File file = new File(sourcePdfPath);
InputStream inputStream = new FileInputStream(file);
String result = parser.get().parse( inputStream );\

About

This plugin exposes a service for parsing streams using tika and pdfbox project.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • Java 100.0%