Tuesday, November 12, 2013

Work With Office Binary File Formats DOC ODT XLS PPTX MSG on Hadoop

What's New in this Release?
The Aspose team is proud to announce the release of Aspose for Hadoop. Apache Hadoop is well suited to archiving big data across several nodes through its flexible distributed file system (HDFS). This big data solution is also powered by the MapReduce framework, which lets developers analyze the archived data through its APIs. Big data may be structured or unstructured and may come in any file format. With this in mind, we have released the first version of the Aspose for Hadoop project, which enables developers to work with a number of file formats. Below is a list of the file formats supported in the initial version:
  • Microsoft Word (DOC)
  • WordprocessingML (DOCX, XML)
  • Rich Text Format (RTF)
  • HTML, XHTML and MHTML
  • OpenDocument (ODT)
  • Microsoft Excel (XLS)
  • SpreadsheetML (XLSX, XML)
  • OpenDocument Spreadsheet (ODS)
  • PresentationML (PPTX, XML)
  • Outlook Emails (MSG)
Using the Aspose for Hadoop project, Hadoop developers can parse text from any of the above formats. The extracted text can then be used in MapReduce analysis algorithms or for any other purpose, depending on the use case.
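To give a rough idea of what "using the text in a MapReduce analysis" can look like, here is a minimal word-count sketch in plain Java. It does not use the Aspose for Hadoop or Hadoop APIs; it assumes the text has already been extracted from the documents into strings, and the class and method names (ExtractedTextWordCount, map, reduce, countWords) are hypothetical, chosen only for illustration. In a real job the same two steps would live in a Hadoop Mapper and Reducer.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Locale;
import java.util.Map;
import java.util.TreeMap;

// Illustration only: a map step and a reduce step over already-extracted text.
// None of these names come from Aspose for Hadoop or from Hadoop itself.
final class ExtractedTextWordCount {

    // Map step: split one document's extracted text into lowercase words.
    static List<String> map(String extractedText) {
        List<String> words = new ArrayList<>();
        String normalized = extractedText.toLowerCase(Locale.ROOT);
        // Split on any run of characters that are neither letters nor digits.
        for (String token : normalized.split("[^\\p{L}\\p{N}]+")) {
            if (!token.isEmpty()) {
                words.add(token);
            }
        }
        return words;
    }

    // Reduce step: sum the occurrences of each word.
    static Map<String, Integer> reduce(List<String> words) {
        Map<String, Integer> counts = new TreeMap<>();
        for (String word : words) {
            counts.merge(word, 1, Integer::sum);
        }
        return counts;
    }

    // Runs the map step on every document, then reduces the combined output.
    static Map<String, Integer> countWords(List<String> documents) {
        List<String> allWords = new ArrayList<>();
        for (String document : documents) {
            allWords.addAll(map(document));
        }
        return reduce(allWords);
    }
}
```

Calling countWords with the text of two documents, "Big data on Hadoop" and "Hadoop stores big data", yields the counts big=2, data=2, hadoop=2, on=1, stores=1.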
Overview: Aspose
Aspose are file format experts. At Aspose you will find a wide variety of file management components. Supported formats include Word documents, Excel spreadsheets, PowerPoint presentations, PDF documents, and Flash and Project files. Aspose produce components for .NET, Java and SharePoint, as well as rendering extensions for SQL Server Reporting Services and JasperReports exporters. Aspose help developers become more productive and maximize their investments by delivering reliable solutions on time.
More about Aspose Products
