On Ingest Data Tagging

In this chapter

This chapter explains on ingest data tagging: how QueryIO extracts metadata from files as they are uploaded.

What is On Ingest Tagging

QueryIO supports on ingest data tagging: when you upload data to the cluster, the server automatically runs data analysis procedures (parsers) that extract information from the data being uploaded.

You can also write and register your own data analysis procedures for the file types that you want to analyze. To learn how to write your own On Ingest Parser, refer to the developer documentation.
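The idea of registering one parser per file type can be illustrated with a small, language-neutral sketch. It is not QueryIO's parser interface (that is described in the developer documentation); the names `register_parser`, `parse_csv`, and `tag_on_ingest`, and the tag names `COLUMN_COUNT` and `ROW_COUNT`, are all hypothetical and exist only to show how an upload might be dispatched to a parser based on its file extension.

```python
from typing import Callable, Dict

# Hypothetical registry: file extension -> parser function.
# None of these names come from QueryIO; they only illustrate the concept.
Parser = Callable[[bytes], Dict[str, str]]
PARSERS: Dict[str, Parser] = {}


def register_parser(extension: str, parser: Parser) -> None:
    """Associate a parser with a file extension (case-insensitive)."""
    PARSERS[extension.lower().lstrip(".")] = parser


def parse_csv(data: bytes) -> Dict[str, str]:
    """Toy CSV parser: derive two simple tags from the raw content."""
    lines = data.decode("utf-8").splitlines()
    header = lines[0].split(",") if lines else []
    return {
        "COLUMN_COUNT": str(len(header)),
        "ROW_COUNT": str(max(len(lines) - 1, 0)),
    }


def tag_on_ingest(filename: str, data: bytes) -> Dict[str, str]:
    """Run the parser registered for the file's extension, if any."""
    extension = filename.rsplit(".", 1)[-1].lower() if "." in filename else ""
    parser = PARSERS.get(extension)
    return parser(data) if parser else {}


register_parser("csv", parse_csv)
```

In this sketch, a file whose extension has no registered parser simply produces no tags; how QueryIO itself treats such files is not covered here.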

QueryIO ships with a parser that uses Apache Tika to extract metadata from various types of files. The file types supported are:

You can find the compiled classes for this parser bundled in the $INSTALL_HOME/demo/OnIngest.jar file.

Registered On Ingest Parsers

QueryIO provides built-in On Ingest parsers, registered for the following file types:




All tables related to these file extensions are created automatically. You can view them in Manage Databases.

Upload any file of type png, log, csv, pdf, doc, docx, xls, xlsx, or ppt.
The QueryIO server will parse the uploaded file and update the respective file type tables in the database.
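The "one table per file type" behavior described above can be pictured with the following sketch. It uses an in-memory SQLite database purely for illustration; the table naming scheme (`tags_<extension>`), the column layout, and the `store_tags` helper are assumptions made for this example and do not reflect QueryIO's actual schema.

```python
import sqlite3
from typing import Dict


def store_tags(conn: sqlite3.Connection, extension: str,
               filepath: str, tags: Dict[str, str]) -> str:
    """Write extracted tags into a table dedicated to the file type.

    The table name and columns are hypothetical, not QueryIO's schema.
    """
    # Only allow plain ASCII alphanumeric extensions, since the value
    # becomes part of a table name and cannot be bound as a parameter.
    if not (extension.isascii() and extension.isalnum()):
        raise ValueError(f"unsupported extension: {extension!r}")
    table = f"tags_{extension.lower()}"
    conn.execute(
        f"CREATE TABLE IF NOT EXISTS {table} "
        "(filepath TEXT, tag TEXT, value TEXT)"
    )
    conn.executemany(
        f"INSERT INTO {table} VALUES (?, ?, ?)",
        [(filepath, name, value) for name, value in tags.items()],
    )
    conn.commit()
    return table


conn = sqlite3.connect(":memory:")
table = store_tags(conn, "pdf", "/docs/report.pdf",
                   {"Author": "unknown", "PageCount": "12"})

# Query the metadata back for one file, ordered by tag name.
rows = conn.execute(
    f"SELECT tag, value FROM {table} WHERE filepath = ? ORDER BY tag",
    ("/docs/report.pdf",),
).fetchall()
```

The final query stands in for the kind of lookup you would normally build against the real file type tables, whose names and columns are shown in Manage Databases.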





You can use Query Designer to query the metadata extracted using these parsers.



Copyright 2017 QueryIO Corporation. All Rights Reserved.

QueryIO, "Big Data Intelligence" and the QueryIO Logo are trademarks of QueryIO Corporation. Apache, Hadoop and HDFS are trademarks of The Apache Software Foundation.