This chapter explains on ingest data tagging.
QueryIO supports on ingest data tagging. This means that when you upload any data to the cluster, the server automatically executes data analysis procedures [parsers] to extract information from the data that you are uploading.
You can also write and register your own data analysis procedures for different file types that you want to analyze. To see how you can write your own On Ingest Parser, refer to the developer documentation.
QueryIO ships with the parser that uses Apache Tika to extract metadata from various types of files. The file types supported are:
You can find compiled classes for this parser bundled in $INSTALL_HOME/demo/OnIngest.jar file.
QueryIO provides inbuilt On Ingest parsers, registered for following files types :
All tables related to file extensions will be automatically created. You can see in Manage Databases.
Upload any file of type png,log,csv,pdf,doc,docx,xls,xlsx or ppt.
QueryIO server will parse the uploaded file and update respective file type tables in the database.
You can use Query Designer to query the metadata extracted using these parsers.