Pienso Ingest lets you prepare your raw text for use as training data, whether it's structured or unstructured.
Ingest supports the direct upload of your text documents from a wide variety of sources and formats. Whether you're looking to bulk-upload from your data lake via API or just upload PDFs in a zip file, Pienso has you covered.
Ingest's Refine tool lets you adjust your data before upload: remove or replace extraneous words, phrases, or documents; extract a randomly sampled subset of your documents; or split long documents into smaller ones.
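To make the three Refine operations concrete, here is a minimal Python sketch of what each one does to a list of documents. It is not Pienso's implementation or API: the function names (`replace_phrase`, `sample_docs`, `split_doc`) and the word-count splitting rule are assumptions chosen for illustration only.

```python
import random
import re

# Hypothetical helpers for illustration; these are not part of Pienso.

def replace_phrase(docs, phrase, replacement=""):
    """Remove (default) or replace a phrase in every document, ignoring case."""
    pattern = re.compile(re.escape(phrase), flags=re.IGNORECASE)
    cleaned = []
    for doc in docs:
        # A function replacement keeps backslashes in `replacement` literal.
        text = pattern.sub(lambda _match: replacement, doc)
        # Collapse runs of whitespace that a removal can leave behind.
        cleaned.append(re.sub(r"\s{2,}", " ", text).strip())
    return cleaned

def sample_docs(docs, k, seed=0):
    """Return a random subset of at most k documents (seeded for repeatability)."""
    rng = random.Random(seed)
    return rng.sample(docs, min(k, len(docs)))

def split_doc(doc, max_words=200):
    """Split one long document into chunks of at most max_words words."""
    words = doc.split()
    return [" ".join(words[i:i + max_words]) for i in range(0, len(words), max_words)]
```

For example, `replace_phrase(docs, "confidential")` strips a recurring footer word, while `split_doc(doc, max_words=200)` turns one long report into several shorter documents.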
Once your data is refined, format the data set by identifying which column contains the text you want to analyze and which columns are metadata.
From here, you can use this data as training data for a fingerprint model or, if it is already tagged, for a deep learning model. Alternatively, you can send it to an active Pienso deployment for scoring and then visualize the results in a Pienso Dashboard.
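As a rough illustration of the formatting step, the sketch below separates one text column from the remaining metadata columns in a small CSV. The column names (`id`, `author`, `body`), the helper `split_columns`, and the output record shape are all assumptions for the example; they do not reflect how Pienso stores data internally.

```python
import csv
import io

# Hypothetical helper for illustration; not part of Pienso.
def split_columns(rows, text_column):
    """Treat one column as the text to analyze and all others as metadata."""
    records = []
    for row in rows:
        metadata = {name: value for name, value in row.items() if name != text_column}
        records.append({"text": row[text_column], "metadata": metadata})
    return records

# A tiny in-memory CSV stands in for an uploaded file.
sample = io.StringIO("id,author,body\n1,ana,First note\n2,ben,Second note\n")
records = split_columns(csv.DictReader(sample), text_column="body")
```

Each record then carries the text to analyze alongside its metadata, which is the distinction the formatting step asks you to make.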
Pienso Explore lets you search a data set for documents that match your interests.