Importing data from a file

Import data from one or more files in a range of formats, including JSON, TXT, CSV, TSV, Excel, and PDF.

Written by Tomas Larsson
Updated over a week ago

Keywords: data import, text file, json, csv, tsv, txt, pdf, xls, xlsx, excel, multiple file import

Dcipher's data import supports the most common formats for text data, including TXT, PDF, Excel, JSON (and various near-JSON formats), CSV, and TSV. File encoding, delimiters, column separators, missing values, and Excel sheets are identified and handled properly during import. Multi-file import is available for TXT and PDF files.

โ˜๏ธ Note: JSON files are expected to have a uniform schema over all JSON objects. Though Dcipher tries its best to fix non-uniform objects, having arrays and primitive values in the same field of different objects yields an error.

Step-by-step guide

1. Open the data import wizard

Click the "Import" button at the top of the workspace and click "Import data from file".

2. Upload one or multiple files from your device

If you haven't already uploaded the file you want to work with to the Dcipher Cloud, do so by clicking "Upload a file from your device" or by dragging the file to the drop zone.

For multi-file import, select all the files you want to upload. During upload, they will be merged into a single file.

3. Select file

Select the file you want to import data from and click "Select file".

4. Set the import settings

Dcipher automatically detects sheets and column separators in the file. For tabular data, you can specify a different separator from the one detected; for Excel files, you can select the sheet you want to use. No additional settings are required to import PDF and JSON files.
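To see which separator a delimited file is likely to use before overriding the detected one, you can inspect it locally. The Python sketch below is an illustration only and does not reflect how Dcipher performs its detection: `guess_delimiter` is a hypothetical helper that picks the candidate splitting every non-empty line into the same number of fields, and it deliberately ignores quoted fields.

```python
def guess_delimiter(text, candidates=(",", "\t", ";", "|")):
    """Return the candidate that splits every non-empty line into the same
    number of fields (more than one), preferring the one that yields the
    most fields. Returns None if no candidate fits.
    Simplification: delimiters inside quoted fields are not handled."""
    lines = [line for line in text.splitlines() if line.strip()]
    best, best_fields = None, 1
    for delim in candidates:
        # Number of fields each line would have with this delimiter.
        counts = {line.count(delim) + 1 for line in lines}
        if len(counts) == 1:
            fields = counts.pop()
            if fields > best_fields:
                best, best_fields = delim, fields
    return best


# Illustrative data: a semicolon-separated file.
print(repr(guess_delimiter("name;age;city\nAda;36;London\n")))
```

If the result differs from the separator shown in the import settings, that is a hint to set the separator manually.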

In the case of tabular data, you can select one or multiple data columns to import. By default, all columns will be imported.

[Screenshot: Excel file import settings example]

[Screenshot: CSV import settings example]

5. Import the data

Click the "Import" button to import the data. Dcipher automatically detects the type of data stored in each column or object. The type schema of the imported data is then displayed in the Schema workbench.
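As a rough illustration of what per-column type detection involves, the Python sketch below guesses a type from the string values of one column. It is an assumption-laden example, not Dcipher's detection logic: `infer_type` is a hypothetical helper, and the type names it returns ("integer", "float", "text", "empty") are not Dcipher's schema types.

```python
def infer_type(values):
    """Guess a column type from its string values.
    Empty strings and None are treated as missing and skipped."""
    non_empty = [v.strip() for v in values if v is not None and v.strip()]
    if not non_empty:
        return "empty"

    def all_parse(cast):
        # True if every remaining value can be converted with `cast`.
        for v in non_empty:
            try:
                cast(v)
            except ValueError:
                return False
        return True

    if all_parse(int):
        return "integer"
    if all_parse(float):
        return "float"
    return "text"


# Illustrative columns with a missing value in the first one.
print(infer_type(["1", "2", ""]))
print(infer_type(["1.5", "2"]))
print(infer_type(["Stockholm", "42"]))
```

A single non-numeric value turns the whole column into text in this sketch, which is one reason to check the detected schema after import.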
