Import FAQs

Follow

Topics Below: 

  


Is there a restriction on the size of my files?

Although we can accept files up to 5GB, we recommend that you keep your file sizes smaller than 1GB. Keeping files more compact has advantages:

  1. Smaller files take less time to upload, populating your data quicker while reducing the chance of a network interruption disrupting your upload.

  2. For privacy and security reasons, some of our validation cannot be conducted until your file has been completely received by our systems. Avoiding extremely large files shortens the time to validation.

Do I have to load by custodian?

No, you are not required to load by custodian, but it is recommended to utilize the Custodian assignment feature during import.  It is important to apply the custodian to an import batch so users can 1) analyze, search and isolate documents for particular custodian(s) and 2) include this information in a production export.


Import times seem to vary, why is that?

The largest obstacle to faster uploads is your network connection. In many cases uploads will be significantly faster at work (business lines are typically larger than at home). You may also notice a small performance boost when connected to the network by wire (vs. wireless).


Why is there additional processing time after my upload completes?

When your upload has completed, our server has received your entire file. At that point, final validation is performed before your file is cataloged and stored. During your upload we display an estimate of how much additional time this will take (after the upload status bar is full).


Can I navigate away from the upload screen before it completes?

When uploading to the File Room, the answer to this is No. Once an upload has started, navigating away from the upload page will cause any progress to be lost.

If you would like to continue working, open a new browser window or tab. You can continue your work there while your file upload window continues in the background.

When importing through DATA > Import, once a batch import has been initiated and is queued for processing you can navigate away and importing will proceed as expected.


What is standard import and extended import? 

Standard imports meet the following criteria:

  • Documents as images with corresponding load files. Images must be named as contained in load file. Up to 3 load files per GB of data and a maximum of 25 database fields

OR

  • Native files without additional coding. No load file required. Import includes custodian (if listed), folder path from received media, and document metadata.

Extended imports do not meet the above requirements and require an additional Client Success Services estimate before import.


Scanning specifications for uploading documents

To make for an easy batch upload, follow these guidelines when scanning your documents.

Standard Specifications

  1. Logical document unitization/breaks must be captured and maintained
  2. Document relationships must be conveyed, including bound documents
    1. Relationship information must be populated in loadfile through Begattach/Endattach fields
  3. Maintain following “source” information, if applicable:
    1. Custodian;
    2. Box Number;
    3. Folder/Binder Name;
    4. And any other contextual information the parties involved may find useful
  4. All photographs, charts, graphs, and any other document where there would be a loss of integrity if the original format was not preserved, must be scanned in color.  All other documents can be in black and white.
  5. Scan in direct size proportions (i.e., size for size)
  6. Scan as text reads (i.e., vertical v. horizontal)
  7. All covers, spines, tabs, standard language, duplicate carbons, annotations not directly on the document (i.e., Post-Its), etc. must be scanned on their own page, with a relationship indication to the document(s) it is referencing on the load file provided
  8. Any media found must be discussed amongst parties involved for proper protocol
  9. Any additional non-standard scanning metadata must be agreed to amongst parties prior to any scanning

Electronic Format 

  1. General Considerations:
    1. All data must be delivered in a structured format
    2. All scanned collections should be converted to TIFF images, affiliated with a control number, and include fully searchable text files
    3. File names cannot contain embedded spaces or special characters (including the comma)
  2. Images:
    1. Black and White - 300 DPI; Group IV; Single-Page TIFF Files
    2. Color - JPEG
    3. All TIFF images must have a unique file name, correlating to the control number in the load file (I.e. Bates number)
    4. The number of TIFF files per folder should not exceed 500 files
  3. Text:
    1. A text path field must be included in the load file, providing the path and name of the extracted text file corresponding to each document
    2. Each text file must be named by the same control number as the image file it corresponds to
    3. Do not include the actual text in the load file
    4. The number of text files per folder should not exceed 1,000 files
  4. Load File:
    1. File Format:  CSV or DAT accepted
    2. First line of loadfile must be a header row, identifying each field name provided
    3. Date fields must be provided in the following format: mm/dd/yyyy 
0 out of 0 found this helpful

Comments

0 comments

Article is closed for comments.