To schedule a demonstration, call 1-800-998-4874

Advanced Processing

Using the Cloud to perform eDiscovery allows users to take advantage of distributed processing power on an as-needed basis, accessing extra servers when required and eliminating infrastructure investment.

Rapid Data Explosion with Pre-Processing Analytics

Powerful and fast data explosion, coupled with innovative pre-processing tools, allows users to perform early data assessment prior to processing, reducing the volume of documents to process and greatly enhancing processing speed.


Overview

Run Cavo eD in the AWS Cloud, a Private Cloud or behind your Firewall


Details and Benefits

Cavo eD automatically “explodes” email PSTs and folders in a collection during Pre-Processing. This Pre-Processing step enables Cavo eD users to cull the individual emails in the PSTs by their metadata, as well as perform keyword searches on the subject lines of emails. Further analytics allow users to parse information by date, custodian or file type. These culling tools can quickly reduce the amount of data that needs to be processed, saving time and money.
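To make the idea of metadata culling concrete, here is a minimal Python sketch. It is not Cavo eD code: the `EmailMeta` record, the `cull` function and the sample emails are all hypothetical, and the filters simply mirror the criteria named above (custodian, date and subject-line keywords).

```python
from dataclasses import dataclass
from datetime import date


@dataclass
class EmailMeta:
    """Hypothetical metadata record for one email exploded from a PST."""
    custodian: str
    sent: date
    subject: str
    file_type: str


def cull(emails, custodians=None, start=None, end=None, subject_keywords=()):
    """Return only the emails that pass every filter that was supplied."""
    kept = []
    for e in emails:
        if custodians is not None and e.custodian not in custodians:
            continue
        if start is not None and e.sent < start:
            continue
        if end is not None and e.sent > end:
            continue
        subject = e.subject.lower()
        if subject_keywords and not any(k.lower() in subject for k in subject_keywords):
            continue
        kept.append(e)
    return kept


# Made-up sample data, for illustration only
emails = [
    EmailMeta("alice", date(2015, 3, 1), "Q1 pricing proposal", "msg"),
    EmailMeta("bob", date(2015, 6, 9), "Lunch on Friday?", "msg"),
    EmailMeta("alice", date(2013, 1, 15), "Pricing history", "msg"),
]

selected = cull(emails, custodians={"alice"}, start=date(2014, 1, 1),
                subject_keywords=["pricing"])
print([e.subject for e in selected])  # ['Q1 pricing proposal']
```

Only the emails that survive the filters would go on to full Processing, which is where the time and cost savings come from.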

As a result, Cavo eD users have a multitude of Early Data Assessment capabilities prior to Processing.

  • For instance, they can provide counsel with insights into the data: which custodian wrote emails about what topic, and when? Counsel might determine early on that a critical custodian is missing from the collection, something they otherwise might not have known until after the data was processed.
  • They can analyze the data down to the individual email to see whether the collection is complete.
  • They can take a macro view of the data.
  • They can prioritize Processing so that documents perceived to be important move to the front of the review line.
  • They can develop an understanding of the content of the email in the corpus.

After Early Data Assessment is finished, Cavo eD users can select the data to be Processed with great precision and prioritization. No more over- or under-Processing. This can lower costs, reduce risk and save time while increasing defensibility.

Boilerplate Customization

Processing Feature Settings can help organize and reduce False Positive Results


Overview

Using the Boilerplate removal option before documents are processed can dramatically reduce the number of false positives that are revealed during search and review.

Details and Benefits

The Boilerplate feature allows Cavo eD users to designate “boilerplate” text so that it is not indexed during Processing, selectively excluding that information from being ingested into the database. Commonly, this includes things such as the standard disclaimer that appears at the bottom of every corporate email. Practically, this means that a search for documents with the word “confidential” does not return every document that merely carries a boilerplate statement at the bottom saying, “This document is privileged and confidential.”
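The effect on search results can be illustrated with a short Python sketch. This is an assumption-laden illustration rather than the product's implementation: `BOILERPLATE`, `strip_boilerplate` and `matches` are hypothetical names, and a simple substring check stands in for a real search index.

```python
import re

# Hypothetical boilerplate list; in practice this would be configured per case
BOILERPLATE = [
    "This document is privileged and confidential.",
]


def strip_boilerplate(text, patterns=BOILERPLATE):
    """Remove each boilerplate passage, ignoring case and whitespace differences."""
    for passage in patterns:
        # Allow any run of whitespace (including line breaks) between words
        words = [re.escape(w) for w in passage.split()]
        text = re.sub(r"\s+".join(words), "", text, flags=re.IGNORECASE)
    return text.strip()


def matches(term, text):
    """Stand-in for a search: look for the term only in the non-boilerplate text."""
    return term.lower() in strip_boilerplate(text).lower()


routine = "See you at 3pm.\n\nThis document is privileged\nand confidential."
sensitive = "Keep the merger terms confidential until Friday."

print(matches("confidential", routine))    # False: the only hit was in the disclaimer
print(matches("confidential", sensitive))  # True: the hit is in the message body
```

The routine email no longer shows up as a false positive, while the email that genuinely discusses confidential material is still found.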

Processing Customization

Solve Difficult Problems with Advanced Processing Customization


Overview

The processing template is completely customizable, so it is easy to process exactly the information needed for each case.

Details and Benefits

Every corpus is different, and the demands of every case are different. These differences often conspire to create Processing challenges for even the most seasoned litigation support professional. That’s why Cavo Legal offers its users a number of important ways to customize a Processing job.
For instance, users can select the size characteristics of the embedded objects that are to be processed with or without OCR. So when a case does not require very small embedded objects to be Processed, Cavo eD allows its users to set a size threshold so that these objects do not take up vital Processing time or clutter a document grid with unnecessary items. Certain custodians and date ranges can be prioritized in the processing order to drive the document review in an organized fashion.
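As a rough sketch of these two settings, the Python below filters embedded objects by a size threshold and moves prioritized custodians to the front of the processing order. The names `EmbeddedObject`, `objects_to_process` and `processing_order`, the 10,000-byte threshold and the sample data are assumptions made for illustration, not Cavo eD settings.

```python
from dataclasses import dataclass


@dataclass
class EmbeddedObject:
    """Hypothetical record for an object embedded in a document."""
    name: str
    size_bytes: int


def objects_to_process(objects, min_size_bytes):
    """Keep embedded objects at or above the size threshold, in original order."""
    return [o for o in objects if o.size_bytes >= min_size_bytes]


def processing_order(documents, priority_custodians):
    """Put documents from prioritized custodians first.

    sorted() is stable, so the original order is kept within each group.
    """
    return sorted(documents, key=lambda d: d["custodian"] not in priority_custodians)


embedded = [
    EmbeddedObject("signature_logo.png", 2_048),
    EmbeddedObject("contract_scan.tif", 850_000),
    EmbeddedObject("spacer.gif", 43),
]
print([o.name for o in objects_to_process(embedded, min_size_bytes=10_000)])
# ['contract_scan.tif']

docs = [
    {"id": 1, "custodian": "bob"},
    {"id": 2, "custodian": "alice"},
    {"id": 3, "custodian": "carol"},
    {"id": 4, "custodian": "alice"},
]
print([d["id"] for d in processing_order(docs, {"alice"})])
# [2, 4, 1, 3]
```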

Deduplication

Deduplication Identification and Near DeDuplication Options Improve Results


Overview

Cavo eD enables a number of different de-duplication identification methods to be run concurrently during processing.

Details and Benefits

Users can select from hashing text alone, text plus metadata, or text plus metadata and super type (or any combination of the three). Additionally, a near-duplication option strips out white space, capitalization and punctuation, leaving a pure text comparison regardless of formatting differences. By grouping similar documents, users gain a distinct advantage: less time wasted reviewing redundant data and more time available for analyzing unique information critical to the case.
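The difference between exact and near-duplicate identification can be sketched in a few lines of Python. This is a generic illustration, not Cavo eD's algorithm: SHA-256 is an assumed hash choice, `exact_hash` and `near_hash` are hypothetical names, and the “super type” option is left out because the text does not define it.

```python
import hashlib
import string


def exact_hash(text, metadata=None):
    """Hash the text alone, or the text plus metadata fields if any are given."""
    h = hashlib.sha256(text.encode("utf-8"))
    for key in sorted(metadata or {}):
        # Sorted keys keep the hash independent of dictionary ordering
        h.update(f"\x00{key}={metadata[key]}".encode("utf-8"))
    return h.hexdigest()


# Translation table that deletes all ASCII whitespace and punctuation
_STRIP = str.maketrans("", "", string.whitespace + string.punctuation)


def near_hash(text):
    """Hash the text after removing white space, punctuation and capitalization."""
    normalized = text.translate(_STRIP).lower()
    return hashlib.sha256(normalized.encode("utf-8")).hexdigest()


a = "Please review the attached draft."
b = "please  review the\nATTACHED draft"

print(exact_hash(a) == exact_hash(b))  # False: formatting differs
print(near_hash(a) == near_hash(b))    # True: same words once normalized
print(exact_hash(a, {"custodian": "alice"}) == exact_hash(a, {"custodian": "bob"}))
# False: same text, different metadata
```

Documents that share a hash value under a given method would be grouped as duplicates of that type.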

The goals of a Review can sometimes pose unusual challenges for the de-duplication of documents. One method of de-duplication might be perfect for one case and entirely wrong and misleading for another. So Cavo eD allows users to customize de-duplication settings in a myriad of ways, according to whatever combination of methods the user chooses.

Cavo eD displays the results of each Duplication type with a different color in the document grid, immediately identifying what type of duplication the document represents. If the case administrator wants to remove duplicate documents from the review set, they can be “hidden” in the grid so that they do not appear in the population for review purposes. This flexibility allows the case administrator to make judgments on the fly about how to handle duplicates. No matter how demanding the de-duplication requirements of a case, Cavo eD can deliver.

PROCESSING PLUS+

Processing Plus+ includes not only basic document ingestion, but all phases of document processing including duplication identification, deNisting and thematic content creation, all at a current speed of 57 GB per hour!


Overview

Cavo eD Processing Plus+
When Cavo eD quotes you a processing speed, we are quoting a true processing speed that includes all the needed steps in one pass to process the data and prepare it for Search and Review. Our definition of Processing Plus+ is all-inclusive, covering a full range of capabilities and customization to fit virtually any need. Our current processing rate is over 57 GB per hour, including load time into the Cavo eD platform, ready for Search, Analysis and Review by your team.

Our true distributed processing allows us to spread the data processing across multiple servers simultaneously. When used on the AWS cloud, selecting the number of servers to process with is a very cost-effective way to handle a short-term need for lots of machine horsepower.
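For a rough sense of scale, the quoted rate can be turned into a time estimate. The Python sketch below assumes the 57 GB per hour figure holds for the collection in question, which will vary with the data and the number of servers; the `estimated_hours` function and the sample collection sizes are hypothetical.

```python
QUOTED_RATE_GB_PER_HOUR = 57  # rate quoted in the text above


def estimated_hours(collection_gb, rate_gb_per_hour=QUOTED_RATE_GB_PER_HOUR):
    """Estimate processing time, assuming the rate stays constant for the whole job."""
    if rate_gb_per_hour <= 0:
        raise ValueError("rate must be positive")
    return collection_gb / rate_gb_per_hour


# Made-up collection sizes, for illustration only
for size_gb in (57, 285, 1000):
    print(f"{size_gb} GB -> about {estimated_hours(size_gb):.1f} hours")
# 57 GB -> about 1.0 hours
# 285 GB -> about 5.0 hours
# 1000 GB -> about 17.5 hours
```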

Details and Benefits

Processing Plus+ – We have achieved a processing speed of 57 GB per hour using raw data files from a major case. All steps are included in this rate: data loading, exploding PSTs and containers, customized deNist lists, deduplication, boilerplate data removal, theme capturing, email threading, OCR and loading documents for first pass review or further analytics.
Deployment – Cavo eD can be deployed on Amazon Web Services, which is highly scalable and elastic, delivering exceptional performance at reasonable rates with no upfront hardware costs. Alternatively, should you choose, Cavo eD can also be deployed behind the firewall.
Rapid Data Explosion with Pre-Processing Analytics – Innovative pre-processing tools and data explosion allow users to perform early data assessment prior to processing, reducing the volume of documents to process and greatly enhancing processing speed.

Cavo eD’s Processing Plus+ includes customizable settings

Thematic Document Capture – Better than keywords: our algorithms capture and rank multiple themes in each document, creating a thematic fingerprint that makes it easy to search and classify documents by their thematic content.
Boilerplate – The “boilerplate template” option can be used to eliminate any language patterns that are unneeded such as standard disclaimer language that appears at the bottom of so many corporate email messages. By reducing the volume of false positive results, the search engine and users can then focus on truly relevant document content that may impact the case.
DeNist – Customizable lists keep unneeded data files from being processed.
DeDuplication – Multiple deduplication options are included and can be fully customized to the needs of the litigation. Duplicates can be displayed during the review or hidden, based on need.
OCR – Full OCR for all documents without text.
Embedded Objects – Customize the maximum number and sizes of Embedded Objects to include in the searchable data, and whether to display them inline or as separate documents.
Hidden Data – System can display hidden data (e.g., white text, document changes, etc.) after Processing.
Culling Settings – Data can be culled by Type, Custodian, Size and Folder for exclusion or processing priority.