SequenceBase Corporation

Big data IP solution

  • 2008 till NOW

Company Profile

The SequenceBase Corporation is one of the leaders in providing patent sequence information to the biotechnology, legal, pharmaceutical, scientific, technical and academic bioinformatics communities.

Business Situation

The importance of DNA and protein research is critical to the discovery of new drugs and vaccines, genetic therapies, and sustainable agriculture. Intellectual property experts say that about 80% of the information published in a patent document is not available anywhere else. There are no lexicographers who monitor the publication process, so inventors can use whatever language they choose. Missing details in genetics research is not an option. Intellectual property experts had been expressing the need for a single source of sequence data for a long time.


As a first step, a new product has been developed to cover all available genetic sequences from the published applications and issued patents of the USPTO dating back to 1982. Each database record contains a sequence and related data including organism name, sequence length and tables for modifications, and other features. Bibliographic and text search options, including publication title, abstract, patent assignees at issue, full inventor names plus the complete set of publication, application, and parent case WIPO/PCT numbers and dates are also provided.

As a second step, we have developed SequenceBase® BLAST® Search Portal – a web-based access point for comprehensive patent sequence searching to handle various sequence databases. It is used by pharmaceutical firms, biotech companies, academia and law firms as a solution for all their Intellectual Property (IP) sequence searching needs, including patentability, freedom-to-operate, patent infringement, validity, and business intelligence.

Since the volume of legal and scientific information grows exponentially each year, we’ve developed a long term Big Data strategy to handle the growing amount of data and its processing needs. The system utilizes cloud, distributed processing and scaling technologies to allow blazing fast data delivery. SequenceBase was the first company to announce “same day” data delivery to their clients.


Scientists and legal professionals carrying out essential IP research are seeking the best and simplest way to discover IP sequence information across disparate resources. In the past, their alternatives relied on outsourcing or stepping carefully through multiple, databases — workflows, which often left them with major difficulties to overcome:

  • Is this search really complete?
  • The volume of results is overwhelming!
  • Time taken for results analysis inhibits the IP sequence search process
  • Sifting through duplicated results from multiple databases is slow and inefficient
  • Outsourcing is expensive and hard to integrate into workflows
  • It’s hard to share and report results in a simple accessible way

To overcome these challenges, IP specialists are increasingly turning to the SequenceBase® BLAST® Search Portal, the online IP research solution with easy-to-use, readily accessible content, search, analysis, and reporting tools.

Talk about your PRODUCT
  • Skills:

    • Big Data
    • Cloud
    • Ruby
    • Postgres
    • Solr
    • Javascript
  • Some detailed information not disclosed due to NDA restrictions