PHARMACEUTICAL IP

  • Our Offering
    • Fractional in-house
    • Start-ups
    • Established pharma
    • International IP counsel
    • Investors
    • Due diligence
  • Fractional in-house
  • Sectors
    • Pharmaceuticals
    • Biotechnology
    • Biologics
    • Cell & gene therapies
    • AI drug discovery
    • Chemistry
  • Insights
  • About us
    • Our team
    • Join us
    • Contact us
  • Our Offering
    • Fractional in-house
    • Start-ups
    • Established pharma
    • International IP counsel
    • Investors
    • Due diligence
  • Fractional in-house
  • Sectors
    • Pharmaceuticals
    • Biotechnology
    • Biologics
    • Cell & gene therapies
    • AI drug discovery
    • Chemistry
  • Insights
  • About us
    • Our team
    • Join us
    • Contact us
  • Our offering
  • Fractional in-house
  • Sectors
  • Insights
  • Our team
  • Join us
  • Our offering
  • Fractional in-house
  • Sectors
  • Insights
  • Our team
  • Join us

Is it time for patent offices to enter the bioinformatic age?

  • Sector: Biotechnology
  • 13th June 2025
 

In a world in which incalculable amounts of sophisticated sequence data is freely available, are the clunky processes necessary to input patent sequence data really fit-for-purpose?

The dual-purpose of patent sequence listings

All patent applications containing sequence data in the claims, figures or description are required to submit a sequence listing. The sequence listing provides the sequences, together with a unique sequence identification number (SEQ ID NO:) in a prescribed format and additional data such as the organism and the position of any unusual features in the sequence. 

The sequence listing serves two purposes. First, the sequence listing is used by the patent office to search for the sequences disclosed in the patent application. The prescribed format of the sequence listing assists in the automation of this search. Importantly, all the sequences in the patent application must be included in the sequence listing, regardless of whether they are part of the invention or just relate to tool compounds used in the examples. If a sequence is disclosed in the specification, the sequence must be included in the sequence listing (with a few exceptions, such as very short sequences). This allows the patent office to search for all of the disclosed sequences in the application. 

The second function of the sequence listing is to facilitate public access to the sequence information disclosed in patent applications. There is an understandable desire for the sequence data in patents, which may not be published elsewhere, to be searchable in public databases of sequence information such as GenBank. One of the purposes of the shift to ST.26 sequence listing format was to facilitate better integration between patent sequence data and these public databases of sequences. 

Introduction of ST.26

From 1 July 2022, the old international standard for sequence listings, ST.25, was replaced by the new ST.26 standard.  The shift to ST.26, and the introduction of the new WIPO software for preparing ST.26 sequence listings had the aim of increasing public access to patent sequences. The process of preparing ST.26 sequence listings has some improvements over ST.25, but also some new disadvantages. One of the main issues with ST.26 is that the sequence listing is no longer submitted in a human-readable txt format, but instead as a complex XML file. In an effort to address this problem, WIPO has now introduced the ability to visualise ST.26 sequence listings in an internet browser directly from patentscope, so that users no longer have to download the XML sequence listing and import it into WIPO sequence. 

Further information about ST.26 and WIPO sequence can be found at the WIPO Sequence and ST.26 Knowledge Base. Users can also subscribe to the WIPO sequence listing newsletter. 

The risks of errors in patent sequence data

Sequence information is often the most important part of a patent. Many inventions in the biotech field will be defined in the claims by their sequences, and usually by a SEQ ID for that sequence, as provided in the sequence listing. Therefore, if the sequence listing is incorrect even by a single letter, then the claims of the patent may define completely different subject matter to what the applicant intended to claim. In some cases, this could mean that the patent does not cover the commercial embodiment of an invention. 

The decision in T 1213/05 demonstrates the potentially fatal consequences of there being errors in a patent’s sequence data. In this case, the sequences provided in the priority document contained inadvertent sequence errors. The Board of Appeal found that a priority claim for the corrected sequences in the European patent was therefore invalid. The Board of Appeal cited with agreement the reasoning in T 70/05 that a priority claim to an incorrect sequence cannot be maintained, regardless of the reasons for the possible mistakes, either arising from unintended sequencing or typing errors. 

The Board of Appeal T 1213/05 also rejected the patentee’s arguments that the skilled person’s knowledge of a certain margin of error in sequence data permitted there to be some deviation between the sequence in the priority document and the patent application claiming priority. For the Board of Appeal, the DNA sequences had to be identical to relate to “the same invention” and permit a valid priority claim. Claims directed to the corrected sequences were consequently found invalid in view of intervening prior art disclosing the sequences. 

The decision in T 1213/05 shows that anything less than 100% accuracy in the sequence data can be fatal for a patent. In this context, it is worth bearing in mind that sequence data consists of a list of many strings of letters, where each string (“sequence”) can be thousands of letters long. Making a mistake in just one letter out of the potentially millions of letters in a sequence listing can be both at once very easy to do and almost impossible to detect. The burden to applicants in preparing sequence data for a patent application therefore does not just include the time and cost associated with preparing the sequence listing. In order to avoid potentially fatal errors in the sequence data, robust procedures for checking and validating the sequencing listing are also necessary. However, the only way to effectively minimize errors in sequence listings for which there may be hundreds or even thousands of sequences, is to automate the process. 

Automated processes for dealing with sequences – Lessons to be learnt from bioinformatics?

With the arrival of high throughput sequencing, it became necessary for the academic community to devise automated processes for dealing with vast quantities of sequence data and for uploading these sequences to public sequence databases. Compared with the automation tools used by bioinformaticians, the process of preparing and validating sequence listings for patent applications is exceedingly clunky. 

In order to prepare a ST.26 sequence listing it is necessary to input each sequence and its features into the purpose-built WIPO sequence tool. Unlike ST.25, it is possible to import multiple sequences for your ST.26 sequence listing at once, e.g. in FASTA format, instead of copying and pasting each individual sequence. However, the “features” of each sequence in a sequence listing, such as unusual amino acids, still have to be inputted manually to WIPO sequence. The manual process of adding features can take an extraordinary amount of time. The growth in next generation oligonucleotide technologies also means that there is an increasing amount of “unusual” sequence information that must be inputted as features of the sequence. 

In contrast to WIPO sequence, public databases of sequence have purpose-built submission tools that facilitate the upload of vast quantities of annotated sequence information with little manual input. The submission tools for GenBank (e.g. BankIt), for example, allows automated input of sequence information in a format that includes feature information in the form of a 5-column Feature table. 

There is thus a huge disconnect between the automation tools necessary for high-throughput processing of sequence data in academia, and the clunky tools available for preparing patent sequence listing, despite the similar aims of both processes. Aligning patent sequence data submission with the automated processes for submitting sequences to publicly available sequence data would facilitate access to patent sequence data whilst simultaneously improving the process of sequence submission for applicants. 

Final thoughts

The dual-function of the sequence listing, first as a search tool of the patent office and second as a tool for increasing accessibility to patent sequence information, has resulted in a prescriptive sequence listing format that does not satisfactorily fulfil either purpose. Applicants are forced to submit lengthy sequence listings, in which only a small fraction of the sequences actually relate to the invention, using a manual process for inputting feature data that creates a substantial risk of sequence errors. Given that the tools for automating sequence submission to public sequence databases already exist, a radical rethinking of how patent sequence data is called-for. In the bioinformatic age, the present situation by which patent applicants are forced to manually input sequence data would be almost comical, if it didn’t have such potentially dire consequences for the accuracy of patent sequence data. 

Author: Rose Hughes

Related insights...

EBA decides G1/24 on claim interpretation: The description should always be consulted

  • 19th June 2025
The Enlarged Board of Appeal (EBA) has issued its highly anticipated decision in G 1/24, relating to fundamental questions about how patent claims should be interpreted when assessing patentability. The referral arose from diverging Board of Appeal case law on whether the description and drawings should always be used to interpret the claims, or should only…

Freedom to operate versus patentability in biotech: What the difference is and why it matters

  • 9th June 2025
When out and about talking with scientists, I am often asked to explain the difference between freedom to operate (FTO) and patentability. The distinction between FTO and patentability is highly relevant to the field of biotechnology and understanding the difference is crucial. However, whilst the majority of “Intro to IP” talks begin with telling us…

Alexion’s Soliris sequence error saga (Alexion v Samsung Bioepis [2025] EWHC 1240)

  • 4th June 2025
The recent UK High Court decision in Samsung v Alexion [2025] EWHC 1240 (Pat) is a stark reminder that you can never be too careful with patent sequence data. Mr Justice Meade’s ruling in this case highlights that simple mistakes in the sequence information can be fatal for both the enforcement and validity of the claims. The case, relating…

All to play for in CRISPR “first-to-invent” US patent battle as Federal Circuit instructs PTAB to reconsider (Regents v. Broad Institute, Nos. 22-1594, 22-1653)

  • 28th May 2025
In the latest decision of the CRISPR patent wars, the US Court of Appeal of the Federal Circuit delivered its long-awaited opinion in Regents v. Broad Institute (Nos. 22-1594, 22-1653) earlier this month. However, rather than providing a clear resolution to the dispute, the Federal Circuit vacated and remanded the case, extending the already protracted…

Event report: C5 Pharma & Biotech Patent Litigation in Europe (Day 1), Plausibility, UPC and antibody IP strategy

  • 27th May 2025
Last week, I had the pleasure of participating in the 2025 C5 Annual Forum on Pharma & Biotech Patent Litigation in Europe, held in Amsterdam. The conference was a reminder (if ever one was needed) that patent law never stands still. There was a lot of food for thought from the various panels and networking discussions. Some hot topics…

An LLM is not (yet) a person skilled in the art (T 1193/23)

  • 20th May 2025
In the new AI-age, where Large Language Models (LLMs) like ChatGPT can generate sophisticated textual answers in seconds, the relevance of AI to the patent industry is an active area of debate. The recent EPO Board of Appeal in T 1193/23 specifically questioned the reliability of LLM-generated interpretations for technical terms within patent claims. Central to this case…

First UPC decision to tackle infringement and validity of second medical claims (Sanofi v Amgen, UPC_CFI_505/2024)

  • 19th May 2025
The complexities of patenting new uses for known medical substances were recently considered by the Düsseldorf Local Division in the UPC’s first decision on second medical use claims. The case of Sanofi v Amgen (UPC_CFI_505/2024) specifically addressed the novelty and infringement standards applicable to second medical use claims. The case is part of the long-running dispute between Sanofi and…

Defining the undefinable: The challenges and opportunities for cell therapy IP

  • 13th May 2025
IP strategy for cell therapies presents unique challenges compared to traditional pharmaceuticals and biologics. Previous posts have highlighted the limitations of process patents in cell therapy and the need to think strategically about how best to protect the product from fast-follower competition. This post focusses on an even more fundamental challenge, that of the inherent…

Commercial success is a nothing-burger for the EPO in Wegovy patent inventive step analysis (T 1701/22, Obesity treatment with semaglutide)

  • 9th May 2025
In recent years, Novo Nordisk’s weight loss drug semaglutide, marketed as Wegovy for obesity and Ozempic for diabetes, has become a pharmaceutical phenomenon. As with most successful pharmaceutical products, the remarkable success of semaglutide means we can also expect some high profile IP disputes. In jurisdictions lacking provisions for patent term extension, semaglutide is expected to face generic challenge…

The plausibility of diagnostic inventions: No data, no patent (T 0589/22)

  • 30th April 2025
In T 0589/22, the Board of Appeal revoked a patent for an in vitro diagnostic method on the basis that the claimed diagnostic effect was not rendered plausible. According to the Board of Appeal’s interpretation of the claims, none of the examples in the patent related to the population of subjects defined by the claims.…
All Insights

evolve® is a trading entity of Evolve Intellectual Property Limited. Evolve Intellectual Property Limited is regulated by the Intellectual Property Regulation Board (IPReg). Details of the UK professional rules can be found on the IPReg website

registered address: 49 Greek Street, London, England, W1D 4EG

website out of house

© 2025 All Rights Reserved

Keep in touch

Contact Us