Is it time for patent offices to enter the bioinformatic age?

Sector: Biotechnology

13th June 2025

In a world in which incalculable amounts of sophisticated sequence data is freely available, are the clunky processes necessary to input patent sequence data really fit-for-purpose?

Originally published on IPKat.

The dual-purpose of patent sequence listings

All patent applications containing sequence data in the claims, figures or description are required to submit a sequence listing. The sequence listing provides the sequences, together with a unique sequence identification number (SEQ ID NO:) in a prescribed format and additional data such as the organism and the position of any unusual features in the sequence.

The sequence listing serves two purposes. First, the sequence listing is used by the patent office to search for the sequences disclosed in the patent application. The prescribed format of the sequence listing assists in the automation of this search. Importantly, all the sequences in the patent application must be included in the sequence listing, regardless of whether they are part of the invention or just relate to tool compounds used in the examples. If a sequence is disclosed in the specification, the sequence must be included in the sequence listing (with a few exceptions, such as very short sequences). This allows the patent office to search for all of the disclosed sequences in the application.

The second function of the sequence listing is to facilitate public access to the sequence information disclosed in patent applications. There is an understandable desire for the sequence data in patents, which may not be published elsewhere, to be searchable in public databases of sequence information such as GenBank. One of the purposes of the shift to ST.26 sequence listing format was to facilitate better integration between patent sequence data and these public databases of sequences.

Introduction of ST.26

From 1 July 2022, the old international standard for sequence listings, ST.25, was replaced by the new ST.26 standard. The shift to ST.26, and the introduction of the new WIPO software for preparing ST.26 sequence listings had the aim of increasing public access to patent sequences. The process of preparing ST.26 sequence listings has some improvements over ST.25, but also some new disadvantages. One of the main issues with ST.26 is that the sequence listing is no longer submitted in a human-readable txt format, but instead as a complex XML file. In an effort to address this problem, WIPO has now introduced the ability to visualise ST.26 sequence listings in an internet browser directly from patentscope, so that users no longer have to download the XML sequence listing and import it into WIPO sequence.

Further information about ST.26 and WIPO sequence can be found at the WIPO Sequence and ST.26 Knowledge Base. Users can also subscribe to the WIPO sequence listing newsletter.

The risks of errors in patent sequence data

Sequence information is often the most important part of a patent. Many inventions in the biotech field will be defined in the claims by their sequences, and usually by a SEQ ID for that sequence, as provided in the sequence listing. Therefore, if the sequence listing is incorrect even by a single letter, then the claims of the patent may define completely different subject matter to what the applicant intended to claim. In some cases, this could mean that the patent does not cover the commercial embodiment of an invention.

The decision in T 1213/05 demonstrates the potentially fatal consequences of there being errors in a patent’s sequence data. In this case, the sequences provided in the priority document contained inadvertent sequence errors. The Board of Appeal found that a priority claim for the corrected sequences in the European patent was therefore invalid. The Board of Appeal cited with agreement the reasoning in T 70/05 that a priority claim to an incorrect sequence cannot be maintained, regardless of the reasons for the possible mistakes, either arising from unintended sequencing or typing errors.

The Board of Appeal T 1213/05 also rejected the patentee’s arguments that the skilled person’s knowledge of a certain margin of error in sequence data permitted there to be some deviation between the sequence in the priority document and the patent application claiming priority. For the Board of Appeal, the DNA sequences had to be identical to relate to “the same invention” and permit a valid priority claim. Claims directed to the corrected sequences were consequently found invalid in view of intervening prior art disclosing the sequences.

The decision in T 1213/05 shows that anything less than 100% accuracy in the sequence data can be fatal for a patent. In this context, it is worth bearing in mind that sequence data consists of a list of many strings of letters, where each string (“sequence”) can be thousands of letters long. Making a mistake in just one letter out of the potentially millions of letters in a sequence listing can be both at once very easy to do and almost impossible to detect. The burden to applicants in preparing sequence data for a patent application therefore does not just include the time and cost associated with preparing the sequence listing. In order to avoid potentially fatal errors in the sequence data, robust procedures for checking and validating the sequencing listing are also necessary. However, the only way to effectively minimize errors in sequence listings for which there may be hundreds or even thousands of sequences, is to automate the process.

Automated processes for dealing with sequences – Lessons to be learnt from bioinformatics?

With the arrival of high throughput sequencing, it became necessary for the academic community to devise automated processes for dealing with vast quantities of sequence data and for uploading these sequences to public sequence databases. Compared with the automation tools used by bioinformaticians, the process of preparing and validating sequence listings for patent applications is exceedingly clunky.

In order to prepare a ST.26 sequence listing it is necessary to input each sequence and its features into the purpose-built WIPO sequence tool. Unlike ST.25, it is possible to import multiple sequences for your ST.26 sequence listing at once, e.g. in FASTA format, instead of copying and pasting each individual sequence. However, the “features” of each sequence in a sequence listing, such as unusual amino acids, still have to be inputted manually to WIPO sequence. The manual process of adding features can take an extraordinary amount of time. The growth in next generation oligonucleotide technologies also means that there is an increasing amount of “unusual” sequence information that must be inputted as features of the sequence.

In contrast to WIPO sequence, public databases of sequence have purpose-built submission tools that facilitate the upload of vast quantities of annotated sequence information with little manual input. The submission tools for GenBank (e.g. BankIt), for example, allows automated input of sequence information in a format that includes feature information in the form of a 5-column Feature table.

There is thus a huge disconnect between the automation tools necessary for high-throughput processing of sequence data in academia, and the clunky tools available for preparing patent sequence listing, despite the similar aims of both processes. Aligning patent sequence data submission with the automated processes for submitting sequences to publicly available sequence data would facilitate access to patent sequence data whilst simultaneously improving the process of sequence submission for applicants.

Final thoughts

The dual-function of the sequence listing, first as a search tool of the patent office and second as a tool for increasing accessibility to patent sequence information, has resulted in a prescriptive sequence listing format that does not satisfactorily fulfil either purpose. Applicants are forced to submit lengthy sequence listings, in which only a small fraction of the sequences actually relate to the invention, using a manual process for inputting feature data that creates a substantial risk of sequence errors. Given that the tools for automating sequence submission to public sequence databases already exist, a radical rethinking of how patent sequence data is called-for. In the bioinformatic age, the present situation by which patent applicants are forced to manually input sequence data would be almost comical, if it didn’t have such potentially dire consequences for the accuracy of patent sequence data.

Author: Rose Hughes

Related insights...

Mechanistic insights supporting the sufficiency and inventive step of a therapeutic use (without clinical data) (T 1601/22)

24th February 2026

At the EPO, it is perfectly possible for a therapeutic invention to survive without clinical data. The recent decision in T 1601/22 confirmed this sometimes surprisingly low bar for sufficiency in Europe for therapeutic invention.

Non-reproducible products can be the closest prior art (T 1719/21)

17th February 2026

G 1/23 establishes that products made available to the public are prior art in Europe, regardless of reproducibility. While this simplifies novelty, focusing strictly on disclosure dates, it complicates inventive step assessments. Notably, T 1719/21 questions whether these non-reproducible products can serve as the “closest prior art” in the EPO’s problem-solution approach.

ViCo oral proceedings: Whatever happened to the in-person “Gold-Standard”?

14th December 2025

We are now many years on from the pandemic conditions that initially led to the introduction of oral proceedings by video conferencing for Board of Appeal cases. But what happened to the “Gold Standard” of in-person proceedings promised by the EBA in G 1/21?

EPO pharma case law trends 2025: Clinical inventions

26th November 2025

The law in the pharma sector field is also constantly evolving. Understanding the case law trends when drafting, prosecuting and defending these cases is therefore paramount. In our second post on EPO pharma case law trends in 2025 (see Evolve Insights), we review the most impactful decisions of the year relating to clinical-stage inventions.

EPO pharma case law trends 2025: Antibodies and biologics

19th November 2025

The science of biologics is rapidly progressing, with the development of ever more complex protein structures, incorporation of molecules into cell therapies and the increasing use of AI-assisted design and in silico modelling. Patent law must respond to these new challenges. What better time to take a look at the trends from the EPO case…

Insufficiency resulting from mutually exclusive definitions: The repercussive effect of dependent claims (T 0878/23)

18th November 2025

In T 0878/23, the Board of Appeal ruled that mutually exclusive ranges in dependent claims constitute fatal insufficiency rather than a mere lack of clarity. This decision underscores the “repercussive effect” of claim dependencies, warning that internal contradictions can make an invention technically impossible to perform.

First use of G 1/24 to broaden clear claim language (T 1849/23)

16th November 2025

This significant decision is the first from the Boards of Appeal to apply G 1/24 to the use of information from the description to broaden otherwise clear claim language.

Sufficiency at the priority date: A study protocol is not “the same” as a therapeutic effect invention (T0883/23)

31st October 2025

Therapeutic inventions are generally not considered sufficiently disclosed absent supporting data. The recent decision in T 0883/23 found that this applies both at the priority date and the filing date of the patent.

Patentee’s own post-published data undermines the credibility of their broad cat antibody patent (T 0709/23)

27th October 2025

How early is too early to file a biotech patent? EPO decision T 0709/23 provides a costly answer, demonstrating the fatal risks of claiming a broad therapeutic use before the link between structure, function, and actual effect is truly understood.

Use of AI in the patent industry: The spectre of hallucination

10th October 2025

What are the risks of AI hallucinations for the patent industry?

PHARMACEUTICAL IP