📌 For those reading this article: If you are drafting your statement using generated AI such as ChatGPT, please also use the AI draft x patent attorney check page.
“When I ask a question on ChatGPT, how can I get the best answer?”—this is a question many users have. This technical field, called prompt engineering, has become important in business and is also attracting attention as the subject of patent applications. However, can innovations in how to use AI tools be recognized as patents? In this article, we will explain in detail from a patent attorney's perspective the conditions under which a large-scale language model (LLM) prompt generation technology such as ChatGPT can be patented, using case 38 published by the Japan Patent Office.
In March 2024, the Japan Patent Office expanded the examination cases of AI-related technologies and published 38 cases related to "prompt text generation method." This case shows that the method of using AI tools such as ChatGPT can be patentable, and has important implications not only for AI development companies but also for general companies that utilize AI tools.
In this article, we will explain this case 38 in detail and clarify the difference between "mere use of AI" and "patentable AI utilization technology." By reading this, you will understand the possibility that your company's AI utilization method can be patented, as well as the patent risks of your competitors. Additionally, you can get hints on how to structure your IP strategy for future business development.
Case 38 "Prompt sentence generation method for input to a large-scale language model" has been published as a case study regarding determination of inventive step. This case study is an introduction to techniques for optimizing input to LLM, particularly how to generate effective prompts within the technical constraints of a limited number of characters.
Currently, many LLMs, including OpenAI's GPT series, have an upper limit on the number of tokens (number of characters) that can be entered at one time. For example, GPT-4 has an input limit of 8,192 or 32,768 tokens depending on the model. How to create high-quality prompts within these constraints is an important technical issue in effectively utilizing LLM.
In particular, RAG (Retrieval-Augmented Generation) technology, which improves the accuracy of LLM answers by adding context and reference information, is attracting attention, and technology that efficiently and effectively adds information with a limited number of input tokens can be said to have high practical value.
Case 38 presents the following two claims.
A prompt sentence generation method in which a computer generates a prompt to be input into a large-scale language model by adding reference information to an input question sentence,
The large-scale language model has a character limit that is the upper limit of the number of prompt characters that can be input, and when a prompt including a question sentence is input, an answer sentence regarding the question sentence is output. Based on the input question sentence, an additional sentence generation step that generates an additional sentence related to the question sentence so that the total number of characters including the number of characters of the question sentence is equal to or less than the limited number of characters; a prompt generation step of generating the prompt by adding the additional sentence generated in the additional sentence generation step as the reference information to the input question sentence;
The prompt text generation method according to claim 1, wherein the additional text generation step is a step of obtaining a plurality of related texts related to the input question text based on the input question text, extracting a plurality of keywords suitable as the reference information from the obtained plurality of related texts, and using the plurality of keywords to generate the additional text in which the total number of characters does not exceed the limited number of characters.
From the detailed explanation of the invention in Case 38, the following technical problems and solutions can be understood.
When using large-scale language models, there may be a limit on the number of characters that can be input, and there was a problem in that it was not possible to add an unlimited number of reference information to the input question text. The present invention aims to provide a method for adding valid reference information to question sentences and generating prompt sentences within a predetermined character limit.
The solution of claim 1 is to generate additional text based on the input question text so that the total number of characters including the number of characters in the question text is less than the limit number of characters, and add it to the question text to generate a prompt.
Claim 2 adopts, as a more specific solution, a method of acquiring multiple related sentences related to the question text, extracting keywords suitable for reference information from them, and generating additional sentences using those keywords. Regarding the acquisition of related texts, for example, a method is shown to extract information that is highly relevant to the question text from a database of the questioner's question history, behavior history, purchase history, etc.
The effect of claim 1 is that it is possible to generate a prompt with valid additional sentences added as reference information to the question text input within the character limit.
An additional effect of claim 2 is that it is possible to generate a prompt with additional text that is highly relevant to the question text and suitable as reference information within a predetermined character limit, and that it is possible to obtain more reliable and appropriate answer texts.
Case 38 does not include detailed drawings, but based on the description in the book, the following system configuration is conceptually assumed.
In Case 38, the following cited invention 1 and common general technical knowledge are shown.
A prompt text generation method in which a computer generates a prompt to be input to a large-scale language model by adding reference information to an input question text,
The large-scale language model is a large-scale language model that outputs an answer regarding the question text when a prompt including a question text is input,
The computer: r>An additional sentence generation step of generating an additional sentence related to the question sentence based on the input question sentence;
A prompt generation step of generating the prompt by adding the additional sentence generated by the additional sentence generation step to the input question sentence as the reference information;
In the technical field of language processing, preventing the amount of information processing from becoming excessive is an obvious problem that those skilled in the art usually consider.In addition, as a solution to this problem, it is well known at the time of filing to set a character limit that is the upper limit of the text that can be input, and when the text exceeds the limit, discard the part that exceeds the limit so that the actual input text is created to have the number of characters less than the limit.
Comparing Claim 1 and Cited Invention 1, the following differences are recognized.
In the large-scale language model of the invention according to claim 1, a limit number of characters is set as the upper limit of the number of prompt characters that can be input, and the additional sentence generation step is performed to generate the question text so that the total number of characters including the number of characters of the question text is equal to or less than the limit number of characters. While the large-scale language model of Cited Invention 1 generates additional sentences related to sentences, it is unclear whether or not there is a character limit set that is the upper limit of the number of prompt characters that can be input, and it is unclear whether the additional sentence generation step generates additional sentences as described above.
With regard to this difference, the examiner has determined that, considering the above-mentioned common general knowledge, a person skilled in the art could have easily conceived of generating the additional text so that the number of characters is below the limit.
Specifically, considering the amount of information to be processed is a self-evident issue in the field of language processing, and applying the well-known technique of setting a character limit and discarding the portion exceeding the limit is nothing more than the exercise of the ordinary creative ability of those skilled in the art.
Regarding claim 2, in addition to the above differences, the following differences are recognized.
The additional text generation step of the invention according to claim 2 obtains a plurality of related texts related to the question text based on the input question text, extracts a plurality of keywords suitable as reference information from the obtained plurality of related texts, and uses the plurality of keywords to generate an additional text whose total number of characters does not exceed the limit number of characters, whereas the additional text generation step of the cited invention 1 does not include such specification.
Regarding this difference, the examiner has determined as follows.
No prior art has been discovered that discloses a configuration in which a plurality of related sentences are acquired, keywords suitable for reference information are extracted from them, and additional sentences are generated within the limited number of characters using the keywords, nor is it common general knowledge at the time of filing.
The invention according to claim 2 has an advantageous effect compared to the cited invention 1 in that, due to the configuration, "a prompt with additional text that is highly relevant to the question text and suitable as reference information can be generated within a predetermined limited number of characters, and a more reliable and appropriate answer text can be obtained."
This is not just a design change, but an invention with an inventive step.
Claim 1 of Case 38 is denied inventive step because it merely presents an abstract solution of "generating additional sentences so that the number of characters is below the limit." This is because it is common technical knowledge to consider the character limit, and those skilled in the art can easily come up with the idea of simply keeping it within the character limit.
On the other hand, claim 2 specifically identifies the following three steps.
The description of this specific solution leads to a positive judgment of inventive step.
For LLM-related inventions, it is not enough to simply "use LLM"; it is important to present specific technical means. In particular, it is effective to assert inventive step by adding unique innovations to the preprocessing of input data and postprocessing of output data, and specifying the specific processing flow.
One of the reasons why claim 2 was determined to have inventive step is that the effect of "obtaining a more reliable and appropriate response text" was evaluated as an effect that is difficult to predict based on cited invention 1.
This effect is not just a quantitative effect of cramming information into a limited number of characters, but focuses on the fact that by using keywords extracted from related sentences, it is possible to generate qualitatively superior additional sentences. In other words, we are claiming the effect of qualitative improvement within quantitative constraints (limited number of characters).
For LLM-related inventions, it is also effective to claim effects that go beyond simple automation and efficiency, especially the following effects.
In Case 38, the novel method of ``extracting keywords from related sentences and generating additional sentences'' was judged to be an inventive step, as opposed to the well-known technique of ``setting a limit on the number of characters and discarding the part that exceeds it''.
From this judgment, it is clear that it is important to clarify the points of differentiation from well-known technologies, such as the following, even in LLM-related inventions.
The following points can be learned from Case 38 when writing claims for LLM-related inventions.
Rather than simply improving efficiency and automation, we will set specific technical issues in the use of LLM. For example:
Describe the steps of a concrete solution, not an abstract one. As in Claim 2 of Case 38, it is effective to describe a solution that combines multiple steps (obtaining related sentences → extracting keywords → generating additional sentences).
It is important to include technical features that support the effect in the claim, such as "extracting keywords suitable for reference information." This allows us to claim that it is not just a design change, but a technical ingenuity aimed at a specific outcome.
Although Case 38 shows only a method invention, in an actual application, you should consider preparing multiple independent claims as shown below.
The following points are particularly important when preparing specifications for LLM-related inventions.
Specifically point out the problems of the prior art and clearly explain the technical issues that the present invention attempts to solve. In Case 38, the issue of "existence of character limit" is clearly stated. For LLM-related inventions, it is important to specifically describe the following issues.
We will describe concrete implementations rather than abstract explanations. It is important to explain in detail the method for acquiring related sentences, the keyword extraction algorithm, the sentence generation process, etc.
Specifically, it is effective to include the following contents.
It is effective to support the effect of ``obtaining more reliable and appropriate response sentences'' with specific experimental results or comparative data. For example, you should consider including supporting information such as:
By clarifying the specific technical field to which the LLM-related invention is applied (e.g. customer support, medical diagnosis, legal consultation, etc.) and describing the challenges and effects specific to that field, patentability can be expected to improve.
From the analysis of Case 38, the following are important points in obtaining rights for LLM-related inventions.
In patent applications for LLM-related inventions, it is difficult to recognize inventive step simply by using LLM. However, as Case 38 shows, by adding specific technical innovations such as preprocessing input data and optimizing output data, it is possible to obtain a patent with an inventive step.
In recent years, various services utilizing LLM such as ChatGPT and Gemini have appeared, but in order to differentiate them and secure competitive advantage, it is essential to strategically obtain rights for LLM-related inventions. In particular, technologies that achieve higher quality LLM usage by adding ingenuity to preprocessing input data, as in Case 38, will become increasingly important in the future.
Our firm has a wealth of experience in filing patent applications for AI/LLM-related inventions, and supports the planning of patent acquisition strategies based on the latest examination trends. If you are concerned about patent protection for a new business that utilizes large-scale language models, please feel free to contact us.
🤖 For those who created a draft of a patent specification with ChatGPT
You are welcome to bring in your AI drafts. A patent attorney who specializes in IT and AI patents will rewrite the generated AI draft into a "strong title deed that protects your business."
Leave the final check of the statement quality to a professional with over 15 years of experience in intellectual property practice.
AUTHOR
Takefumi SUGIURA (杉浦 健文)
EVORIX Intellectual Property Law Firm Managing Patent Attorney
Supports clients across IT, manufacturing, startups, fashion, and medical industries, covering patent, trademark, design, and copyright filings through trials and infringement litigation. Specialized in IP strategy for AI, IoT, Web3, and FinTech. Member of the Japan Patent Attorneys Association (JPAA), Asian Patent Attorneys Association (APAA), and Japan Trademark Association (JTA).