In this decision, the European Patent Office did not grant a patent on the concept of automatically generating a list of expressions semantically related to an input linguistic expression. Here are the practical takeaways of the decision T 1569/05 (Method for retrieving data/CANON) of 26.6.2008 of Technical Board of Appeal 3.5.01:


If a method is computer-implemented, it is considered technical.

However, features relating to automatically generating a list of expressions semantically related to an input linguistic expression are essentially not technical in nature; they concern the meaning of those expressions, i.e. their abstract linguistic information content.

The invention

This European patent application generally relates to a data retrieval apparatus and a corresponding method. More particularly, it relates to a database system or an interface system between a database system and its users (cf. EP 0 822 506 A2, page 1, lines 3-4). With the retrieval operations used in conventional pattern-matching systems, a user cannot retrieve data that has the same meaning but is expressed in a different form, or data that has a similar meaning. Moreover, pattern matching cannot deal with the polysemy of words (cf. EP 0 822 506 A2, page 1, lines 7-12). The Board in charge summarized the subject-matter of the underlying application as follows:

The invention is a data processing method (claim 13) and apparatus (claim 1) for searching a database. The stored data could be of any kind but for illustration it is here assumed they represent images in accordance with the third embodiment of the invention. Each image is described by a number of words (“comparison-subjected word group”) representing its contents. A user searching for an image inputs a keyword as well as a number of “context words” intended to define the appropriate semantic context. The keyword, the context words and the comparison-subjected words are transformed to vectors in what is referred to as “semantic space”. This space has been created using eigenvalue decomposition of “space generation words”, taken for example from a dictionary. The context vectors form a “semantic center”, which is a subspace of semantic space corresponding to the given context. The semantic center does not include (…) the axes corresponding to the most frequent meanings of the space generation words. The keyword vector and the comparison-subjected vector group are projected onto the semantic center and the distances (“correlation amounts”) between the keyword vector and the comparison-subjected vectors are calculated. The closest comparison-subjected vector is identified and the corresponding image retrieved from the database (see also p.4, l.44 to p.11, l.9 of the A-publication).

Fig. 3 of EP 0 822 506 A2
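
The retrieval pipeline summarized by the Board can be illustrated with a small NumPy sketch. It is only a toy model under stated assumptions, not the applicant's implementation: the feature vectors, the choice of matrix that is decomposed, the threshold used to pick the "semantic center", and all function and image names are hypothetical, and each image is reduced to a single vector standing in for its "comparison-subjected word group". The exclusion of the most frequent axes is left out here.

```python
import numpy as np

def build_semantic_space(space_generation_words):
    """Orthonormal axes (as columns) obtained by eigenvalue decomposition.

    Assumption: rows are feature vectors of the space generation words, and
    the decomposed matrix is their feature-by-feature correlation matrix.
    """
    correlation = space_generation_words.T @ space_generation_words
    eigenvalues, eigenvectors = np.linalg.eigh(correlation)  # symmetric input
    return eigenvectors[:, eigenvalues > 1e-10]  # drop null directions

def to_semantic_space(axes, features):
    """Express feature vectors (rows) in the coordinates of the semantic axes."""
    return features @ axes

def semantic_center(context_vectors, threshold):
    """Indices of the axes on which the summed context vectors are strong.

    Assumption: a simple magnitude threshold selects the context subspace.
    """
    summed = context_vectors.sum(axis=0)
    return np.flatnonzero(np.abs(summed) > threshold)

def retrieve(keyword_vector, image_vectors, center_axes):
    """Name of the image whose vector is closest to the keyword in the subspace."""
    def distance(name):
        diff = image_vectors[name][center_axes] - keyword_vector[center_axes]
        return float(np.linalg.norm(diff))
    return min(image_vectors, key=distance)

# Toy data: three features and three space generation words (all invented).
axes = build_semantic_space(np.diag([3.0, 2.0, 1.0]))
context = to_semantic_space(axes, np.array([[1.0, 0.0, 0.0], [1.0, 0.1, 0.0]]))
keyword = to_semantic_space(axes, np.array([1.0, 0.0, 0.0]))
images = {
    "img_a": to_semantic_space(axes, np.array([0.9, 0.0, 5.0])),
    "img_b": to_semantic_space(axes, np.array([0.1, 0.0, 0.0])),
}

center = semantic_center(context, threshold=0.5)
best_in_context = retrieve(keyword, images, center)
best_in_full_space = retrieve(keyword, images, np.arange(axes.shape[1]))
```

In this toy data, comparing only within the context subspace selects `img_a`, whereas comparing in the full space selects `img_b`. That difference is the kind of context-dependent retrieval the application is aiming at.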

Here is how the invention is defined in claim 13:

  • Claim 13 (main request)

Is it technical?

The first-instance examining division had refused the patent application for lack of inventive step in light of the cited prior art. In reaction thereto, the applicants appealed the decision.

With respect to technicality, the Board in charge stated as follows:

Claim 13 is directed to a “computer-implemented method… performed by a computer”. A computer being a technical means, the subject-matter of claim 13 is an invention within the meaning of Article 52(1) EPC.

The appellants agreed with the Board’s assessment that the subject-matter of claim 13 of the main request differs from the teaching of the closest prior art in that:

– a principal-axis index set is generated by calculating a sum vector of the space generation vector group and selecting an axis of the sum vector as the principal-axis index set if an absolute value of the corresponding element satisfies a condition for a ratio to an absolute value of a succeeding element in descending order of the absolute values, and

– the subspace into which the projector projects a vector contains no axes belonging to the principal-axis index set.
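
One possible reading of these two distinguishing features is sketched below in Python. The claim wording leaves the exact ratio condition open, so the rule used here (an axis is selected when its magnitude is at least `ratio` times that of the next axis in descending order) is an assumption, as are the function names, the default `ratio` value and the toy numbers.

```python
import numpy as np

def principal_axis_index_set(space_generation_vectors, ratio=2.0):
    """Axes of the sum vector that dominate their successor in descending order.

    Assumption: the "condition for a ratio" is read as
    |element| >= ratio * |succeeding element|.
    """
    sum_vector = space_generation_vectors.sum(axis=0)
    magnitudes = np.abs(sum_vector)
    order = np.argsort(-magnitudes, kind="stable")  # descending by magnitude
    principal = set()
    for position in range(len(order) - 1):  # the last axis has no successor
        current = magnitudes[order[position]]
        succeeding = magnitudes[order[position + 1]]
        if current > 0 and current >= ratio * succeeding:
            principal.add(int(order[position]))
    return principal

def project_without_principal_axes(vector, principal):
    """Keep only the coordinates on axes outside the principal-axis index set."""
    keep = [i for i in range(len(vector)) if i not in principal]
    return vector[keep]

# Toy space generation vectors: axis 0 carries a meaning shared by every word.
space_vectors = np.array([
    [4.0, 1.0, 1.0],
    [4.0, 1.0, 0.5],
    [4.0, 0.5, 1.0],
])
principal = principal_axis_index_set(space_vectors)
projected = project_without_principal_axes(np.array([7.0, 0.2, 0.3]), principal)
```

Here the sum vector is (12, 2.5, 2.5), so only axis 0 meets the assumed ratio condition and is dropped before any distances would be computed.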

The Board summarized the teaching of these two distinguishing features as follows:

3.2 (…) Hence, in essence the claimed data processing method differs from the prior art by a modification of the mathematical model of meaning used for data retrieval. Put simply, common elements of meaning, having no distinguishing power, are determined, and the corresponding axes are excluded from the subspace (“semantic center”) where the correlations between the keyword and the image descriptions (“comparison-subjected word group”) are evaluated.

However, the Board agreed with the assessment of the first-instance examining division and considered the distinguishing features to be non-technical:

3.3 Also the examining division found that the above two features (as they were then formulated) distinguished the invention from D3 (cf the decision under appeal, point 1.1). In the division’s opinion, the features merely caused a further restriction of the subspace to be searched (cf the decision under appeal, point 1.2). This was a technically non-functional modification of the known “mathematical model of meaning”, relating to the field of linguistics. The invention thus did not involve an inventive step.

In response, the appellants argued that the application relates to the technical field of utilizing a natural language as a search input, as allegedly confirmed by T 208/84. However, the Board in charge was not persuaded and held that the present case was not comparable to the decision cited by the appellants:

3.5 In the Board’s view, (…) the modified model according to the invention [is not] within the technical area, since only the meaning of the words determines how they are represented, stored and selected, and since mathematical algorithms completely define the processing.

3.6 A technical aspect can therefore at most be seen in the application of these models for retrieving data in a computer database, such retrieval being normally considered to have technical character.

In this respect, the Board further argued that using such a modified model would be obvious in light of the cited prior art:

3.8 (…) To use such a modified model for data retrieval is obvious in the light of D3. Search efficiency is a standard problem in data retrieval applications and any modification leading to faster and arguably better search results would be clearly desirable.

As a result, the Board in charge dismissed the appeal for lack of inventive step.

More information

You can read the whole decision here: T 1569/05 (Method for retrieving data/CANON) of 26.6.2008.

