Method for detecting current client-side browser encoding
First Claim
1. A method for detecting character set (also known as character encoding) currently selected on the browser on the world wide web client computer system, comprising:
- a sample Unicode string that contains a set of test character codes which is independent of current client encoding;
- a plurality of instructions comparing parts of sample Unicode strings with characters or sequences of characters directly encoded using various encodings to be detected;
- a function that returns the currently selected encoding.
Abstract
To make World Wide Web pages adaptable to the user's language and encoding, a method is provided by which the encoding currently set on the client browser can be detected from within the page being browsed. This makes it possible to feed the encoding back to the server side, and to adapt the page to the language most likely to match the user's native language. To achieve this, sample Unicode strings are matched against encoding-specific string values, selected so that a match uniquely determines the encoding currently set. Users around the world ordinarily do not change this setting and are often unaware of it. When forms are passed back to the server, knowing the encoding of the form data makes it possible to parse the data correctly and pass it on to search engines, databases, or other servers.
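The matching step described in the abstract can be illustrated with a minimal sketch. It is not the patented implementation: in a browser the comparison would run in page script, whereas here the browser's decoding of the page bytes is simulated in Python. The names `PROBE_BYTES`, `EXPECTED`, `simulate_browser_decode`, and `detect_encoding` are hypothetical, and the three candidate encodings are chosen only for illustration. The idea is that a single probe byte written literally in the page decodes to a different character under each candidate encoding, while the reference values are written as Unicode escapes and therefore do not depend on the client encoding.

```python
from typing import Optional

# Hypothetical probe: one raw byte embedded literally in the page.
# The browser decodes it using whichever encoding is currently selected.
PROBE_BYTES = b"\xe4"

# Encoding-independent reference values (Unicode escapes), one per candidate.
# Each is the character that byte 0xE4 represents in that encoding.
EXPECTED = {
    "windows-1252": "\u00e4",  # LATIN SMALL LETTER A WITH DIAERESIS
    "windows-1251": "\u0434",  # CYRILLIC SMALL LETTER DE
    "koi8-r": "\u0414",        # CYRILLIC CAPITAL LETTER DE
}


def simulate_browser_decode(selected_encoding: str) -> str:
    """Stand-in for the browser: decode the probe with the selected encoding."""
    return PROBE_BYTES.decode(selected_encoding)


def detect_encoding(decoded_probe: str) -> Optional[str]:
    """Return the candidate whose reference value matches the decoded probe."""
    for name, reference in EXPECTED.items():
        if decoded_probe == reference:
            return name
    return None  # none of the candidate encodings matched


if __name__ == "__main__":
    for selected in EXPECTED:
        print(selected, "->", detect_encoding(simulate_browser_decode(selected)))
```

Because each candidate maps the probe byte to a distinct character, a match identifies one encoding uniquely. A practical probe would likely need more than one byte to separate encodings that agree on a given position.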
5 Claims
Specification