1.4. Text encoding

Lecture

Text is encoded in binary by assigning each character of an alphabet a specific integer. Eight binary digits can encode 2^8 = 256 different characters, which is enough to represent all the characters of the English and Russian alphabets.
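The character-to-integer mapping described above can be sketched in a few lines of Python. This is only an illustration: the sample word and the variable names are assumptions, and Python's `ord()` returns Unicode code points, which coincide with the ASCII codes for the Latin letters used here.

```python
# Each character is designated by an integer; ord() and chr() convert both ways.
text = "Code"  # sample word chosen for illustration

codes = [ord(ch) for ch in text]            # characters -> integers
bits = [format(c, "08b") for c in codes]    # integers -> 8-digit binary strings
restored = "".join(chr(c) for c in codes)   # integers -> characters

capacity = 2 ** 8  # number of distinct values eight binary digits can hold

print(codes)     # [67, 111, 100, 101]
print(bits)
print(restored, capacity)
```

Every code in the list fits into eight binary digits, which is why one byte per character is sufficient for such text.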

In the early years of computing, encoding text was difficult because the necessary coding standards did not exist. Today, on the contrary, the difficulties come from a multitude of standards that are in force simultaneously and often contradict one another.

For English, which is the de facto language of international communication, these difficulties have been resolved. The American National Standards Institute (ANSI) developed and introduced the American Standard Code for Information Interchange (ASCII), a 7-bit code that defines 128 characters.

For encoding the Russian alphabet, several variants of encodings were developed:

1) Windows-1251, introduced by Microsoft. Because this company's operating systems (OS) and other software products are widespread in the Russian Federation, the encoding has become widely used;

2) KOI-8 (the 8-bit Information Exchange Code), another popular encoding of the Russian alphabet, common in computer networks in the Russian Federation and in the Russian sector of the Internet;

3) ISO (International Organization for Standardization), the international standard for encoding characters of the Russian language (ISO 8859-5). In practice, this encoding is rarely used.
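The practical effect of having several encodings at once can be shown with a short Python sketch. It relies on assumptions: the codec names `cp1251`, `koi8_r` and `iso8859_5` are Python's standard codecs, taken here to correspond to Windows-1251, the KOI8-R variant of KOI-8, and ISO 8859-5 as the ISO standard in question. The sample word is arbitrary.

```python
# The same Russian word encoded with three different 8-bit code tables.
text = "Привет"  # sample word chosen for illustration
encodings = ["cp1251", "koi8_r", "iso8859_5"]  # assumed codec names, see lead-in

encoded = {name: text.encode(name) for name in encodings}

for name, data in encoded.items():
    # One byte per character in every table, but different byte values.
    print(f"{name:10} {data.hex(' ')}")

# Reading bytes with the wrong table yields unreadable text instead of an error.
garbled = encoded["koi8_r"].decode("cp1251")
print(garbled)
```

Each table spends exactly one byte per letter, yet the byte values differ, so a document is readable only if the reader knows which table was used to write it.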

The limited set of codes (256) creates difficulties for developers of a unified system for encoding text. It was therefore proposed to encode characters not with 8-bit binary numbers but with wider ones, which expands the range of possible code values. The system based on 16-bit character encoding is called universal: UNICODE. Sixteen bits provide unique codes for 65,536 characters, which is enough to accommodate the characters of most languages in one table. (Unicode has since been extended beyond 16 bits, to code points up to U+10FFFF, which are stored using encoding forms such as UTF-8 and UTF-16.)
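The arithmetic behind the wider code range can be checked directly. The following is a minimal sketch; the sample characters and variable names are assumptions chosen for illustration, and `ord()` is used to obtain each character's Unicode code.

```python
# Compare the number of codes available with 8 and with 16 binary digits.
capacity_8 = 2 ** 8     # 256
capacity_16 = 2 ** 16   # 65,536

# Characters from several scripts, chosen for illustration.
samples = ["A", "Я", "€", "中"]
code_points = {ch: ord(ch) for ch in samples}

# All of them fit in one 16-bit table, but not all fit in an 8-bit one.
fits_16 = all(cp < capacity_16 for cp in code_points.values())
fits_8 = all(cp < capacity_8 for cp in code_points.values())

for ch, cp in code_points.items():
    print(f"{ch} -> {cp} (U+{cp:04X})")
print(capacity_8, capacity_16, fits_8, fits_16)
```

Only the Latin letter has a code below 256; the other characters need the larger range, which is the motivation for a single wide table.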

Despite the simplicity of this approach, the practical transition to it was long held back by insufficient computer resources, since with 16-bit UNICODE encoding every text document automatically becomes twice as large. By the late 1990s the hardware had reached the required level, and the gradual transfer of documents and software to UNICODE began.
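The doubling of document size is easy to observe. This sketch assumes Python's `cp1251` codec as the 8-bit encoding and `utf-16-le` as the 16-bit one (the `-le` variant is chosen so that no byte order mark is added to the count). The sample string is arbitrary.

```python
# Size of the same text in an 8-bit and in a 16-bit encoding.
text = "Текст encoding"  # sample string chosen for illustration

size_8bit = len(text.encode("cp1251"))      # one byte per character
size_16bit = len(text.encode("utf-16-le"))  # two bytes per character

print(len(text), size_8bit, size_16bit)     # 14 14 28
```

The ratio holds for any text whose characters all fit in the 16-bit range, regardless of language.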

