Coding schemes translate real-world data into a form that computers can process easily.

Abstract

In this paper, I describe a new, powerful technique for coding data from textual sources, a technique based on concepts developed in the field of linguistics, particularly on the concept of semantic text grammar. I develop a grammar of data collection from the simple linguistic canonical structure noun phrase/verb phrase, namely, subject/action/object and their modifiers. I show that the grammar allows researchers to collect richer and more flexible data than more traditional coding schemes. In particular, the grammar produces coded output that, to a large extent, preserves both the syntax and the lexicon of the source material. Furthermore, the grammar makes the process of data collection independent of any prior specification of hypotheses, because virtually all relevant information provided by the sources can be easily coded. Finally, the grammar represents a general class of coding schemes. A set of well-known schemes used to analyze textual material can be subsumed under the grammar.

Journal Information

Sociological Methodology (SM) is the only American Sociological Association periodical publication devoted entirely to research methods. It is a compendium of new and sometimes controversial advances in social science methodology. Contributions come from diverse areas and have something new and useful--and sometimes surprising--to say about a wide range of methodological topics. SM seeks qualitative and quantitative contributions that address the full range of methodological problems confronted by empirical research in the social sciences, including conceptualization, data analysis, data collection, measurement, modeling, and research design. The journal provides a forum for engaging the philosophical issues that underpin sociological research. Papers published in SM are original methodological contributions, including new methodological developments, reviews or illustrations of recent developments that provide new methodological insights, and critical evaluative discussions of research practices and traditions. SM encourages the inclusion of applications to real-world sociological data. SM is published annually as an edited, hardbound book.

Publisher Information

American Sociological Association Mission Statement: Serving Sociologists in Their Work Advancing Sociology as a Science and Profession Promoting the Contributions and Use of Sociology to Society The American Sociological Association (ASA), founded in 1905, is a non-profit membership association dedicated to advancing sociology as a scientific discipline and profession serving the public good. With over 13,200 members, ASA encompasses sociologists who are faculty members at colleges and universities, researchers, practitioners, and students. About 20 percent of the members work in government, business, or non-profit organizations. As the national organization for sociologists, the American Sociological Association, through its Executive Office, is well positioned to provide a unique set of services to its members and to promote the vitality, visibility, and diversity of the discipline. Working at the national and international levels, the Association aims to articulate policy and impleme nt programs likely to have the broadest possible impact for sociology now and in the future.

Rights & Usage

This item is part of a JSTOR Collection.
For terms and use, please refer to our Terms and Conditions
Sociological Methodology © 1989 American Sociological Association
Request Permissions

What is encoding and decoding in a computer?

Encoding and decoding are used in many forms of communications, including computing, data communications, programming, digital electronics and human communications. These two processes involve changing the format of content for optimal transmission or storage.

In computers, encoding is the process of putting a sequence of characters (letters, numbers, punctuation, and certain symbols) into a specialized format for efficient transmission or storage. Decoding is the opposite process -- the conversion of an encoded format back into the original sequence of characters.

These terms should not be confused with encryption and decryption, which focus on hiding and securing data. (We can encrypt data without changing the code or encode data without deliberately concealing the content.)

What is encoding and decoding in data communications?

Encoding and decoding processes for data communications have interesting origins. For example, Morse code emerged in 1838 when Samuel Morse created standardized sequences of two signal durations, called dots and dashes, for use with the telegraph. Today's amateur radio operators still use Q-signals, which evolved from codes the British Postmaster General created in the early 1900s to ease communication among British ships and coast stations.

Manchester encoding was developed for storing data on magnetic drums of the Manchester Mark 1 computer, built in 1949. In that encoding model, each binary digit, or bit, is encoded low then high, or high then low, for equal time. Also known as phase encoding, the Manchester process of encoding is used in consumer infrared protocols, radio frequency identification and near-field communication.

What is encoding and decoding in programming?

Internet access relies on encoding. A Uniform Resource Locator (URL), the address of a webpage, can only be sent over the internet using the American Standard Code for Information Interchange (ASCII), which is the code used for text files in computing.

Coding schemes translate real-world data into a form that computers can process easily.
Here is an example of ASCII encoding for the string

In an ASCII file, a 7-bit binary number represents each character, which can be uppercase or lowercase letters, numbers, punctuation marks and other common symbols. However, URLs cannot contain spaces and often have characters that aren't in the ASCII character set. URL encoding, also called percent encoding, addresses this through the conversion of spaces -- to a + sign or with %20 -- and non-ASCII characters into a valid ASCII format.

Other commonly used codes in programming include BinHex, Multipurpose Internet Mail Extensions, Unicode and Uuencode.

Some ways encoding and decoding are used in various programming languages include the following.

In Java

Encoding and decoding in Java is a method of representing data in a different format to efficiently transfer information through a network or the web. The encoder converts data into a web representation. Once received, the decoder converts the web representation data into its original format.

In Python

In the Python programming language, encoding represents a Unicode string as a string of bytes. This commonly occurs when you transfer an instance over a network or save it to a disk file. Decoding transforms a string of bytes into a Unicode string. This happens when you receive a string of bytes from a disk file or the network.

In Swift

In the Apple Swift programming language, encoding and decoding models typically represent a serialization of object data from a JavaScript Object Notation string format. In this case, encoding represents serialization, while decoding signifies deserialization. Whenever you serialize data, you convert it into an easily transportable format. Once transported, it converts back into its original format. This approach standardizes the protocol and enables interoperability between different programming languages and platforms.

What is encoding and decoding in digital electronics?

In electronics, the terms encoding and decoding reference analog-to-digital conversion and digital-to-analog conversion. These terms can apply to any form of data, including text, images, audio, video, multimedia and software, and to signals in sensors, telemetry and control systems.

What is encoding and decoding in human communication?

People don't think about it as an encoding or decoding process, but human communication begins when a sender formulates (encodes) a message. They choose the message they will convey and a communication channel. People do this every day with little thought to the encoding process.

The receiver must make sense of (decode) the message by deducing the meaning of words and phrases to interpret the message correctly. They then can provide feedback to the sender.

Both the sender and receiver in any communication process must deal with noise that can get in the way of the communication process. Noise involves the various ways that messages get disrupted, distorted or delayed. These can include actual physiological noise, technical problems or semantic, psychological and cultural issues that get in the way of communication.

Coding schemes translate real-world data into a form that computers can process easily.
Encoding and decoding are an integral part of all communication.

These processes occur almost instantly in any of these three models:

  1. Transmission model. This model of communication is a linear process where a sender transmits a message to a receiver.
  2. Interaction model. In this model, participants take turns as senders and receivers.
  3. Transaction model. Here, communicators generate social realities within cultural, relational and social contexts. They communicate to create a relationship, engage with communities and form intercultural alliances. In this model, participants are labeled as communicators, not senders and receivers.

Decoding messages in your native tongue feels effortless. When the language is unfamiliar, however, the receiver may need a translator or tools like Google Translate for decoding the message.

Beyond the basics of encoding and decoding, machine translation capabilities have made significant progress of late. Find out more about machine translation technology and tools.

This was last updated in July 2021

Continue Reading About encoding and decoding

  • 5 essential programming languages for cybersecurity pros
  • A primer on DNA data storage and its potential uses
  • 2 data-wrangling techniques for better machine learning
  • Developers, get 'image conscious' for royalty-free image codec JPEG XL
  • 10 best programming languages to learn

Dig Deeper on Network Infrastructure

  • Coding schemes translate real-world data into a form that computers can process easily.
    Rich Text Format (RTF)

    Coding schemes translate real-world data into a form that computers can process easily.

    By: Katie Terrell Hanna

  • Coding schemes translate real-world data into a form that computers can process easily.
    binary

    Coding schemes translate real-world data into a form that computers can process easily.

    By: Rahul Awati

  • Coding schemes translate real-world data into a form that computers can process easily.
    codec

    Coding schemes translate real-world data into a form that computers can process easily.

    By: Alexander Gillis

  • Coding schemes translate real-world data into a form that computers can process easily.
    telegraph

    Coding schemes translate real-world data into a form that computers can process easily.

    By: Gavin Wright

Which components work together to perform processing operations?

Processor, called the central processing unit (CPU), Microprocessor. Its contain a control unit & an arithmetic logic unit (ALU). These 2 components work together to perform processing operations.

What is used to transfer information about where the data should reside in memory?

Techopedia Explains Data Bus A data bus can transfer data to and from the memory of a computer, or into or out of the central processing unit (CPU) that acts as the device's "engine." A data bus can also transfer information between two computers.

Which cloud services describes a computing environment where an Internet server hosts and deploys applications?

SaaS (Software-as-a-Service) SaaS—also known as cloud-based software or cloud applications—is application software that's hosted in the cloud and that you access and use via a web browser, a dedicated desktop client, or an API that integrates with your desktop or mobile operating system.

When a user runs an application what transfers from a storage device to memory?

When a user runs an application, its instructions transfer from a storage device to memory. Each processor core on a multi-core processor generally runs faster than a single-core processor. A dual-core processor doubles the processing speed of a single-core processor.