Thebitis the most basicunit of informationincomputingand digitalcommunication.The name is aportmanteauofbinary digit.[1]The bit represents alogical statewith one of two possiblevalues.These values are most commonly represented as either"1"or"0",but other representations such astrue/false,yes/no,on/off,or+/are also widely used.

The relation between these values and the physical states of the underlyingstorageordeviceis a matter of convention, and different assignments may be used even within the same device orprogram.It may be physically implemented with a two-state device.

A contiguous group of binary digits is commonly called abit string,a bit vector, or a single-dimensional (or multi-dimensional)bit array. A group of eight bits is called onebyte,but historically the size of the byte is not strictly defined.[2]Frequently, half, full, double and quadruple words consist of a number of bytes which is a low power of two. A string of four bits is usually anibble.

Ininformation theory,one bit is theinformation entropyof a randombinaryvariable that is 0 or 1 with equal probability,[3]or the information that is gained when the value of such a variable becomes known.[4][5]As aunit of information,the bit is also known as ashannon,[6]named afterClaude E. Shannon.

The symbol for the binary digit is either "bit", per theIEC 80000-13:2008 standard, or the lowercase character "b", per theIEEE 1541-2002standard. Use of the latter may create confusion with the capital "B" which is the international standard symbol for the byte.

History

edit

The encoding of data by discrete bits was used in thepunched cardsinvented byBasile Bouchonand Jean-Baptiste Falcon (1732), developed byJoseph Marie Jacquard(1804), and later adopted bySemyon Korsakov,Charles Babbage,Herman Hollerith,and early computer manufacturers likeIBM.A variant of that idea was the perforatedpaper tape.In all those systems, the medium (card or tape) conceptually carried an array of hole positions; each position could be either punched through or not, thus carrying one bit of information. The encoding of text by bits was also used inMorse code(1844) and early digital communications machines such asteletypesandstock ticker machines(1870).

Ralph Hartleysuggested the use of a logarithmic measure of information in 1928.[7]Claude E. Shannonfirst used the word "bit" in his seminal 1948 paper "A Mathematical Theory of Communication".[8][9][10]He attributed its origin toJohn W. Tukey,who had written a Bell Labs memo on 9 January 1947 in which he contracted "binary information digit" to simply "bit".[8]

Physical representation

edit

A bit can be stored by a digital device or other physical system that exists in either of two possible distinctstates.These may be the two stable states of aflip-flop,two positions of anelectrical switch,two distinctvoltageorcurrentlevels allowed by acircuit,two distinct levels oflight intensity,two directions ofmagnetizationorpolarization,the orientation of reversible double strandedDNA,etc.

Bits can be implemented in several forms. In most modern computing devices, a bit is usually represented by anelectricalvoltageorcurrentpulse, or by the electrical state of a flip-flop circuit.

For devices usingpositive logic,a digit value of1(or a logical value of true) is represented by a more positive voltage relative to the representation of0.Different logic families require different voltages, and variations are allowed to account for component aging and noise immunity. For example, intransistor–transistor logic(TTL) and compatible circuits, digit values0and1at the output of a device are represented by no higher than 0.4 V and no lower than 2.6 V, respectively; while TTL inputs are specified to recognize 0.8 V or below as0and 2.2 V or above as1.

Transmission and processing

edit

Bits are transmitted one at a time inserial transmission,and by a multiple number of bits inparallel transmission.Abitwise operationoptionally processes bits one at a time. Data transfer rates are usually measured in decimal SI multiples of the unitbit per second(bit/s), such as kbit/s.

Storage

edit

In the earliest non-electronic information processing devices, such as Jacquard's loom or Babbage'sAnalytical Engine,a bit was often stored as the position of a mechanical lever or gear, or the presence or absence of a hole at a specific point of apaper cardortape.The first electrical devices for discrete logic (such aselevatorandtraffic lightcontrolcircuits,telephone switches,and Konrad Zuse's computer) represented bits as the states ofelectrical relayswhich could be either "open" or "closed". When relays were replaced byvacuum tubes,starting in the 1940s, computer builders experimented with a variety of storage methods, such as pressure pulses traveling down amercury delay line,charges stored on the inside surface of acathode-ray tube,or opaque spots printed onglass discsbyphotolithographictechniques.

In the 1950s and 1960s, these methods were largely supplanted bymagnetic storagedevices such asmagnetic-core memory,magnetic tapes,drums,anddisks,where a bit was represented by the polarity ofmagnetizationof a certain area of aferromagneticfilm, or by a change in polarity from one direction to the other. The same principle was later used in themagnetic bubble memorydeveloped in the 1980s, and is still found in variousmagnetic stripitems such asmetrotickets and somecredit cards.

In modernsemiconductor memory,such asdynamic random-access memory,the two values of a bit may be represented by two levels ofelectric chargestored in acapacitor.In certain types ofprogrammable logic arraysandread-only memory,a bit may be represented by the presence or absence of a conducting path at a certain point of a circuit. Inoptical discs,a bit is encoded as the presence or absence of amicroscopicpit on a reflective surface. In one-dimensionalbar codes,bits are encoded as the thickness of alternating black and white lines.

Unit and symbol

edit

The bit is not defined in theInternational System of Units(SI). However, theInternational Electrotechnical Commissionissued standardIEC 60027,which specifies that the symbol for binary digit should be 'bit', and this should be used in all multiples, such as 'kbit', for kilobit.[11]However, the lower-case letter 'b' is widely used as well and was recommended by theIEEE 1541 Standard (2002).In contrast, the upper case letter 'B' is the standard and customary symbol for byte.

Multiple bits

edit
Decimal
Value Metric
1000 kbit kilobit
10002 Mbit megabit
10003 Gbit gigabit
10004 Tbit terabit
10005 Pbit petabit
10006 Ebit exabit
10007 Zbit zettabit
10008 Ybit yottabit
10009 Rbit ronnabit
100010 Qbit quettabit
Binary
Value IEC Memory
1024 Kibit kibibit Kbit Kb kilobit
10242 Mibit mebibit Mbit Mb megabit
10243 Gibit gibibit Gbit Gb gigabit
10244 Tibit tebibit
10245 Pibit pebibit
10246 Eibit exbibit
10247 Zibit zebibit
10248 Yibit yobibit
Orders of magnitude of data

Multiple bits may be expressed and represented in several ways. For convenience of representing commonly reoccurring groups of bits in information technology, severalunits of informationhave traditionally been used. The most common is the unitbyte,coined byWerner Buchholzin June 1956, which historically was used to represent the group of bits used to encode a singlecharacterof text (untilUTF-8multibyte encoding took over) in a computer[2][12][13][14][15]and for this reason it was used as the basicaddressableelement in manycomputer architectures.The trend in hardware design converged on the most common implementation of using eight bits per byte, as it is widely used today.[as of?]However, because of the ambiguity of relying on the underlying hardware design, the unitoctetwas defined to explicitly denote a sequence of eight bits.

Computers usually manipulate bits in groups of a fixed size, conventionally named "words".Like the byte, the number of bits in a word also varies with the hardware design, and is typically between 8 and 80 bits, or even more in some specialized computers. In the early 21st century, retail personal or server computers have a word size of 32 or 64 bits.

TheInternational System of Unitsdefines a series of decimal prefixes for multiples of standardized units which are commonly also used with the bit and the byte. The prefixeskilo(103) throughyotta(1024) increment by multiples of one thousand, and the corresponding units are thekilobit(kbit) through theyottabit(Ybit).

Information capacity and information compression

edit

When the information capacity of a storage system or a communication channel is presented inbitsorbits per second,this often refers to binary digits, which is acomputer hardwarecapacity to store binary data (0or1,up or down, current or not, etc.).[16]Information capacity of a storage system is only an upper bound to the quantity of information stored therein. If the two possible values of one bit of storage are not equally likely, that bit of storage contains less than one bit of information. If the value is completely predictable, then the reading of that value provides no information at all (zero entropic bits, because no resolution of uncertainty occurs and therefore no information is available). If a computer file that usesnbits of storage contains onlym<nbits of information, then that information can in principle be encoded in aboutmbits, at least on the average. This principle is the basis ofdata compressiontechnology. Using an analogy, the hardware binary digits refer to the amount of storage space available (like the number of buckets available to store things), and the information content the filling, which comes in different levels of granularity (fine or coarse, that is, compressed or uncompressed information). When the granularity is finer—when information is more compressed—the same bucket can hold more.

For example, it is estimated that the combined technological capacity of the world to store information provides 1,300exabytesof hardware digits. However, when this storage space is filled and the corresponding content is optimally compressed, this only represents 295 exabytes of information.[17]When optimally compressed, the resulting carrying capacity approachesShannon informationorinformation entropy.[16]

Bit-based computing

edit

Certainbitwisecomputerprocessorinstructions (such asbit set) operate at the level of manipulating bits rather than manipulating data interpreted as an aggregate of bits.

In the 1980s, whenbitmappedcomputer displays became popular, some computers provided specializedbit block transferinstructions to set or copy the bits that corresponded to a given rectangular area on the screen.

In most computers and programming languages, when a bit within a group of bits, such as abyteorword,is referred to, it is usually specified by a number from 0 upwards corresponding to its position within the byte or word. However, 0 can refer to either themostorleast significant bitdepending on the context.

Other information units

edit

Similar totorqueandenergyin physics;information-theoretic informationand data storage size have the samedimensionalityofunits of measurement,but there is in general no meaning to adding, subtracting or otherwise combining the units mathematically, although one may act as a bound on the other.

Units of information used in information theory include theshannon(Sh), thenatural unit of information(nat) and thehartley(Hart). One shannon is the maximum amount of information needed to specify the state of one bit of storage. These are related by 1 Sh ≈ 0.693 nat ≈ 0.301 Hart.

Some authors also define abinitas an arbitrary information unit equivalent to some fixed but unspecified number of bits.[18]

See also

edit

References

edit
  1. ^Mackenzie, Charles E. (1980).Coded Character Sets, History and Development(PDF).The Systems Programming Series (1 ed.).Addison-Wesley Publishing Company, Inc.p. x.ISBN978-0-201-14460-4.LCCN77-90165.Archived(PDF)from the original on May 26, 2016.RetrievedAugust 25,2019.
  2. ^abBemer, Robert William(2000-08-08)."Why is a byte 8 bits? Or is it?".Computer History Vignettes.Archived fromthe originalon 2017-04-03.Retrieved2017-04-03.[…] WithIBM'sSTRETCHcomputer as background, handling 64-character words divisible into groups of 8 (I designed the character set for it, under the guidance of Dr.Werner Buchholz,the man who DID coin the term "byte"for an 8-bit grouping). […] TheIBM 360used 8-bit characters, although not ASCII directly. Thus Buchholz's "byte" caught on everywhere. I myself did not like the name for many reasons. […]
  3. ^Anderson, John B.; Johnnesson, Rolf (2006),Understanding Information Transmission
  4. ^Haykin, Simon (2006),Digital Communications
  5. ^IEEE Std 260.1-2004
  6. ^"Units: B".Archivedfrom the original on 2016-05-04.
  7. ^Abramson, Norman (1963).Information theory and coding.McGraw-Hill.
  8. ^abShannon, Claude Elwood(July 1948)."A Mathematical Theory of Communication"(PDF).Bell System Technical Journal.27(3): 379–423.doi:10.1002/j.1538-7305.1948.tb01338.x.hdl:11858/00-001M-0000-002C-4314-2.Archived fromthe original(PDF)on 1998-07-15.The choice of a logarithmic base corresponds to the choice of a unit for measuring information. If the base 2 is used the resulting units may be called binary digits, or more brieflybits,a word suggested byJ. W. Tukey.
  9. ^Shannon, Claude Elwood(October 1948). "A Mathematical Theory of Communication".Bell System Technical Journal.27(4): 623–666.doi:10.1002/j.1538-7305.1948.tb00917.x.hdl:11858/00-001M-0000-002C-4314-2.
  10. ^Shannon, Claude Elwood;Weaver, Warren(1949).A Mathematical Theory of Communication(PDF).University of Illinois Press.ISBN0-252-72548-4.Archived fromthe original(PDF)on 1998-07-15.
  11. ^National Institute of Standards and Technology (2008),Guide for the Use of the International System of Units.Online version.Archived3 June 2016 at theWayback Machine
  12. ^Buchholz, Werner(1956-06-11)."7. The Shift Matrix"(PDF).The Link System.IBM.pp. 5–6.StretchMemo No. 39G.Archived(PDF)from the original on 2017-04-04.Retrieved2016-04-04.[…] Most important, from the point of view of editing, will be the ability to handle any characters or digits, from 1 to 6 bits long […] the Shift Matrix to be used to convert a 60-bitword,coming from Memory in parallel, intocharacters,or "bytes"as we have called them, to be sent to theAdderserially. The 60 bits are dumped intomagnetic coreson six different levels. Thus, if a 1 comes out of position 9, it appears in all six cores underneath. […] The Adder may accept all or only some of the bits. […] Assume that it is desired to operate on 4 bitdecimal digits,starting at the right. The 0-diagonal is pulsed first, sending out the six bits 0 to 5, of which the Adder accepts only the first four (0-3). Bits 4 and 5 are ignored. Next, the 4 diagonal is pulsed. This sends out bits 4 to 9, of which the last two are again ignored, and so on. […] It is just as easy to use all six bits inAlpha numericwork, or to handle bytes of only one bit for logical analysis, or to offset the bytes by any number of bits. […]
  13. ^Buchholz, Werner(February 1977)."The Word" Byte "Comes of Age..."Byte Magazine.2(2): 144.[…] The first reference found in the files was contained in an internal memo written in June 1956 during the early days of developingStretch.Abytewas described as consisting of any number of parallel bits from one to six. Thus a byte was assumed to have a length appropriate for the occasion. Its first use was in the context of the input-output equipment of the 1950s, which handled six bits at a time. The possibility of going to 8 bit bytes was considered in August 1956 and incorporated in the design of Stretch shortly thereafter. The first published reference to the term occurred in 1959 in a paper "Processing Data in Bits and Pieces" byG A Blaauw,F P Brooks JrandW Buchholzin theIRE Transactions on Electronic Computers,June 1959, page 121. The notions of that paper were elaborated in Chapter 4 ofPlanning a Computer System (Project Stretch),edited by W Buchholz,McGraw-Hill Book Company(1962). The rationale for coining the term was explained there on page 40 as follows:
    Bytedenotes a group of bits used to encode a character, or the number of bits transmitted in parallel to and from input-output units. A term other thancharacteris used here because a given character may be represented in different applications by more than one code, and different codes may use different numbers of bits (ie, different byte sizes). In input-output transmission the grouping of bits may be completely arbitrary and have no relation to actual characters. (The term is coined frombite,but respelled to avoid accidental mutation tobit.)
    System/360took over many of the Stretch concepts, including the basic byte and word sizes, which are powers of 2. For economy, however, the byte size was fixed at the 8 bit maximum, and addressing at the bit level was replaced by byte addressing. […]
  14. ^Blaauw, Gerrit Anne;Brooks, Jr., Frederick Phillips;Buchholz, Werner(1962),"Chapter 4: Natural Data Units"(PDF),inBuchholz, Werner(ed.),Planning a Computer System – Project Stretch,McGraw-Hill Book Company, Inc./ The Maple Press Company, York, PA., pp. 39–40,LCCN61-10466,archived fromthe original(PDF)on 2017-04-03,retrieved2017-04-03
  15. ^Bemer, Robert William(1959)."A proposal for a generalized card code of 256 characters".Communications of the ACM.2(9): 19–23.doi:10.1145/368424.368435.S2CID36115735.
  16. ^abInformation in small bitsInformation in Small Bits is a book produced as part of a non-profit outreach project of the IEEE Information Theory Society. The book introduces Claude Shannon and basic concepts of Information Theory to children 8 and older using relatable cartoon stories and problem-solving activities.
  17. ^"The World's Technological Capacity to Store, Communicate, and Compute Information"Archived2013-07-27 at theWayback Machine,especiallySupporting online materialArchived2011-05-31 at theWayback Machine,Martin Hilbert and Priscila López (2011),Science,332(6025), 60-65; free access to the article through here: martinhilbert.net/WorldInfoCapacity.html
  18. ^Bhattacharya, Amitabha (2005).Digital Communication.Tata McGraw-Hill Education.ISBN978-0-07059117-2.Archivedfrom the original on 2017-03-27.
edit
  • Bit Calculator– a tool providing conversions between bit, byte, kilobit, kilobyte, megabit, megabyte, gigabit, gigabyte
  • BitXByteConverterArchived2016-04-06 at theWayback Machine– a tool for computing file sizes, storage capacity, and digital information in various units