Ensemble (mathematical physics)
Statistical mechanics |
---|
Inphysics,specificallystatistical mechanics,anensemble(alsostatistical ensemble) is an idealization consisting of a large number of virtual copies (sometimes infinitely many) of asystem,considered all at once, each of which represents a possible state that the real system might be in. In other words, a statistical ensemble is a set of systems of particles used in statistical mechanics to describe a single system.[1]The concept of an ensemble was introduced byJ. Willard Gibbsin 1902.[2]
Athermodynamic ensembleis a specific variety of statistical ensemble that, among other properties, is in statistical equilibrium (defined below), and is used to derive the properties ofthermodynamic systemsfrom the laws of classical or quantum mechanics.[3][4]
Physical considerations
[edit]The ensemble formalises the notion that an experimenter repeating an experiment again and again under the same macroscopic conditions, but unable to control the microscopic details, may expect to observe a range of different outcomes.
The notional size of ensembles in thermodynamics, statistical mechanics andquantum statistical mechanicscan be very large, including every possiblemicroscopic statethe system could be in, consistent with its observedmacroscopicproperties. For many important physical cases, it is possible to calculate averages directly over the whole of the thermodynamic ensemble, to obtain explicit formulas for many of the thermodynamic quantities of interest, often in terms of the appropriatepartition function.
The concept of an equilibrium or stationary ensemble is crucial to many applications of statistical ensembles. Although a mechanical system certainly evolves over time, the ensemble does not necessarily have to evolve. In fact, the ensemble will not evolve if it contains all past and future phases of the system. Such a statistical ensemble, one that does not change over time, is calledstationaryand can be said to be instatistical equilibrium.[2]
Terminology
[edit]- The word "ensemble" is also used for a smaller set of possibilitiessampledfrom the full set of possible states. For example, a collection ofwalkersin aMarkov chain Monte Carloiteration is called an ensemble in some of the literature.
- The term "ensemble" is often used in physics and the physics-influenced literature. Inprobability theory,the termprobability spaceis more prevalent.
Main types
[edit]The study of thermodynamics is concerned with systems that appear to human perception to be "static" (despite the motion of their internal parts), and which can be described simply by a set of macroscopically observable variables. These systems can be described by statistical ensembles that depend on a few observable parameters, and which are in statistical equilibrium. Gibbs noted that different macroscopic constraints lead to different types of ensembles, with particular statistical characteristics.
"We may imagine a great number of systems of the same nature, but differing in the configurations and velocities which they have at a given instant, and differing in not merely infinitesimally, but it may be so as to embrace every conceivable combination of configuration and velocities..." J. W. Gibbs(1903)[5]
Three important thermodynamic ensembles were defined by Gibbs:[2]
- Microcanonical ensemble(orNVE ensemble) —a statistical ensemble where the total energy of the system and the number of particles in the system are each fixed to particular values; each of the members of the ensemble are required to have the same total energy and particle number. The system must remain totally isolated (unable to exchange energy or particles with its environment) in order to stay in statistical equilibrium.[2]
- Canonical ensemble(orNVT ensemble)—a statistical ensemble where the energy is not known exactly but the number of particles is fixed. In place of the energy, thetemperatureis specified. The canonical ensemble is appropriate for describing a closed system which is in, or has been in, weakthermal contactwith a heat bath. In order to be in statistical equilibrium, the system must remain totally closed (unable to exchange particles with its environment) and may come into weak thermal contact with other systems that are described by ensembles with the same temperature.[2]
- Grand canonical ensemble(orμVT ensemble)—a statistical ensemble where neither the energy nor particle number are fixed. In their place, the temperature andchemical potentialare specified. The grand canonical ensemble is appropriate for describing an open system: one which is in, or has been in, weak contact with a reservoir (thermal contact, chemical contact, radiative contact, electrical contact, etc.). The ensemble remains in statistical equilibrium if the system comes into weak contact with other systems that are described by ensembles with the same temperature and chemical potential.[2]
The calculations that can be made using each of these ensembles are explored further in their respective articles. Other thermodynamic ensembles can be also defined, corresponding to different physical requirements, for which analogous formulae can often similarly be derived. For example, in the reaction ensemble, particle number fluctuations are only allowed to occur according to thestoichiometryof thechemical reactionswhich are present in the system.[6]
Equivalence
[edit]In thermodynamic limit all ensembles should produce identical observables due toLegendre transforms,deviations to this rule occurs under conditions that state-variables are non-convex, such as small molecular measurements.[7]
Representations
[edit]The precise mathematical expression for a statistical ensemble has a distinct form depending on the type of mechanics under consideration (quantum or classical). In the classical case, the ensemble is aprobability distributionover the microstates. In quantum mechanics, this notion, due tovon Neumann,is a way of assigning a probability distribution over the results of eachcomplete set of commuting observables. In classical mechanics, the ensemble is instead written as a probability distribution inphase space;the microstates are the result of partitioning phase space into equal-sized units, although the size of these units can be chosen somewhat arbitrarily.
Requirements for representations
[edit]Putting aside for the moment the question of how statistical ensembles are generatedoperationally,we should be able to perform the following two operations on ensemblesA,Bof the same system:
- Test whetherA,Bare statistically equivalent.
- Ifpis a real number such that0 <p< 1,then produce a new ensemble by probabilistic sampling fromAwith probabilitypand fromBwith probability1 −p.
Under certain conditions, therefore,equivalence classesof statistical ensembles have the structure of a convex set.
Quantum mechanical
[edit]A statistical ensemble in quantum mechanics (also known as a mixed state) is most often represented by adensity matrix,denoted by.The density matrix provides a fully general tool that can incorporate both quantum uncertainties (present even if the state of the system were completely known) and classical uncertainties (due to a lack of knowledge) in a unified manner. Any physical observableXin quantum mechanics can be written as an operator,.The expectation value of this operator on the statistical ensembleis given by the followingtrace:
This can be used to evaluate averages (operator),variances(using operator),covariances(using operator), etc. The density matrix must always have a trace of 1:(this essentially is the condition that the probabilities must add up to one).
In general, the ensemble evolves over time according to thevon Neumann equation.
Equilibrium ensembles (those that do not evolve over time,) can be written solely as a function of conserved variables. For example, themicrocanonical ensembleandcanonical ensembleare strictly functions of the total energy, which is measured by the total energy operator(Hamiltonian). The grand canonical ensemble is additionally a function of the particle number, measured by the total particle number operator.Such equilibrium ensembles are adiagonal matrixin the orthogonal basis of states that simultaneously diagonalize each conserved variable. Inbra–ket notation,the density matrix is
where the|ψi⟩,indexed byi,are the elements of a complete and orthogonal basis. (Note that in other bases, the density matrix is not necessarily diagonal.)
Classical mechanical
[edit]In classical mechanics, an ensemble is represented by a probability density function defined over the system'sphase space.[2]While an individual system evolves according toHamilton's equations,the density function (the ensemble) evolves over time according toLiouville's equation.
In amechanical systemwith a defined number of parts, the phase space hasngeneralized coordinatescalledq1,...qn,andnassociatedcanonical momentacalledp1,...pn.The ensemble is then represented by ajoint probability density functionρ(p1,...pn,q1,...qn).
If the number of parts in the system is allowed to vary among the systems in the ensemble (as in a grand ensemble where the number of particles is a random quantity), then it is a probability distribution over an extended phase space that includes further variables such as particle numbersN1(first kind of particle),N2(second kind of particle), and so on up toNs(the last kind of particle;sis how many different kinds of particles there are). The ensemble is then represented by ajoint probability density functionρ(N1,...Ns,p1,...pn,q1,...qn).The number of coordinatesnvaries with the numbers of particles.
Any mechanical quantityXcan be written as a function of the system's phase. The expectation value of any such quantity is given by an integral over the entire phase space of this quantity weighted byρ:
The condition of probability normalization applies, requiring
Phase space is a continuous space containing an infinite number of distinct physical states within any small region. In order to connect the probabilitydensityin phase space to a probabilitydistributionover microstates, it is necessary to somehow partition the phase space into blocks that are distributed representing the different states of the system in a fair way. It turns out that the correct way to do this simply results in equal-sized blocks of canonical phase space, and so a microstate in classical mechanics is an extended region in the phase space of canonical coordinates that has a particular volume.[note 1]In particular, the probability density function in phase space,ρ,is related to the probability distribution over microstates,Pby a factor
where
- his an arbitrary but predetermined constant with the units ofenergy×time,setting the extent of the microstate and providing correct dimensions toρ.[note 2]
- Cis an overcounting correction factor (see below), generally dependent on the number of particles and similar concerns.
Sincehcan be chosen arbitrarily, the notional size of a microstate is also arbitrary. Still, the value ofhinfluences the offsets of quantities such as entropy and chemical potential, and so it is important to be consistent with the value ofhwhen comparing different systems.
Correcting overcounting in phase space
[edit]Typically, the phase space contains duplicates of the same physical state in multiple distinct locations. This is a consequence of the way that a physical state is encoded into mathematical coordinates; the simplest choice of coordinate system often allows a state to be encoded in multiple ways. An example of this is a gas of identical particles whose state is written in terms of the particles' individual positions and momenta: when two particles are exchanged, the resulting point in phase space is different, and yet it corresponds to an identical physical state of the system. It is important in statistical mechanics (a theory about physical states) to recognize that the phase space is just a mathematical construction, and to not naively overcount actual physical states when integrating over phase space. Overcounting can cause serious problems:
- Dependence of derived quantities (such as entropy and chemical potential) on the choice of coordinate system, since one coordinate system might show more or less overcounting than another.[note 3]
- Erroneous conclusions that are inconsistent with physical experience, as in themixing paradox.[2]
- Foundational issues in defining thechemical potentialand thegrand canonical ensemble.[2]
It is in general difficult to find a coordinate system that uniquely encodes each physical state. As a result, it is usually necessary to use a coordinate system with multiple copies of each state, and then to recognize and remove the overcounting.
A crude way to remove the overcounting would be to manually define a subregion of phase space that includes each physical state only once and then exclude all other parts of phase space. In a gas, for example, one could include only those phases where the particles'xcoordinates are sorted in ascending order. While this would solve the problem, the resulting integral over phase space would be tedious to perform due to its unusual boundary shape. (In this case, the factorCintroduced above would be set toC= 1,and the integral would be restricted to the selected subregion of phase space.)
A simpler way to correct the overcounting is to integrate over all of phase space but to reduce the weight of each phase in order to exactly compensate the overcounting. This is accomplished by the factorCintroduced above, which is a whole number that represents how many ways a physical state can be represented in phase space. Its value does not vary with the continuous canonical coordinates,[note 4]so overcounting can be corrected simply by integrating over the full range of canonical coordinates, then dividing the result by the overcounting factor. However,Cdoes vary strongly with discrete variables such as numbers of particles, and so it must be applied before summing over particle numbers.
As mentioned above, the classic example of this overcounting is for a fluid system containing various kinds of particles, where any two particles of the same kind are indistinguishable and exchangeable. When the state is written in terms of the particles' individual positions and momenta, then the overcounting related to the exchange of identical particles is corrected by using[2]
This is known as "correct Boltzmann counting".
Ensembles in statistics
[edit]The formulation of statistical ensembles used in physics has now been widely adopted in other fields, in part because it has been recognized that thecanonical ensembleorGibbs measureserves to maximize the entropy of a system, subject to a set of constraints: this is theprinciple of maximum entropy.This principle has now been widely applied to problems inlinguistics,robotics,and the like.
In addition, statistical ensembles in physics are often built on aprinciple of locality:that all interactions are only between neighboring atoms or nearby molecules. Thus, for example,lattice models,such as theIsing model,modelferromagnetic materialsby means of nearest-neighbor interactions between spins. The statistical formulation of the principle of locality is now seen to be a form of theMarkov propertyin the broad sense; nearest neighbors are nowMarkov blankets.Thus, the general notion of a statistical ensemble with nearest-neighbor interactions leads toMarkov random fields,which again find broad applicability; for example inHopfield networks.
Ensemble average
[edit]Instatistical mechanics,theensemble averageis defined as themeanof a quantity that is a function of themicrostateof a system, according to the distribution of the system on its micro-states in thisensemble.
Since the ensemble average is dependent on theensemblechosen, its mathematical expression varies from ensemble to ensemble. However, themeanobtained for a given physical quantity does not depend on the ensemble chosen at thethermodynamic limit. Thegrand canonical ensembleis an example of anopen system.[8]
Classical statistical mechanics
[edit]For a classical system inthermal equilibriumwith its environment, theensemble averagetakes the form of an integral over thephase spaceof the system:
where
- is the ensemble average of the system propertyA,
- is,known asthermodynamic beta,
- His theHamiltonianof the classical system in terms of the set of coordinatesand their conjugate generalized momenta,
- is thevolume elementof the classical phase space of interest.
The denominator in this expression is known as thepartition functionand is denoted by the letterZ.
Quantum statistical mechanics
[edit]Inquantum statistical mechanics,for a quantum system in thermal equilibrium with its environment, the weighted average takes the form of a sum overquantum energy states,rather than a continuous integral:[clarification needed]
Canonical ensemble average
[edit]The generalized version of thepartition functionprovides the complete framework for working with ensemble averages in thermodynamics,information theory,statistical mechanicsandquantum mechanics.
Themicrocanonical ensemblerepresents an isolated system in which energy (E), volume (V) and the number of particles (N) are all constant. Thecanonical ensemblerepresents a closed system which can exchange energy (E) with its surroundings (usually a heat bath), but the volume (V) and the number of particles (N) are all constant. Thegrand canonical ensemblerepresents an open system which can exchange energy (E) and particles (N) with its surroundings, but the volume (V) is kept constant.
Operational interpretation
[edit]In the discussion given so far, while rigorous, we have taken for granted that the notion of an ensemble is valid a priori, as is commonly done in physical context. What has not been shown is that the ensembleitself(not the consequent results) is a precisely defined object mathematically. For instance,
- It is not clear where thisvery large set of systemsexists (for example, is it agasof particles inside a container?)
- It is not clear how to physically generate an ensemble.
In this section, we attempt to partially answer this question.
Suppose we have apreparation procedurefor a system in a physics lab: For example, the procedure might involve a physical apparatus and some protocols for manipulating the apparatus. As a result of this preparation procedure, some system is produced and maintained in isolation for some small period of time. By repeating this laboratory preparation procedure we obtain a sequence of systemsX1,X2,...,Xk,which in our mathematical idealization, we assume is aninfinitesequence of systems. The systems are similar in that they were all produced in the same way. This infinite sequence is an ensemble.
In a laboratory setting, each one of these prepped systems might be used as input foronesubsequenttesting procedure.Again, the testing procedure involves a physical apparatus and some protocols; as a result of the testing procedure we obtain ayesornoanswer. Given a testing procedureEapplied to each prepared system, we obtain a sequence of values Meas (E,X1), Meas (E,X2),..., Meas (E,Xk). Each one of these values is a 0 (or no) or a 1 (yes).
Assume the following time average exists:
For quantum mechanical systems, an important assumption made in thequantum logicapproach to quantum mechanics is the identification ofyes–noquestions to the lattice of closed subspaces of a Hilbert space. With some additional technical assumptions one can then infer that states are given by density operatorsSso that:
We see this reflects the definition of quantum states in general: A quantum state is a mapping from the observables to their expectation values.
See also
[edit]- Density matrix
- Ensemble (fluid mechanics)
- Phase space
- Liouville's theorem (Hamiltonian)
- Maxwell–Boltzmann statistics
- Replication (statistics)
Notes
[edit]- ^This equal-volume partitioning is a consequence ofLiouville's theorem,i. e., the principle of conservation of extension in canonical phase space for Hamiltonian mechanics. This can also be demonstrated starting with the conception of an ensemble as a multitude of systems. See Gibbs'Elementary Principles,Chapter I.
- ^(Historical note) Gibbs' original ensemble effectively seth= 1 [energy unit]×[time unit],leading to unit-dependence in the values of some thermodynamic quantities like entropy and chemical potential. Since the advent of quantum mechanics,his often taken to be equal to thePlanck constantin order to obtain a semiclassical correspondence with quantum mechanics.
- ^In some cases the overcounting error is benign. An example is thechoice of coordinate system used for representing orientations of three-dimensional objects.A simple encoding is the3-sphere(e. g., unitquaternions) which is adouble cover—each physical orientation can be encoded in two ways. If this encoding is used without correcting the overcounting, then the entropy will be higher byklog 2per rotatable object and the chemical potential lower bykTlog 2.This does not actually lead to any observable error since it only causes unobservable offsets.
- ^Technically, there are some phases where the permutation of particles does not even yield a distinct specific phase: for example, two similar particles can share the exact same trajectory, internal state, etc.. However, in classical mechanics these phases only make up an infinitesimal fraction of the phase space (they havemeasurezero) and so they do not contribute to any volume integral in phase space.
References
[edit]- ^Rennie, Richard; Jonathan Law (2019).Oxford Dictionary of Physics.pp. 458 ff.ISBN978-0198821472.
- ^abcdefghijGibbs, Josiah Willard(1902).Elementary Principles in Statistical Mechanics.New York:Charles Scribner's Sons.
- ^Kittel, Charles;Herbert Kroemer (1980).Thermal Physics, Second Edition.San Francisco: W.H. Freeman and Company. pp. 31 ff.ISBN0-7167-1088-9.
- ^Landau, L.D.;Lifshitz, E.M. (1980).Statistical Physics.Pergamon Press. pp. 9 ff.ISBN0-08-023038-5.
- ^Gibbs, J.W. (1928).The Collected Works, Vol. 2.Green & Co, London, New York: Longmans.
- ^Heath Turner, C.; Brennan, John K.; Lísal, Martin; Smith, William R.; Karl Johnson, J.; Gubbins, Keith E. (2008). "Simulation of chemical reaction equilibria by the reaction ensemble Monte Carlo method: a review".Molecular Simulation.34(2). Informa UK Limited:119–146.doi:10.1080/08927020801986564.ISSN0892-7022.
- ^Süzen, M; Sega, M; Holm, C (18 May 2009).""Ensemble inequivalence in single-molecule experiments"".Physical Review E.79(5): 051118.arXiv:0810.3407.doi:10.1103/PhysRevE.79.051118.Retrieved2024-03-03.
- ^"Statistical mechanics of classical systems"(PDF).George Mason University Physics and Astronomy Department.Retrieved3 November2023.