This is a sandbox page for Thermochap. Thermochap ( talk) 22:30, 4 February 2021 (UTC)
Creating new pages is a more elaborate process than it used to be, but still worthwhile even if it takes longer to do. Pages that might help include a page on:
Clues to emerging insights into the relationships examined in introductory physics come from the study of symmetry breaks associated with tipping points [1], like when a pencil balanced on its tip chooses a direction to fall, a steam bubble in a pot of hot water on the stove chooses where to form, or folks on a picnic in a meadow choose a clear patch in which to lay their blanket. Some particular dichotomies underpinning the content of courses on "mechanics, oscillations & heat" might be called, in order of their emergence in the natural history of invention, something like "relational vs. standalone", "here vs. now", "past vs. future", and "inside vs. outside".
In an extension of John Archibald Wheeler's It From Bit [2], the basic insight is that in Newtonian physics the idea of objects (e.g. as stand-alone projectiles) is very useful, but modern insights into nature on both large and small size & time scales suggest that it's an approximation that works only on intermediate size & time scales. Assertions about correlations between subsystems, e.g. measured in log-probability terms using information units, including the relationship between our ideas and reality, end up having traction in practice on all scales.
For more on this, see for example Wikipedia's page on relational quantum theories inspired by Louis Crane & Carlo Rovelli. Carlo Rovelli's Seven Brief Lessons on Physics [3] in its own way frames the importance of the three-fold connection between emergent views of gravitational spacetime, quantum-mechanical indeterminacy, and the statistical approach to thermal/information physics. One Rosetta stone in his view, somewhat less immediate than the connections that I've been interested in, is Stephen Hawking's tale about the quantum-mechanical heating/evaporation of black holes.
The traveler-point break from 4 to (3+1) vector symmetry in the observer experience (now discussed in the background section of our traveler point dynamics page as well as in this voicethread) is what makes time separate from space, and the concept of "now" different from the concept of "here". In other words, this gives traveling observers in spacetime their very own scalar time direction, e.g. with its own units as well as its own measuring-instrument designs, as compared to the map-based spatial axes.
Examples in everyday life where this separation between here and now becomes blurred include GPS systems, which must correct for the fact that your head is aging faster than your feet; far-away astrophysical events, which "occur for us" long after the light bringing word of their happening was emitted; and electrons around atoms, whose position at any specific time can only be determined by knocking them out of the atom.
The laws of physical dynamics may distinguish (for observers locally) between here and now, but they generally do not distinguish between past and future. In other words, time running backward is possible even if the events that result appear (as in the video of a window breaking) to be very unlikely.
The role of subsystem correlations (which arises in both wavefunction collapse and in thermal/information physics) then gives that scalar time its direction, because correlation between isolated subsystems fades stochastically in the forward time direction only. This delocalized-correlation statement of "the 2nd law of thermodynamics" is, for example, how you can often tell whether a video is being played backward or forward. Subsystem correlations are likely to grow unexpectedly in the reverse time direction only, e.g. when components of a broken egg "reassemble" back into an unbroken one.
Causal connections (which are otherwise merely relational [4]) are then free to inherit this asymmetry. In other words, normal causes explain changes in subsystem correlation which unfold in the forward time direction only.
Our experience (and that of other lifeforms) unfolds in the forward time direction through a series of attempts to buffer those otherwise vanishing correlations (e.g., for metazoans, the subsystem correlations that look in/out from skin, family, and culture). This is one way to see the emergence of a scalar time direction in our experience, even though our dynamical laws are reversible.
Life as we know it, in that sense, is a layered hierarchy of correlation-buffering subsystems [5] [6] [7] on our planet's surface (including ourselves), powered by thermalization of available work mostly from our sun [8] [9]. Life's adaptation to that particular planetary surface runs deep, even if, as always, it is having to adapt, e.g. to changes in population, climate, etc. This is one reason that migration off the planet's surface may not be as easy as it sounds [10].
In statistical mechanics and information theory, multiplicity is often a count of the effective number of states or choices. It is also the exponential form of surprisal, as illustrated by the equation S = k ln W on Ludwig Boltzmann's 1906 tomb, which may be solved for multiplicity W to define information units, e.g. via the expression: #choices = W = e^(S/k) = 2^(#bits).
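As a quick sanity check, the entropy ⇔ multiplicity ⇔ bits conversion above can be sketched in a few lines of Python (the function names here are illustrative, not from any standard library):

```python
import math

def multiplicity_from_entropy(S, k=1.0):
    """Effective number of choices W = e^(S/k)."""
    return math.exp(S / k)

def bits_from_multiplicity(W):
    """Bits needed to index W equally likely choices."""
    return math.log2(W)

# Two bits of surprisal should correspond to W = 4 equally likely choices.
k_bits = 1 / math.log(2)   # choosing k = 1/ln 2 makes S come out in bits
S = 2.0                    # entropy in bits
W = multiplicity_from_entropy(S, k_bits)
print(W)                          # ≈ 4.0
print(bits_from_multiplicity(W))  # ≈ 2.0
```

Note that only the ratio S/k matters, which is why both W and S/k are dimensionless.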
Note that in this context both multiplicity (W) and surprisal in natural units (i.e. S/k) are fundamentally dimensionless, although the constant k in the latter case can be used to give surprisal differing units for different application areas. For example, in thermal physics it is traditional to set k equal to the Boltzmann constant e.g. in [J/K], while in communications k is often set to 1/ln2 to give surprisal in bits.
The question of the link between information entropy and thermodynamic entropy is a debated topic. In this discussion of multiplicities we treat them, regardless of their differences, as branches of the more general topic of Bayesian statistical inference [12] [13] designed for taking the best guess based on limited information.
Multiplicity, as a technical type of effective count or enumeration, is easiest to describe in systems which have a countable number (rather than a continuum) of allowed states.
Multiplicities, in finite discrete systems with all choices equally likely, are simply the number of choices i.e. the number of possible states. To open the door to their more sophisticated uses in the next section, it helps to define them in terms of probability p and/or the log-probability measure surprisal [14] s = k ln[1/p] where k is used to define a variety of "dimensionless units" like bits (k=1/ln[2]).
The discrete-system multiplicity obtained by simple counting is just w = 1/p = e^(s/k). Hence the surprisal ⇔ probability ⇔ multiplicity interconversion is summarized by:

s = k ln[1/p] = k ln[w] ⇔ p = e^(-s/k) = 1/w ⇔ w = e^(s/k) = 1/p
where, in terms of dimensionless multiplicity w, the units for surprisal are bits if k = 1/ln[2]. Boltzmann's entropy S = k ln W (where W is the "number of accessible states" and k is Boltzmann's constant), e.g. in J. W. Gibbs' microcanonical ensemble, also assumes that all states are equally probable.
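A minimal sketch of this equal-likelihood case, assuming nothing beyond the definitions above (function names are again illustrative):

```python
import math

def surprisal(p, k=1.0):
    """s = k ln(1/p); use k = 1/ln 2 to get bits."""
    return k * math.log(1 / p)

def multiplicity(p):
    """w = 1/p for one state among equally likely alternatives."""
    return 1 / p

p = 1 / 8                               # one of eight equally likely states
print(surprisal(p, k=1 / math.log(2)))  # ≈ 3.0 bits
print(multiplicity(p))                  # 8.0 states
print(math.exp(surprisal(p)))           # w = e^(s/k) recovers ≈ 8.0
```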
If the states instead differ in their accessibility (each e.g. with fractional probability pi for i=1 to N), average surprisals and geometric-average multiplicities are used for applications in thermodynamics [15] and communication theory [16], with the result e.g. that entropy-first approaches are now standard in senior undergraduate [17] [18] [19] [20] and some introductory [21] thermal physics treatments. These are derived in terms of simple desiderata e.g. in books by E. T. Jaynes [12] and Phil Gregory [13].
The conversion between multiplicities and surprisals for a set of N possibly unequal but normalized probabilities p_i then becomes:

S_avg = k Σ_i p_i ln[1/p_i] = k ln[W_geo] ⇔ W_geo = e^(S_avg/k) = Π_i (1/p_i)^(p_i)
Here normalization means that Σ_i p_i = 1, the "multiplicity" W_geo term represents a geometric (i.e. multiplicative) average, and k is used to determine the dimensionless "information units" used to measure surprisal. Note that 0 ≤ S_avg ≤ k ln[N], and 1 ≤ W_geo ≤ N, with the maximum in both cases obtained when all states are equally probable.
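These bounds are easy to check numerically. The sketch below (with hypothetical helper names) computes the average surprisal and geometric-average multiplicity for a uniform and a skewed distribution over the same N = 4 states:

```python
import math

def avg_surprisal(p, k=1.0):
    """S_avg = k * sum_i p_i ln(1/p_i), i.e. the Shannon entropy."""
    return k * sum(pi * math.log(1 / pi) for pi in p if pi > 0)

def w_geo(p):
    """Geometric-average multiplicity W_geo = e^(S_avg/k) = prod_i (1/p_i)^p_i."""
    return math.exp(avg_surprisal(p))

uniform = [0.25] * 4            # N = 4 equally likely states
skewed = [0.7, 0.1, 0.1, 0.1]   # same N, unequal probabilities

print(w_geo(uniform))  # 4.0: the maximum, reached when all states are equal
print(w_geo(skewed))   # between 1 and 4: fewer "effective" choices
```

The skewed case illustrates why W_geo is an "effective" count: the rarely visited states contribute less than a full choice each.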
A use of this in communications theory involves 1951-vintage Huffman coding, which underlies a wide range of lossless compression strategies, and at least inspires the 1977-vintage Lempel-Ziv compression used in ZIP files as well as the lossless compression strategies used for GIF and PNG images. The basic idea is that Huffman and related coding strategies replace characters in a message or image with a binary code that requires fewer bits for the more common characters, in effect resulting in a "new language" composed of codewords that occur with near-equal probability and something like the minimum average codeword length, which is equal (e.g. in bits) to none other than the surprisal average or Shannon entropy value given above! In fact, compression algorithms like this can even be used for entropy estimations in thermodynamic systems [22].
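To see this entropy bound in action, here is a deliberately minimal Huffman sketch in Python; it computes only the codeword lengths (not the codes themselves), then compares the resulting average length to the Shannon entropy of the symbol frequencies:

```python
import heapq
import math
from collections import Counter

def huffman_lengths(freqs):
    """Return codeword length in bits for each symbol, via Huffman merging."""
    # Each heap entry: (total frequency, tiebreak id, {symbol: depth}).
    heap = [(f, i, {s: 0}) for i, (s, f) in enumerate(freqs.items())]
    heapq.heapify(heap)
    tiebreak = len(heap)
    while len(heap) > 1:
        f1, _, d1 = heapq.heappop(heap)
        f2, _, d2 = heapq.heappop(heap)
        # Merging two subtrees pushes every contained symbol one level deeper.
        merged = {s: depth + 1 for s, depth in {**d1, **d2}.items()}
        heapq.heappush(heap, (f1 + f2, tiebreak, merged))
        tiebreak += 1
    return heap[0][2]

text = "abracadabra"
freqs = {s: c / len(text) for s, c in Counter(text).items()}
lengths = huffman_lengths(freqs)
avg_len = sum(freqs[s] * lengths[s] for s in freqs)
entropy = sum(f * math.log2(1 / f) for f in freqs.values())
print(avg_len >= entropy)  # True: Shannon entropy lower-bounds code length
```

For any distribution, the Huffman average length lands within one bit above the Shannon entropy, which is the sense in which the entropy gives "something like the minimum codeword length."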
If one considers two probability distributions, p and q, instead of just p, the previous quantity S_avg may more generally be written as S_p/p. The interconversion for the average surprisal, uncertainty, or entropy associated with expected probability-set q, as measured by operating probability-set p, can then be written:

S_q/p = k Σ_i p_i ln[1/q_i] = k ln[W_q/p] ⇔ W_q/p = e^(S_q/p / k) = Π_i (1/q_i)^(p_i)
Although written here for discrete probability-sets, these expressions are naturally adapted to continuous as well as to quantum mechanical (i.e. root-probability wavefunction) probability-sets [23] [24].
Note that the upper limit on S_p/p (in bits) is log2[N]. Also, the fact that S_q/p ≥ S_p/p, i.e. that measurements using the wrong model q are on average "more surprised" by observational data than those using the operating model p, underlies least-squares parameter estimation and Bayesian model selection, as well as the positivity of the correlation and thermodynamic-availability measures discussed below.
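The inequality S_q/p ≥ S_p/p (Gibbs' inequality) is easy to verify numerically; the sketch below uses an illustrative helper name and a made-up three-state example:

```python
import math

def cross_entropy_bits(p, q):
    """S_q/p = sum_i p_i log2(1/q_i): surprise of model q under operating p."""
    return sum(pi * math.log2(1 / qi) for pi, qi in zip(p, q) if pi > 0)

p = [0.5, 0.25, 0.25]   # operating ("true") distribution
q = [1/3, 1/3, 1/3]     # a wrong model

S_pp = cross_entropy_bits(p, p)  # self-entropy S_p/p, here 1.5 bits
S_qp = cross_entropy_bits(p, q)  # cross-entropy S_q/p, here log2(3) bits
print(S_qp >= S_pp)              # True: the wrong model is more surprised
```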
Just as average surprisal S/k here is defined as Σ_i f_i ln[1/f_i], where the f_i are a set of normalized fractional probabilities, so following the prescription S = k ln W it is natural to define the effective multiplicity of choices as W ≡ e^(S/k) = e^(Σ_i f_i ln[1/f_i]) = Π_i (1/f_i)^(f_i). Even multiplicity logarithms are huge when one is applying statistical inference to macroscopic physical systems with Avogadro's number of components. However, in statistical problems where the total number of choices W is small, the multiplicity approach may be more useful.
The two-distribution notation mentioned above opens the door to a wider range of applications by introducing an explicit reference probability into the discussion. The result in terms of surprisals is the "total correlation" measure known as Kullback-Leibler divergence, relative entropy, or "net surprisal". In statistical inference generally, and in thermodynamics in particular, maximizing entropy more explicitly involves minimizing KL-divergence with reference to a uniform prior [13].
Specific application areas include: (i) thermodynamic availability, (ii) algorithmic model selection, and (iii) the evolution of complexity. The surprisal ⇔ probability ⇔ multiplicity interconversion for these correlation analyses may be written:

I_KL = k Σ_i p_i ln[p_i/q_i] = k ln[M] ⇔ M = e^(I_KL/k) = Π_i (p_i/q_i)^(p_i) ≥ 1
Log-probability measures are useful for tracking subsystem correlations in digital as well as in analog complex systems. In particular, tools based on the Kullback-Leibler divergence I_KL ≥ 0 and the matchup multiplicities M = e^(I_KL/k) = Π_i (p_i/q_i)^(p_i) associated with reference probability-set q_i have proven useful: (i) to engineers for measuring available work or "exergy" in thermodynamic systems [25] [26], (ii) to communication scientists and geneticists for studies of regulatory-protein binding-site structure [27], relatedness [28], network structure, & replication fidelity [16] [29] [30], and (iii) to behavioral ecologists wanting to select, from a set of simplifying models, the one which is least surprised by experimental data [31] [32] from a complex reality.
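A minimal sketch of these two quantities, with illustrative function names and a uniform reference distribution standing in for the prior:

```python
import math

def kl_divergence(p, q, k=1.0):
    """Net surprisal I_KL = k * sum_i p_i ln(p_i/q_i) >= 0."""
    return k * sum(pi * math.log(pi / qi) for pi, qi in zip(p, q) if pi > 0)

def matchup_multiplicity(p, q):
    """M = e^(I_KL/k) = prod_i (p_i/q_i)^p_i >= 1."""
    return math.exp(kl_divergence(p, q))

p = [0.7, 0.2, 0.1]   # observed probabilities
q = [1/3, 1/3, 1/3]   # uniform reference (prior)

print(kl_divergence(p, q) > 0)         # True unless p == q
print(matchup_multiplicity(p, q) > 1)  # True unless p == q
print(kl_divergence(p, p))             # 0.0: no net surprisal versus itself
```

The positivity of I_KL (and hence M ≥ 1) is what gives these measures their "2nd law teeth" in the applications listed above.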
These multi-moment correlation measures also have 2nd-law teeth, making them relevant to quantum computing [33], and they enable one to distinguish pair from higher-order correlations, making them relevant to the exploration of order-emergence in a wide range of biological systems [34].
The first physical science applications of multiplicity, including those by Ludwig Boltzmann, were likely in classical (i.e. non-quantum) statistical mechanics. There, and more generally whenever parameters like energy, position, and velocity are allowed to take on a continuum of values, uniformly distributed states may allow one to connect multiplicity to a "volume" in that parameter (or "phase") space.
State probabilities then become differential probabilities, e.g. per unit length, and are thus dimensioned. This is a problem for surprisals and average surprisals [13], because logarithms require dimensionless arguments. It is not a problem for multiplicities or for net surprisals: the former do not use logarithms, and the latter place only probability ratios inside their logarithms.
In classical thermodynamics the absolute size of multiplicity was also a mystery, even though it was generally not crucial to predictions. This is not usually a problem in other application areas, and the uncertainty principle in quantum mechanics has to a large extent solved that problem in statistical mechanics as well.
For example, imagine a monatomic ideal gas of N distinguishable non-interacting atoms in a box of side L, for which per atom (ΔxΔp)^3 ≥ h^3 (or any other constant that may also depend on N), where the position-momentum product on the left-hand side uses either the Heisenberg uncertainty principle or momentum quantization in an infinite-well potential to divide the continuum of possible states into a finite number dependent on Planck's constant h. Then using the Newtonian expression p = √(2mE) for momentum in terms of total energy E and particle mass m, we can write, up to a constant that may depend on N:

W ≈ [V (2mE)^(3/2) / h^3]^N
where V ≡ L^3 is the box volume. It is then a simple matter to calculate S = k ln W and show that the definition of reciprocal temperature or coldness ∂S/∂E ≡ 1/T implies equipartition, i.e. E = (3/2)NkT, and that the definition of the free-expansion coefficient [18] ∂S/∂V ≡ P/T implies the ideal gas law, i.e. PV = NkT.
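Those two derivatives can be checked numerically. The sketch below assumes only that W ∝ V^N E^(3N/2), so constants independent of E and V are dropped from S = k ln W (they vanish in the derivatives anyway), and uses central finite differences in place of calculus:

```python
import math

def S(E, V, N, k=1.0):
    """S = k ln W for W ∝ V^N E^(3N/2), omitting E- and V-independent terms."""
    return k * (N * math.log(V) + 1.5 * N * math.log(E))

def dS_dE(E, V, N, h=1e-6):
    return (S(E + h, V, N) - S(E - h, V, N)) / (2 * h)

def dS_dV(E, V, N, h=1e-6):
    return (S(E, V + h, N) - S(E, V - h, N)) / (2 * h)

N, k, T, V = 100, 1.0, 2.0, 5.0   # arbitrary values in units with k = 1
E = 1.5 * N * k * T               # equipartition guess: E = (3/2) N k T

print(abs(dS_dE(E, V, N) - 1 / T) < 1e-4)  # dS/dE = 1/T checks out
P = T * dS_dV(E, V, N)                     # from dS/dV = P/T
print(abs(P * V - N * k * T) < 1e-3)       # ideal gas law PV = NkT
```

Since dS/dE = (3/2)Nk/E and dS/dV = Nk/V for this form of W, both checks succeed for any choice of N, T, and V.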