Chapter 1: Introduction
-
Published:2021
Anirudh Singh, "Introduction", Concepts and the Foundations of Physics, Anirudh Singh
Download citation file:
The aim of this book is to demonstrate how a critical analysis of the conceptual basis of our understanding of the physical world can lend new and far-reaching insights into issues confronting the physics/cosmology communities today. This introductory chapter starts by providing a motivation for carrying out such analyses. It then demonstrates their need in the elaboration of the nature of physical theories and goes on to discuss some of the analytical techniques used by Einstein in formulating his General Principle of Relativity. The chapter ends by providing an overview of the contents of the remainder of the book.
1.1 Some Unanswered Questions and Dilemmas
In life as well as the sciences, we frequently meet situations where there is a lack of complete clarity on the nature of the events taking place. The problem may be due to the interpretation of the information presented or be purely conceptual in nature. A prolonged debate is usually avoided when the observer agrees to accept the established view. Such a decision, however, denies the opportunity for further discussion or investigation of the subject. The following lists some personal examples of such situations relating to well-known physical observations at both the microscopic and the cosmological scales.
1.1.1 Leicester University, ca. 1990
I was walking down the stairwell of the Physics Department building—a routine I had repeated hundreds of times before—but, this time, I decided to stop at the landing and look at the curious picture that had been hanging on the wall there all these years.
It was a small, ordinary-looking black-and-white picture in a brown frame. I took a closer look and found that it showed an array of closely packed gray dots on a dark background.
By this time a colleague had joined me at the scene and explained that it was a high-resolution scanning electron microscope (SEM) image of a silicon surface. The gray dots, he said, were individual silicon atoms that made up the crystalline structure of the solid material. This was my first view of such imagery and I was awestruck with the experience of actually viewing pictures of individual atoms for the first time.
Many years later, during my short stint in the Physics Department of Monash University in Australia, I had the opportunity to use an SEM and experiment with an atomic force microscope (AFM) for my experimental work. About the same time, I happened to come across a PhD dissertation by a student of condensed matter physics, who had been charged with undertaking similar experiments for his work. However, although he used the results of high-resolution microscopy, the introduction to his thesis showed that he was not at all convinced that the “pictures” of atoms he saw were real. He argued strongly that what we saw through the instrument really depended on how we interpreted the information in the image and this depended on the personal judgement of the viewer.
Although it was clear that the student's supervisor would not have been at all happy with such a remark, can we really dismiss the disgruntled murmurings of the student researcher as irrelevant? Upon reflection, the student had merely pointed out that the nature of the physical reality being examined depended on the way we interpreted the results of a measurement carried out by an instrument. What should an appropriate response be to the student's observation?
1.1.2 New Delhi, India, December 2015
I was visiting New Delhi when the first detection of gravitational waves was announced. They were detected simultaneously by two detectors situated at different locations. The event made headline news globally.
According to accepted astrophysics, the waves were caused by the collision of two black holes millions of light years away. The distances involved meant that what we had experienced here on Earth only recently had happened far away and in the dim past of our astronomical history. And it reminded us that the only way we could learn of this happening was through the use of instruments used in experiments performed in the here and now on Earth. It makes one realize that it is not possible to directly experience things beyond our normal (human) reach, or that happen in the present at other locations. As in the case of investigating the microscopic world with high-powered microscopes, we must necessarily depend on instruments to carry out this task for us. These instruments act as extensions to our senses, and we have to contend with the results of such measurements to provide us with the only evidence of the reality of events at these humanly unreachable scales.
We are essentially trapped in the present and here where we exist in the space–time continuum. It is not a very reassuring situation, but we need to understand it as well as we can if we are to make any headway in understanding the new ideas emerging in the world of science today.
1.1.3 Auckland University, 1977
I sat in my nuclear physics lecture listening to my lecturer introducing the salient features of beta decay theory and the neutrino hypothesis. My lasting memory of that lecture is my steadfast reluctance to accept the words of the lecturer as he described the discovery of the neutrino in the search of an explanation for the continuous beta decay spectrum. My whole being was reacting to the assertion that an elusive new particle was being proposed as the answer to the riddle of the experimentally observed continuous beta spectrum. Was it not obvious that the answer lay in the falsification of the conservation laws of energy and momentum? After all, we were looking at experimental evidence and, as physicists, we were supposed to accept this as evidence of reality.
However, considerable time had gone by since the first proposal of the neutrino in 1930 and its detection in 1956 to the present, and the idea of the neutrino as a particle had been firmly established in the minds of physicists and lay persons alike. Today the detection of the neutrino (and its oscillations) is carried out on a routine basis (see, e.g., the IceCube experiment in Antarctica https://icecube.wisc.edu, accessed 14 April 2020), and it would be sacrilege to deny its existence! And yet the question still remains—how do we fit this historical evolution of a physical concept into a bigger picture?
What was notable about the discovery of the neutrino was the process by which it became acceptable. We will consider this process at greater length later.
1.2 The Subjective and the Objective
It is well known that there are differing views among thinkers about the nature of (physical) reality. We have, for instance, the positivists who claim that physical reality is absolute and exists regardless of the presence of humans, and the relativists who believe that it is purely subjective.
Although their views may differ widely, all thinkers will agree that we need concepts to think and that concepts exist in our minds. They are therefore subjective entities. A question that arises naturally is whether there is any relationship between the subjective and the objective.
The positivist will perhaps be quick to point out that physical reality is that which can be verified and measured through physical experiments, while the subjective cannot be verified in this way. And they will be essentially correct. We will later be asking whether the subjective can ever become objective. Is there a process by which a purely subjective entity can acquire objective physical reality? We will consider situations where this proposition may not appear as far-fetched as it first sounds.
1.3 The Nature of Physical Theories
Our theories of physics have undergone rapid evolution and transformation over the last few decades. This has occurred at both the microscopic and the cosmological scales. There has been a merger of theories at both scales to generate new ideas. This has provided new perspectives on physical reality, leading to a plethora of new theories, sometimes with surprisingly brilliant successes.
How do these new theories relate to older concepts and theories? To understand recent developments better, there is a need to examine the nature of physical theories upon which these newer ideas are based. As we will see later, these theories of physics may be categorized into the phenomenological (or ideal), the fundamental, and the historical. Over the decades, the nature of the theory in vogue has changed from the phenomenological/ideal (infinite and timeless universe) to a historical universe with a finite size and an identifiable beginning (the big bang theory). Theories have been concatenated, for instance the joining of quantum field theory (QFT) of the vacuum with an outcome of General Theory of Relativity (GTR) of a black hole to produce interesting results (e.g., Hawking radiation). The question that arises naturally is whether this type of gross concatenation is justified. Are there any requirements that such joining of diverse theories must fulfill?
1.4 Einstein and the Value of Conceptual Analysis
The examples in Sec. 1.3 are hopefully sufficient in themselves to reveal that there are several issues with how we conceive our physical world today, and they point to the need to re-examine the conceptual basis of our thinking to obtain a better understanding of developments in physics (and human thought in general). Before we launch into this task in the chapters that follow, it is possible that there may still be elements of lingering skepticism amongst some regarding the importance of concepts and conceptualization.
The best way to remove any such doubt is through an example. And an excellent example is provided in the work of Albert Einstein on his road to the establishment of the GTR. It is illustrative to see how Einstein used his powers of conceptualization to arrive at his first and second forms of the GTR, and followed this with similar feats of the imagination in the development of the final form of the GTR in 1916.
Einstein had worked out the Special Theory of Relativity (STR) in 1905, but it took him another ten years to come up with a satisfactory framework for a General Theory of Relativity. This was obviously not a simple task. To start earnestly on the task of discovering the new form of the laws of physics and nature, as epitomized by the law of gravitation, Einstein was confronted with a seemingly impossible task—how does one generalize the Special Principle of Relativity on which STR is based to a General Principle of Relativity upon which a GTR would be based. This proved to be a more difficult task than originally envisaged. So how did Einstein do it?
Box 1.1, which shows the thinking he used and the conceptual strategies he deployed to get there, demonstrates how meticulous conceptualization is absolutely vital in the formulation of new theories of any significance (Einstein, 1961).
—Paving the way to GTR
According to the Special Principle of Relativity, which provided the basis for Einstein's STR, if two reference frames (or reference bodies, say K and K′) are in uniform motion with respect to each other (i.e., were Galilean frames of reference), then the laws of physics (and indeed the behavior of all physical objects) would have the same form (or behave the same way) in both frames. Einstein successfully developed the correct equations of transformation that should relate such Galilean frames so that the equations of physics and the behavior of natural phenomena looked the same in both.
However, this meant that there are only certain frames of reference (or reference bodies, as he called them) relative to which the laws of physics are the same.
Einstein's burning question was: why should this property of nature be limited to such Galilean frames (or reference bodies K, K′) only? By accepting this distinction, he noted that we were giving these frames a privileged position among all possible frames of reference. He argued vehemently against such a proposition, and proposed the first form of his General Principle of Relativity as follows:
All bodies of references K, K′ are equivalent for the description of natural phenomena (formulation of the general laws of nature), whatever may be their state of motion.
After making such a statement, Einstein immediately saw the enormous task that lay before him, for it was obvious that this statement is not true even in the simplest cases of non-uniform relative motion of two frames.
He noted that whenever the motion of the reference body becomes non-uniform, the mechanical behavior of bodies around it change, that is, they appear to obey different laws of physics from before. In the simplest case, if the reference body is decelerated, the behavior of the bodies relative to it is different from before (for instance, they may seem to accelerate). They thus seem to be responding to laws of motion that are different from before.
Undaunted, Einstein pressed on. He saw that the new behavior may be explainable in terms of the appearance of gravitational fields, and he decided to look more closely at the nature of such fields. Einstein began by re-examining the well-known property of bodies moving under the effect of gravitational fields that they receive the same acceleration, regardless of their mass or other physical properties.
He noted that this could be used to show that the gravitational mass of a body is equal to its inertial mass. This equality had been noted by previous physicists such as Newton, but (in Einstein's words) “it had not been interpreted.” Einstein went the extra distance and interpreted this observation by asserting that the inertial mass of a body is the property that measures the inertia of the body, while the gravitational mass measures the weight, that is, the effectiveness of the action of a gravitational field on the body.
Armed with these thoughts, he continued his examination of the gravitational field and its relation to accelerating frames. In his famous thought experiment of the observer in a cage in an isolated corner of the universe, he showed how the acceleration field generated by an “external being” pulling the cage upward with a constant force could be interpreted by the observer inside the cage as being fully equivalent to a gravitational field—to the extent of showing that this field acting on the gravitational mass of a body suspended inside the cage gave exactly the same results as the inertial mass of the same body being accelerated by the force applied by the external observer.
He used this as the starting point for his attempt to rationalize the appearance of the gravitational field in the cage, and he generalized this example by asserting that any non-uniform frame of reference (or reference body K′) is equivalent to a uniform frame K in the presence of a gravitational field.
Einstein next observed that both the classical mechanics of Newton and his STR were wrong in restricting the principle of relativity to Galilean frames only.
In what may be termed one of his most pivotal decisions, Einstein now re-asserted his belief in the General Principle of Relativity. To illustrate his stance, he used the example of a gas cooker with two identical pots sitting on two burners. One observes that steam is rising out of only one of the pots, and is puzzled until one looks underneath and observes a “bluish something” underneath one of the pots. The observer is satisfied that this is the reason for the difference. Einstein next argued that if you look underneath and do not see anything odd (as in the situation with non-uniform frames) then you should be concerned and take up the intellectual challenge of explaining the cause of the rising steam.
Einstein suspected that the answer lay in the gravitational field and continued to examine its effects. He showed that a ray of light (e.g., inside the cage) will be bent in a gravitational field. He also made the simple but pertinent deduction from this observation that the speed of light would no longer be constant in a gravitational field.
He next noted that the General Principle of Relativity (GPR) (one prototype of which is reflected in the findings in the cage experiment) not only allows us to investigate special cases of the effects of gravitational fields, but also offers the opportunity to derive the general law of gravity itself. This would give him one equation of physics that satisfied the GPR and, by inference, suggests that others would satisfy the same condition.
However, before he could do this, Einstein pointed out that we had to investigate what effect, if any, a gravitational field has on the nature of the space–time continuum. He now began examining the effect (if any) of gravity on space–time. This was important, as he had previously (in his STR work) characterized space–time using (physical) clocks and rods laid out to form a Cartesian coordinate system.
Using the example of a rotating disc as a reference frame, he used his findings from his STR to show that clocks and measuring rods are affected by the gravitational field felt by an observer sitting in the peripheral region. From these observations, he inferred that clocks and measuring rods would not give reasonable measures of time and distance in any gravitational field.
During the measurement of the circumference, the measuring rod was placed tangentially and thus experienced length contraction, but retained its original length in the radial measurement of the diameter. This made the ratio of the circumference to the diameter larger than π, revealing that Euclidean geometry was no longer followed in a gravitational field.
In his STR work, Einstein had become acutely aware that physics existed in the real (physical) world, and had noted the importance of physically characterizing space–time using clocks and measuring rods. He had also noted that the most suitable coordinate system for the characterization of space–time is the Cartesian coordinate system. However, using the example of an unevenly heated marble top, he now demonstrated that Cartesian coordinates could not be used to characterize non-Euclidean space–time. One therefore had to find a new way (other than Cartesian coordinates) to characterize space–time in the presence of gravitational fields.
- He then realized that the (purely mathematically determined) Gaussian coordinate system could be used to describe space–time in the presence of a gravitational field. In the Cartesian coordinate system, the distance element ds between two neighboring points P and P′ in the space–time continuum is given by the simple relationwhere dx1 etc. are the differences in the coordinates of P and P′.(1.1)In the Gaussian coordinate system, the distance element is given by the more general expressionwhere the coefficients g11 etc. are generally different from unity for the case of non-Euclidean spaces, and the whole equation reduces to the simple form given by Eq. (1.1) for the Cartesian coordinates in the case of Euclidean space–time geometry.(1.2)
Einstein now reverted to the space–time continuum relevant to STR, where there are no gravitational fields, and noted that it was Euclidean, and could be characterized by Cartesian coordinate systems set up using clocks and measuring rods.
He also formally noted that space–time continua containing gravitational fields are generally non-Euclidean and cannot be characterized using Cartesian coordinate systems. However, they can be characterized using Gaussian coordinate systems, where the distance element does not in general take the simple “sum of squares” form.
He now realized that, unlike the STR case, space–time could be characterized in terms of a purely mathematical coordinate system. Indeed, this system could take the place of the (physical) reference bodies used in STR. Using these findings, he restated his General Principle of Relativity as
All Gaussian coordinate systems are essentially equivalent for the formulation of the general laws of nature.
Once the principle was established in this final form, the remaining task was to find a law of nature (in this case the law of gravitation), that satisfied this principle. It was well known that tensor equations are form-invariant under any transformation. Einstein now stated his Principle of General Covariance, according to which all laws of physics are to be expressed in covariant form using tensors (Walters, 2020). So, the first step was to find tensors that could represent physical quantities. The rest is well-documented history.
1.5 Einstein's Conceptual Techniques
As the box shows, Einstein used several conceptual techniques in arriving at his final result. These included (but were not limited to) the following:
Reinterpreting the equivalence of inertial and gravitational masses by assigning new significance to the two types of masses (a “degeneracy of concepts” approach).
Generalization from specific examples to general principles on several occasions:
The gravitational field that appears in the cage experiment is equivalent to an acceleration field caused by any force as measured from an external non-accelerated reference frame.
The effect of a gravitational field on the path of a light beam in the specific cage experiment implies that light is bent in any gravitational field.
Measurements of time and length on a rotating disc show that time measurements will be affected, and the geometry of the space would become non-Euclidean in any arbitrary gravitational field.
Measurements of lengths in the specific example of a differentially heated marble slab demonstrate that it is not possible to use Cartesian coordinates to characterize points in any non-Euclidean space–time continuum.
Perhaps the most important conceptual device that Einstein used was what may be termed a “Declaration of preferred hypothesis.” As the name implies, this was the statement of a physical principle that Einstein preferred over other similar principles (which he believed were restrictive).
Despite the obvious experimental evidence that the laws of physics (or mechanics at least) do not have the same form in frames with non-uniform motion, he argued that the contrary should be true, and stated his first form of the Principle of General Relativity (GPR). Later, he asserted his Principle of General Covariance, which formed the basis for the elucidation of the Einstein Field Equations.
He used the analogy of the gas cooker to try to justify the declaration of his GPR. The validity of his argument may be debated, but we will not go into a protracted philosophical argument here to either defend or oppose his proposition. The important point was that he stuck to his guns and, as a result, eventually developed the final form of the General Principle of Relativity. Once this feat was achieved, his (forced declaration of the) Principle of General Covariance for all laws of physics and especially of the law of gravitation (i.e., the Einstein Field Equations) was an easier task.
1.6 The Importance of the Concept of Symmetry
What drove Einstein to assert the GPR so firmly? Or what gave him the conviction and the courage to do so?
While he may not have stated this in words, it is clear that Einstein's compelling urge to generalize the invariance of physical laws from non-accelerated frames only to any accelerated frame arose form a personal conviction of the central role of symmetry in the elucidation of the behavior of the physical world. Others have been driven by similar considerations, both before and well after Einstein (notable examples being the discovery of the anti-particle, and the work of Gell-Mann and later workers in high energy physics). Indeed, this consideration is no stranger to us ordinary mortals, as we often cite symmetry as partial support for our arguments in the study of physics, as well as in the elaboration of other situations in life.
The really interesting question about the concept of symmetry is where this idea comes from in the first place. The property of symmetry is often associated with the vague concept of nature. What, indeed, is nature, and could this possibly be the source of symmetry in the universe? The last chapters of this book take a bold stance on addressing this question.
1.7 Structure of This Book
This book begins with a critical look at concepts and the role they play in our understanding of the physical (and social) worlds on the one hand, and what they reveal about our theoretical and imagined worlds of mathematics, fantasy, and other realms on the other.
Part II of the book focuses on the physical world, and takes a close look at the main concepts and theories upon which our present understanding of physics is based. It discusses how these concepts have evolved over time, and how certain “newer” concepts have taken precedence over older ones. It points out, however, that the older concepts are not to be regarded as a spent force, as they may yet produce useful insights [in the form, for instance, of the proposed Fundamental Energy-Mass Transformation (FEMT) model] into the present unanswered questions of physics.
The role of planning and developing experiments is just as critical to the experimental verification of theory as are the experimental results. The observer plays a vital role in verifying that results are real. The reader is reminded that the primary interest of the physicist is to understand physical reality. It is the experimental physicist who furnishes this role in the final instance.
It can be argued that the role of the human (observer) is central to both the development of physics and the formulation of theories and constructs in the other “worlds” mentioned in Part I. Part II goes on to show how history, the design of experiments in physics, and the implementation of the experiments all make their own indispensable contributions to the final form of what we perceive as the new physical reality.
Part III discusses issues with current theories of physics that have yet to be addressed. It stresses the need for an expanded conceptual set to construct a new conceptual framework for these issues. It ends by proposing some of the requirements that this new framework must satisfy.