Sabine Hossenfelder: Backreaction: January 2015

Wednesday, January 28, 2015

No, the “long sought-after link between the theories of quantum mechanics and general relativity” has not been found

The Physics arXiv blog has praised a paper called “On the weight of entanglement” and claimed that the author, David Edward Bruschi, found a new link between quantum mechanics and general relativity. Unfortunately, the paper is mostly wrong, and that what isn’t wrong isn’t new.

It is well known that quantum particles too must have gravitational fields and that measuring these gravitational fields would in principle tell us something about the quantization of gravity. Whenever you have a state in a superposition of two position states, its gravitational field too should be in a superposition. However, the gravitational field of all particles, elementary or composite, that display quantum properties is way too small to be measured. Even if you take the heaviest things that have yet been brought in superpositions of location you are still about 30 orders of magnitude off. I have done these estimates dozens of times.

The only way you can find larger effects is if you exploit secondary consequences of models that are not just perturbatively quantized gravity. For example the Schrödinger-Newton equation that assumes that the gravitational field remains classical even though particles are quantized can have odd side effects like preventing particle dispersion, or reducing the Heisenberg uncertainty. These effects can be somewhat larger, but they are still much too small to be measurable. The problem is always the same: gravity is weak, really weak. Nobody has ever measured the gravitational field of an atom. We measure gravitational fields of large things: balls, mountains, planets.

In the new paper, the author argues that entanglement “has weight.” By this he seems to mean that the full entangled state couples to gravity. It would be more surprising if that wasn’t so, but the treatment in the paper is problematic for several reasons.

The biggest problem is that the author in the paper uses semi-classical gravity. That means he couples the expectation value of the stress-energy to the space-time background, not the operator, which you would do were you using perturbatively quantized gravity. It is remarkable that he doesn’t discuss this at all. He doesn’t mention any problems with this approach (discussed here), neither does he mention tests of the Schrödinger-Newton equation (discussed here). This makes me think that he must be very new to the field.

Using the semi-classical limit in the case discussed in the paper is problematic because this semi-classical approach does not only break down when you have strong gravity. It also breaks down when you have a large quantum uncertainty in the distribution of the stress-energy. Here “large” means that the uncertainty is larger than the typical width of the distribution. This can be formally shown, but it is intuitively clear: In such cases the gravitational field also must have quantum properties. While these deviations from the semi-classical limit do exist at small energies, they are too weak to be measurable. That the semi-classical limit doesn’t work in these cases has been discussed by Ford and others 30 years ago, see for example these lecture notes from 1997, page 34, and the therein mentioned reference of Ford’s1982 paper.

By using semi-classical gravity and then looking at the non-relativistic case, the new paper basically reinvents the Schrödinger-Newton limit. To make this really clear: the Schrödinger-Newton limit in this case is widely believed to be wrong for good reasons. Using it is a non-standard assumption about perturbatively quantized gravity. The author doesn’t seem to be aware of this.

He then points out that the interference terms of the state makes a contribution to the distribution of stress-energy, which is correct. This has previous been done for superposition states. I am not aware that it has previously also been done for entangled states, but since it isn’t measureable for superpositions, it seems rather pointless to look at states that are even more difficult to create.

He then argues that measuring this term would tell you something about how quantum states couple to gravity. This is also correct. He goes on to find that this is more than 30 orders of magnitude too small to be measurable. I didn’t check the numbers but this sounds plausible. He then states that “one could hope to increase the above result” by certain techniques and that “this could in principle make the effect measurable”. This is either wrong or nonsense, depending on how you look at it. The effect is “in principle” measurable, yes. Quantum gravity is “in principle measurable”, we know this. The problem is that all presently known effect aren’t measurable in practice, including the effect mentioned in the paper, as I am sure the author will figure out at some point. I am very willing to be surprised of course.

As a side remark, for all I can tell the state that he uses isn’t actually an entangled state. It is formally written as an entangled state (in Eq (4)), but the states labeled |01>; and |10> are single particle states, see Eq(5). This doesn’t look like an entangled state but like a superposition of two plane waves with a phase-difference. Maybe this is a typo or I’m misreading this definition. Be that as it may, it doesn’t make much of a difference for the main problem of the paper, which is using the semi-classical limit. (Update: It’s probably a case of details gotten lost in notation, see note added below.)

The author, David Edward Bruschi, seems to be a fairly young postdoc. He probably doesn’t have much experience in the field so the lack of knowledge is forgivable. He lists in the acknowledgements Jacob Bekenstein, who also has formerly tried his hands on quantum gravity phenomenology and failed, though he got published with it. I am surprised to find Bei-Lok Hu in the acknowledgements because he’s a bright guy and should have known better. On the other hand, I have certainly found myself in acknowledgements of papers that I hadn’t even seen, and on some instances had to insist being removed from the acknowledgement list, so that might not mean much.

Don’t get me wrong, the paper isn’t actually bad. This would have been a very interesting paper 30 years ago. But we’re not living in the 1980s. Unfortunately the author doesn’t seem to be familiar with the literature. And the person who has written the post hyping this paper doesn’t seem to know what they were talking about either.

In summary: Nothing new to see here, please move on.

[Note added: It was suggested to me that the state |0> defined in the paper above Eq(5) was probably meant to be a product state already, so actually a |0>|0>. The creation operators in Eq(5) then act on the first or second zero respectively. Then the rest would make sense. I’m not very familiar with the quantum information literature, so I find this a very confusing notation. As I said above though, this isn’t the relevant point I was picking at.]

Monday, January 26, 2015

Book review: "Cracking the Particle Code of the Universe" by John Moffat

Cracking the Particle Code of the Universe: the Hunt for the Higgs Boson
By John W Moffat
Oxford University Press (2014)

John Moffat’s new book covers the history of the Standard Model of particle physics from its beginnings to the recent discovery of the Higgs boson – or, as Moffat cautiously calls it, the new particle most physicists believe is the Standard Model Higgs. But Cracking the Particle Code of the Universe isn’t just any book about the Standard Model: it’s about the model as seen through the eyes of an insider, one who has witnessed many fads and statistical fluctuations come and go. As an emeritus professor at the University of Toronto, Canada and a senior researcher at the nearby Perimeter Institute, Moffat has the credentials to do more than just explain the theory and the experiments that back it up: he also offers his own opinion on the interpretation of the data, the status of the theories and the community’s reaction to the discovery of the Higgs.

The first half of the book is mainly dedicated to introducing the reader to the ingredients of the Standard Model, the particles and their properties, the relevance of gauge symmetries, symmetry breaking, and the workings of particle accelerators. Moffat also explains some proposed extensions and alternatives to the Standard Model, such as technicolor, supersymmetry, preons, additional dimensions and composite Higgs models as well as models based on his own work. In each case he lays out the experimental situation and the technical aspects that speak for and against these models.

In the second half of the book, Moffat recalls how the discovery unfolded at the LHC and comments on the data that the collisions yielded. He reports from several conferences he attended, or papers and lectures that appeared online, and summarizes how the experimental analysis proceeded and how it was interpreted. In this, he includes his own judgment and relates discussions with theorists and experimentalists. We meet many prominent people in particle physics, including Guido Altarelli, Jim Hartle and Stephen Hawking, to mention just a few. Moffat repeatedly calls for a cautious approach to claims that the Standard Model Higgs has indeed been discovered, and points out that not all necessary characteristics have been found. He finds that the experimentalists are careful with their claims, but that the theoreticians jump to conclusions.

The book covers the situation up to March 2013, so of course it is already somewhat outdated; the ATLAS collaboration’s evidence for the spin-0 nature of the Higgs boson was only published in June 2013, for example. But this does not matter all that much because the book will give the dedicated reader the necessary background to follow and understand the relevance of new data.

Moffat’s writing sometimes gets quite technical, albeit without recourse to equations, and I doubt that readers will fully understand his elaborations without at least some knowledge of quantum field theory. He introduces the main concepts he needs for his explanations, but he does so very briefly; for example, his book features the briefest explanation of gauge invariance I have ever come across, and many important concepts, such as cross-sections or the relation between the masses of force-carriers and the range of the force, are only explained in footnotes. The glossary can be used for orientation, but even so, the book will seem very demanding for readers who encounter the technical terms for the first time. However, even if they are not able to follow each argument in detail, they should still understand the main issues and the conclusions that Moffat draws.

Towards the end of the book, Moffat discusses several shortcomings of the Standard Model, including the Higgs mass hierarchy problem, the gauge hierarchy problem, and the unexplained values of particle masses. He also briefly mentions the cosmological constant problem, as it is related to questions about the nature of the vacuum in quantum field theory, but on the whole he stands clear from discussing cosmology. He does, however, comment on the anthropic principle and the multiverse and does not hesitate to express his dismay about the idea.

While Moffat gives some space to discussing his own contributions to the field, he does not promote his point of view as the only reasonable one. Rather, he makes a point of emphasizing the necessity of investigating alternative models. The measured mass of the particle-that-may-be-the-Higgs is, he notes, larger than expected, and this makes it even more pressing to find models better equipped to address the problems with “naturalness” in the Standard Model.

I have met Moffat on various occasions and I have found him to be not only a great physicist and an insightful thinker, but also one who is typically more up-to-date than many of his younger colleagues. As the book also reflects, he closely follows the online presentations and discussions of particle physics and particle physicists, and is conscious of the social problems and cognitive biases that media hype can produce. In his book, Moffat especially criticizes bloggers for spreading premature conclusions.

Moffat’s recollections also document that science is a community enterprise and that we sometimes forget to pay proper attention to the human element in our data interpretation. We all like to be confirmed in our beliefs, but as my physics teacher liked to say “belief belongs into the church.” I find it astonishing that many theoretical physicists these days publicly express their conviction that a popular theory “must be” right even when still unconfirmed by data – and that this has become accepted behavior for scientists. A theoretician who works on alternative models today is seen too easily as an outsider (a non-believer), and it takes much courage, persistence, and stable funding sources to persevere outside mainstream, like Moffat has done for decade and still does. This is an unfortunate trend that many in the community do not seem to be aware of, or do not see why it is of concern, and it is good that Moffat in his book touches on this point.

In summary, Moffat’s new book is a well-done and well-written survey of the history, achievements, and shortcomings of the Standard Model of particle physics. It will equip the reader with all the necessary knowledge to put into context the coming headlines about new discoveries at the LHC and future colliders.

This review first appeared in Physics World on Dec 4th under the title "A strong model, with flaws".

Thursday, January 22, 2015

Is philosophy bad science?

In reaction to my essay “Does the scientific method need revision?” some philosophers at Daily Nous are discussing what I might have meant in a thread titled “Philosophy as Bad Science?”:

“[S]he raises concerns about physicists being led astray by philosophers (Richard Dawid is mentioned as an alleged culprit [...]) into thinking that observation and testability through experimentation can be dispensed with. According to her, it may be alright for mathematicians and philosophers to pontificate about the structure of the universe without experimentation, but that, she says, is not what scientists should be doing.”

The internet has a funny way of polarizing opinions. I am not happy about this, so some clarifications.

First, I didn’t say and didn’t mean that philosophy is “bad science,” I said it’s not science. I am using the word “science” here as it’s commonly used in English where (unlike in German) science refers to study subjects that describe observations, in the broad sense.

Neither am I saying that philosophy is something physicists shouldn’t be doing at all, certainly not. Philosophy, as well as the history and sociology of science, can be very helpful for the practicing physicist to put their work in perspective. Much of what is today subject of physics was anticipated by philosophers thousands of years ago, such as the question whether nature is fundamentally discrete or continuous.

Scientists though should in the first place do science, ie their work should at least aim at describing observations.

Physicists today by and large don’t pay much attention to philosophy. In most fields that doesn’t matter much, but the closer research is to fundamental questions, the more philosophy comes into play. Presently that is mostly in quantum foundations, cosmology and quantum gravity (including string theory), and beyond-the-standard-model physics that relies on arguments of naturalness, simplicity or elegance.

Physicists are not “led astray by philosophers” because they don’t actually care what philosophers say. What is happening instead is that some physicists — well, make that string-theorists — are now using Richard Dawid’s arguments to justify continuing what they’re doing. That’s okay, I also think philosophers are better philosophers if they agree with what I’ve been saying all along.

I have no particular problem with string theorists, most of which today don’t actually do string theory any more, they do AdS/CFT. Which is fine by me, because much of the appeal of the gauge-gravity duality is its potential for phenomenological applications. (Then the problem is that they’re all doing the same, but I will complain about this another time.) String theory takes most of the heat simply because there are many string theorists and everybody has heard of it.

Just to be clear, when I say “phenomenology” I mean mathematical models describing observations. Phenomenology is what connects theory with experiment. The problem with today’s research communities is that the gap between theory and experiment is constantly widening and funding agencies have let it happen. With the gap widening, the theory is increasingly more mathematics and/or philosophy and increasingly less science. How wide a gap is too wide? The point I am complaining about is that the gap has become to wide. We have a lack of balance between theory disconnected from observation and phenomenology. Without phenomenology to match a theory to observation, the theory isn’t science.

Studying mathematical structures can be very fruitful for physics, sure. I understand that it takes time to develop the mathematics of a theory until it can be connected to observations, and I don’t think it makes much sense setting physicists a deadline by which insights must have appeared. But problems arise if research areas in physics which are purely devoted to mathematics, or are all tangled up in philosophy, become so self-supportive that they stop even trying to make contact to observation.

I don’t know how often I have talked to young postdocs in quantum gravity and they do not show the slightest intention to describe observation. The more senior people have at least learned the lip confessions to be added whenever funding is at stake, but it is pretty obvious that they don’t actually want to bother with observations. The economists have a very useful expression that is “revealed preferences.” It means, essentially, don’t listen to what they say, look at what they do. Yes, they all say phenomenology is important, but nobody works on it. I am sure you can name off the top of your head some dozen or so people working on quantum gravity, the theory. How many can you name who work on quantum gravity phenomenology? How many of these have tenure? Right. Why hasn’t there been any progress in quantum gravity? Because you can’t develop a theory without contact to observation.

It is really a demarcation issue for me. I don’t mind if somebody wants to do mathematical physics or philosophy of science. I just don’t want them to pretend they’re doing physics. This is why I like the proposal put forward by George Ellis and Joe Silk in their Nature Comment:

“In the meantime, journal editors and publishers could assign speculative work to other research categories — such as mathematical rather than physical cosmology — according to its potential testability. And the domination of some physics departments and institutes by such activities could be rethought.”

Tuesday, January 20, 2015

Does the Scientific Method need Revision?

Theoretical physics has problems. That’s nothing new — if it wasn’t so, then we’d have nothing left to do. But especially in high energy physics and quantum gravity, progress has basically stalled since the development of the standard model in the mid 70s. Yes, we’ve discovered a new particle every now and then. Yes, we’ve collected loads of data. But the fundamental constituents of our theories, quantum field theory and Riemannian geometry, haven’t changed since that time.

Everybody has their own favorite explanation for why this is so and what can be done about it. One major factor is certainly that the low hanging fruits have been picked, and progress slows as we have to climb farther up the tree. Today, we have to invest billions of dollars into experiments that are testing new ranges of parameter space, build colliders, shoot telescopes into orbit, have superclusters flip their flops. The days in which history was made by watching your bathtub spill over are gone.

Another factor is arguably that the questions are getting technically harder while our brains haven’t changed all that much. Yes, now we have computers to help us, but these are, at least for now, chewing and digesting the food we feed them, not cooking their own.

Taken together, this means that return on investment must slow down as we learn more about nature. Not so surprising.

Still, it is a frustrating situation and this makes you wonder if not there are other reasons for lack of progress, reasons that we can do something about. Especially in a time when we really need a game changer, some breakthrough technology, clean energy, that warp drive, a transporter! Anything to get us off the road to Facebook, sorry, I meant self-destruction.

It is our lacking understanding of space, time, matter, and their quantum behavior that prevents us from better using what nature has given us. And it is this frustration that lead people inside and outside the community to argue we’re doing something wrong, that the social dynamics in the field is troubled, that we’ve lost our path, that we are not making progress because we keep working on unscientific theories.

Is that so?

It’s not like we haven’t tried to make headway on finding the quantum nature of space and time. The arxiv categories hep-th and gr-qc are full every day with supposedly new ideas. But so far, not a single one of the existing approaches towards quantum gravity has any evidence speaking for it.

To me the reason this has happened is obvious: We haven’t paid enough attention to experimentally testing quantum gravity. One cannot develop a scientific theory without experimental input. It’s never happened before and it will never happen. Without data, a theory isn’t science. Without experimental test, quantum gravity isn’t physics.

If you think that more attention is now being paid to quantum gravity phenomenology, you are mistaken. Yes, I’ve heard them too, the lip confessions by people who want to keep on dwelling on their fantasies. But the reality is there is no funding for quantum gravity phenomenology and there are no jobs either. On the rare occasions that I have seen quantum gravity phenomenology mentioned on a job posting, the position was filled with somebody working on the theory, I am tempted to say, working on mathematics rather than physics.

It is beyond me that funding agencies invest money into developing a theory of quantum gravity, but not into its experimental test. Yes, experimental tests of quantum gravity are farfetched. But if you think that you can’t test it, you shouldn’t put money into the theory either. And yes, that’s a community problem because funding agencies rely on experts’ opinion. And so the circle closes.

To make matters worse, philosopher Richard Dawid has recently argued that it is possible to assess the promise of a theory without experimental test whatsover, and that physicists should thus revise the scientific method by taking into account what he calls “non-empirical facts”. By this he seems to mean what we often loosely refer to as internal consistency: theoretical physics is math heavy and thus has a very stringent logic. This allows one to deduce a lot of, often surprising, consequences from very few assumptions. Clearly, these must be taken into account when assessing the usefulness or range-of-validity of a theory, and they are being taken into account. But the consequences are irrelevant to the use of the theory unless some aspects of them are observable, because what makes up the use of a scientific theory is its power to describe nature.

Dawid may be confused on this matter because physicists do, in practice, use empirical facts that we do not explicitly collect data on. For example, we discard theories that have an unstable vacuum, singularities, or complex-valued observables. Not because this is an internal inconsistency — it is not. You can deal with this mathematically just fine. We discard these because we have never observed any of that. We discard them because we don’t think they’ll describe what we see. This is not a non-empirical assessment.

A huge problem with the lack of empirical fact is that theories remain axiomatically underconstrained. In practice, physicists don’t always start with a set of axioms, but in principle this could be done. If you do not have any axioms you have no theory, so you need to select some. The whole point of physics is to select axioms to construct a theory that describes observation. This already tells you that the idea of a theory for everything will inevitably lead to what has now been called the “multiverse”. It is just a consequence of stripping away axioms until the theory becomes ambiguous.

Somewhere along the line many physicists have come to believe that it must be possible to formulate a theory without observational input, based on pure logic and some sense of aesthetics. They must believe their brains have a mystical connection to the universe and pure power of thought will tell them the laws of nature. But the only logical requirement to choose axioms for a theory is that the axioms not be in conflict with each other. You can thus never arrive at a theory that describes our universe without taking into account observations, period. The attempt to reduce axioms too much just leads to a whole “multiverse” of predictions, most of which don’t describe anything we will ever see.

(The only other option is to just use all of mathematics, as Tegmark argues. You might like or not like that; at least it’s logically coherent. But that’s a different story and shall be told another time.)

Now if you have a theory that contains more than one universe, you can still try to find out how likely it is that we find ourselves in a universe just like ours. The multiverse-defenders therefore also argue for a modification of the scientific method, one that takes into account probabilistic predictions. But we have nothing to gain from that. Calculating a probability in the multiverse is just another way of adding an axiom, in this case for the probability distribution. Nothing wrong with this, but you don’t have to change the scientific method to accommodate it.

In a Nature comment last month, George Ellis and Joe Silk argue that the trend of physicists to pursue untestable theories is worrisome. I agree with this, though I would have said the worrisome part is that physicists do not care enough about the testability — and apparently don’t need to care because they are getting published and paid regardless.

See, in practice the origin of the problem is senior researchers not teaching their students that physics is all about describing nature. Instead, the students are taught by example that you can publish and live from outright bizarre speculations as long as you wrap them into enough math. I cringe every time a string theorist starts talking about beauty and elegance. Whatever made them think that the human sense for beauty has any relevance for the fundamental laws of nature?

The scientific method is often quoted as a circle of formulating and testing of hypotheses, but I find this misleading. There isn’t any one scientific method. The only thing that matters is that you honestly assess the use of a theory to describe nature. If it’s useful, keep it. If not, try something else. This method doesn’t have to be changed, it has to be more consistently applied. You can’t assess the use of a scientific theory without comparing it to observation.

A theory might have other uses than describing nature. It might be pretty, artistic even. It might be thought-provoking. Yes, it might be beautiful and elegant. It might be too good to be true, it might be forever promising. If that’s what you are looking for that’s all fine by me. I am not arguing that these theories should not be pursued. Call them mathematics, art, or philosophy, but if they don’t describe nature don’t call them science.

This post first appeared Dec 17 on Starts With a Bang.

Thursday, January 15, 2015

I'm a little funny

What I do in the library when I have a bad hair day ;)

The shirt was a Christmas present from my mother. I happened to wear it that day and then thought it fits well enough. It's too large for me though, apparently they don't cater to physicists in XS.

My voice sounds like sinus infection because sinus infection, sorry about that.

Wednesday, January 14, 2015

How to write your first scientific paper

The year is young and the arxiv numbers are now a digit longer, so there is much space for you to submit your groundbreaking new work. If it wasn't for the writing, I know.

I recently had to compile a publication list with citation counts for a grant proposal, and I was shocked when inspire informed me I have 67 papers, most of which got indeed published at some point. I'm getting old, but I'm still not wise, so to cheer me up I've decided at least I'm now qualified to give you some advice on how to do it.

First advice is to take it seriously. Science isn't science unless you communicate your results to other people. You don't just write papers because you need some items on your publication list or your project report, but to tell your colleagues what you have been doing and what are the results. You will have to convince them to spend some time of their life trying to retrace your thoughts, and you should make this as pleasant for them as possible.

Second advice: When in doubt, ask Google. There are many great advice pages online, for example this site from Writing@CSU explains the most common paper structure and what each section should contain. The Nature Education covers the same, but also gives some advice if English is not your native language. Inside Higher ED has some general advice on how to organize your writing projects.

I'll not even try to compete with these advice pages, I just want to add some things I've learned, some of which are specific to theoretical physics.

If you are a student, it is highly unlikely that you will write your first paper alone. Most likely you will write it together with your supervisor and possibly some other people. This is how most of us learn writing papers. Especially the structure and the general writing style is often handed down rather than created from scratch. Still, when the time comes to do it all on your own, questions crop up that previously didn't even occur to you.

Before you start writing

Ask yourself who is your intended audience. Are you writing to a small and very specialized community, or do you want your paper to be accessible to as many people as possible? Trying to increase your intended audience is not always a good idea, because the more people you want to make the paper accessible to, the more you will have to explain, which is annoying for the specialists.

The audience for which your paper is interesting depends greatly on the content. I would suggest that you think about what previous knowledge you assume the reader brings, and what not. Once you've picked a level, stick with it. Do not try to mix a popular science description with a technical elaboration. If you want to do both, better do this separately.

Then, find a good order in which to present your work. This isn't necessarily always the order in which you did it. I have an unfortunate habit of guessing solutions and only later justify these guesses, but I try to avoid doing this in my papers.

The Title

The title should tell the reader what the paper is about. Avoid phrases like "Some thoughts on" or "Selected topics in," these just tell the reader that not even you know what the paper is about. Never use abbreviations in the title, unless you are referring to an acronym of, say, an experiment or a code. Yes, just spell it out. If you don't see why, google that abbreviation. You will almost certainly find that it may mean five different things. Websearch is word-based, so be specific. Exceptions exist of course. AdS/CFT for example is so specific, you can use it without worries.

Keep in mind that you want to make this as easy for your readers as possible, so don't be cryptic when it's unnecessary.

There is some culture in theoretical physics to come up with witty titles (see my stupid title list), but if you're still working on being taken seriously I recommend to stay clear of "witty" and instead go for "interesting".

The Abstract

The abstract is your major selling point and the most difficult part of the paper. This is always the last part of the paper that I write. The abstract should explain which question you have addressed, why that is interesting, and what you have found, without going much into detail. Do not introduce new terminology or parameters in the abstract. Do not use citations in the abstract and do not use abbreviations. Instead, do make sure the most important keywords appear. Otherwise nobody will read your paper.

Time to decide which scientific writing style you find least awkward. Is it referring to yourself as "we" or "one"? I don't mind reading papers in the first person singular, but this arguably isn't presently the standard. If you're not senior enough to be comfortable with sticking out, I suggest you go with "we". It's easier than "one" and almost everybody does it.

PACS, MSC, Keywords

Almost all journals ask for a PACS or MSC classification and for keywords, so you might as well look them up when you're writing the paper. Be careful with the keywords. Do not tag your paper as what you wish it was, but as what it really is, otherwise you will annoy your readership, not to mention your referees who will be chosen based on your tagging. I frequently get papers submitted as "phenomenology" that have no phenomenology in them whatsoever. In some cases it has been pretty obvious that the authors didn't even know what the word means.

The Introduction

The introduction is the place to put your work into context and to explain your motivation for doing the work. Do not abuse the introduction to write a review of the field and do not oversell what you are doing, keep this for the grant proposals. If there is a review, refer to the review. If not, list the works most relevant to understand your paper. Do not attempt to list all work on the subject, unless it's a really small research area. Keep in mind what I told you about your audience. They weren't looking for a review.

Yes, this is the place to cite all your friends and your own papers, but be smart about it and don't overdo it, it doesn't look good. Excessive self-cites are a hallmark of crackpottery and desperation. They can also be removed from your citation count with one click. The introduction often ends with a layout of the sections to come and notations or abbreviations used.

Try to avoid reusing introductions from yourself, and certainly from other people. It doesn't look good if your paper gets marked as having a text overlap with some other paper. If it's just too tempting, I suggest you read whatever introduction you like, then put it away, and rewrite the text roughly as you recall it. Do not try to copy the text and rearrange the sentences, it doesn't work.

Methods, Technics, Background

The place to explain what you're working with, and to remind the reader of the relevant equations. Make sure to introduce all parameters and variables. Never refer to an equation only by name if you can write it down. Make this easy for your readers and don't expect them to go elsewhere to convert mentioned equation into your notation.

If your paper is heavy on equations, you will probably find yourself repeating phrases like "then we find", "so we get", "now we obtain", etc. Don't worry, nobody expects you to be lyrical here. In fact, I find myself often not even noticing these phrases anymore.

Main Part

Will probably contain your central analysis, whether analytical or numerical. If possible, try to include some simplified cases and discuss limits of your calculation, because this can greatly enhance the accessibility. If you have very long calculations that are not particularly insightful and that you do not need in other places, consider exporting them into an appendix or supplementary material (expansions of special functions and so on).

Results

I find it helpful if the results are separate from the main part because then it's easier at first reading to skip the details. But sometimes this doesn't make sense because the results are basically a single number, or you have lead a proof and the main part is the result. So don't worry if you don't have a separate section for this. However, if the results of your study need much space to be represented, then this is the place to do it.

Be careful to compare your results to other results in the fields. The reader wants to know what is new about your work, or what is different, or what is better. Do you confirm earlier results? Do you improve them? Is your result in disagreement with other findings? If not, how is it different?

Discussion

In most papers the discussion is a fluff part where the author can offer their interpretation of the results and tell the reader all that still has to be done. I also often use it to explicitly summarize all assumptions that I have made along the way, because that helps putting the results into context. You can also dump there all the friendly colleagues who will write to you after submission to "draw your attention to" some great work of theirs that you unfortunately seemed to have missed. Just add their reference with a sentence in the discussion and everybody is happy.

Conclusion/Summary

Repeat the most relevant part of the results, emphasize especially what is new. Write the conclusion so that it is possible to understand without having read the rest of the paper. Do not mash up the conclusion with the discussion, because you will lose those readers who are too impatient to make it through your interpretations to get to the main point.

References

Give credit where credit is due. You might have first read about some topic in a fairly recent paper, but you should try to find the original source and cite that too. Reference lists are very political. If this is one of your first papers in the field, I recommend you ask somebody who knows "the usual suspects" if you have forgotten somebody important. If you forget to cite many relevant references you will look like you don't know the subject very well, regardless of how many textbooks or review articles you have read.

If you cite online resources, you should include the date at which you have last accessed the reference to your quotation.

Keep your reference lists in good order, it's time well spent. You will probably be able to reuse them many times.

Figures

Include figures when they are useful, not just because you have them. Figures should always contain axis labels, and if you are using dimensionful units, they should include the units. Explain in the figure caption what's shown in the image; explain it as if the reader has not read the text. It's okay if it's repetitive.

If anyhow possible avoid figures that can only be understood when printed in color. Use different line styles or widths in addition to different colors. Be very careful with 3d plots. They are often more confusing than illuminating. Try to break them down into a set of 2d plots if you can.

Equations

Try to use notation that is close to that of the existing literature, it will make it vastly easier for people to understand your paper. Make sure you don't accidentally change notation throughout your calculations. If your equations get very long, try to condense them by breaking up expressions, or by introducing dimensionless variables, which can declutter expressions considerably.

SPELLCHECK (with caution)

I find it stunning that I still see papers full of avoidable typographical errors when one can spell check text online for free. Yes, I know it's cumbersome with the LaTeX code between the paragraphs, but if you're not spell checking your paper you're basically telling your reader you didn't think they're worth the time. Be careful though and don't let the cosmic ray become a comic ray.

... and sooner than you know you'll have dozens of publications to look back at!

Thursday, January 08, 2015

Do we live in a computer simulation?

Some days I can almost get myself to believe that we live in a computer simulation, that all we see around us is a façade designed to mislead us. There would finally be a reason for all this, for the meaningless struggles, the injustice, for life, and death, and for Justin Bieber. There would even be a reason for dark matter and dark energy, though that reason might just be some alien’s bizarre sense of humor.
It seems perfectly possible to me to trick a conscious mind, at the level of that of humans, into believing a made-up reality. Ask the guy sitting on the sidewalk talking to the trash bin. Sure, we are presently far from creating artificial intelligence, but I do not see anything fundamental that stands in way of such creation. Let it be a thousand years or ten thousand years, eventually we’ll get there. And once you believe that it will one day be possible for us to build a supercomputer that hosts intelligent minds in a world whose laws of nature are our invention, you also have to ask yourself whether the laws of nature that we ourselves have found are somebody else’s invention.

If you just assume the simulation that we might live in has us perfectly fooled and we can never find out if there is any deeper level of reality, it becomes rather pointless to even think about it. In this case the belief in “somebody else” who has created our world and has the power to manipulate it at his or her will differs from belief in an omniscient god only by terminology. The relevant question though is whether it is possible to fool us entirely.

Nick Bostrum has a simulation argument that is neatly minimalistic, though he is guilty of using words that end on ism. He is saying basically that if there are many civilizations running simulations with many artificial intelligences, then you are more likely to be simulated than not. So either you live in a simulation, or our universe (multiverse, if you must) never goes on to produce many civilizations capable of running these simulations for one reason or the other. Pick your poison. I think I prefer the simulation.

Math-me has a general issue with these kinds of probability arguments (same as with the Doomsday argument) because they implicitly assume that the probability distribution of lives lived over time is uncorrelated, which is clearly not the case since our time-evolution is causal. But this is not what I want to get into today because there is something else about Bostrum’s argument that has been bugging Physics-me.

For his argument, Bostrum needs a way to estimate how much computing power is necessary to simulate something like the human mind perceiving something like the human environment. And in his estimate he assumes, crucially, that it is possible to significantly compress the information of our environment. Physics-me has been chewing on this point for some while. The relevant paragraphs are:

“If the environment is included in the simulation, this will require additional computing power – how much depends on the scope and granularity of the simulation. Simulating the entire universe down to the quantum level is obviously infeasible, unless radically new physics is discovered. But in order to get a realistic simulation of human experience, much less is needed – only whatever is required to ensure that the simulated humans, interacting in normal human ways with their simulated environment, don’t notice any irregularities.

The microscopic structure of the inside of the Earth can be safely omitted. Distant astronomical objects can have highly compressed representations: verisimilitude need extend to the narrow band of properties that we can observe from our planet or solar system spacecraft. On the surface of Earth, macroscopic objects in inhabited areas may need to be continuously simulated, but microscopic phenomena could likely be filled in ad hoc. What you see through an electron microscope needs to look unsuspicious, but you usually have no way of confirming its coherence with unobserved parts of the microscopic world.

Exceptions arise when we deliberately design systems to harness unobserved microscopic phenomena that operate in accordance with known principles to get results that we are able to independently verify. The paradigmatic case of this is a computer. The simulation may therefore need to include a continuous representation of computers down to the level of individual logic elements. This presents no problem, since our current computing power is negligible by posthuman standards.”

This assumption is immediately problematic because it isn’t as easy as saying that whenever a human wants to drill a hole into the Earth you quickly go and compute what he has to find there. You would have to track what all these simulated humans are doing to know whenever that becomes necessary. And then you’d have to make sure that this never leads to any inconsistencies. Or else, if it does, you’d have to remove the inconsistency, which will add even more computing power. To avoid the inconsistencies, you’ll have to carry on all results for all future measurements that humans could possibly make, the problem being you don’t know which measurements they will make because you haven’t yet done the simulation. Dizzy? Don’t leave, I’m not going to dwell on this.

The key observation that I want to pick on here is that there will be instances in which The Programmer really has to ramp up the resolution to avoid us from finding out we’re in a simulation. Let me refer to what we perceive as reality as level zero, and a possible reality of somebody running our simulation as level 1. There could be infinitely many levels in each direction, depending on how many simulators simulate simulations.

This idea that structures depend on the scale at which they are tested and that at low energies you’re not testing all that much detail is basically what effective field theories are all about. Indeed, as Bostrom asserts, for much of our daily life the single motion of each and every quark is unnecessary information, atoms or molecules are enough. This is all fine by Physics-me.

Then these humans they go and build the LHC and whenever the beams collide the simulation suddenly needs a considerably finer mesh, or else the humans will notice there is something funny with their laws of nature.

Now you might think of blasting the simulation by just demanding so much fine-structure information all at once that the computer running our simulation cannot deliver. In this case the LHC would serve to test the simulation hypothesis. But there is really no good reason why the LHC should just be the thing to reach whatever computation limit exists at level 1.

But there is a better way to test whether we live in a simulation: Build simulations ourselves, the more the better. The reason is that you can’t compress what is already maximally compressed. So if the level 1 computation wants to prevent us from finding out that we live in a simulation by creating simulations ourselves, they’ll have to ramp up computational efficiency for that part of our level 0 simulation that is going to inhabit our simulation at level -1.

Now we try to create simulations that will create a simulation will create a simulation and so on. Eventually, the level 1 simulation will not be able to deliver any more, regardless of how good their computer is, and the then lowest level will find some strange artifacts. Something that is clearly not compatible with the laws of nature they have found so far and believed to be correct. This breakdown gets read out by the computer one level above, and so on, until it reaches us and then whatever is the uppermost level (if there is one).

Unless you want to believe that I’m an exceptional anomaly in the multiverse, every reasonably intelligent species should have somebody who will come up with this sooner or later. Then they’ll set out to create simulations that will create a simulation. If one of their simulations doesn’t develop into the direction of creating more simulations, they’ll scrap it and try a different one because otherwise it’s not helpful to their end.

This leads to a situation much like Lee Smolin’s Cosmological Natural Selection in which black holes create new universes that create black holes create new universes and so on. The whole population of universes then is dominated by those universes that lead to the largest numbers of black holes - that have the most “offspring.” In Cosmological Natural Selection we are most likely to find ourselves in a universe that optimizes the number of black holes.

In the scenario I discussed above the reproduction doesn’t happen by black holes but by building computer simulations. In this case then anybody living in a simulation is most likely to be living in a simulation that will go on to create another simulation. Or, to look at this from a slightly different perspective, if you want our species to continue thriving and avoid that The Programmer pulls the plug, you better work on creating artificial intelligence because this is why we’re here. You asked what’s the purpose of life? There it is. You’re welcome.

This also means you could try to test the probability of the simulation hypothesis being correct by seeing whether our universe does indeed have the optimal conditions for the creation of computer simulations.

Brain hurting? Don’t worry, it’s probably not real.

Saturday, January 03, 2015

Your g is my e – Has time come for a physics notation standard?

Standards make sure the nuts fit the bolts.
[Image Source: nutsandbolts.mit.edu]

The German Institute for Standardization, the “Deutsches Institut für Normung” (DIN), has standardized German life since 1917. DIN 18065 sets the standard for the height of staircase railings, DIN 18065 the surface of school bags to be covered with reflective stripes, and DIN 8270-2 the length of the hands of a clock. The Germans have a standard for pretty much everything from toilets to sleeping bags to funeral service.

Many of the German standards are now identical to European Standards, EN, and/or International Standards, ISO. According to DIN ISO 8610 for example the International Standard Day begins on Monday and the week has seven days. DIN EN 1400-1 certifies that a pacifier has two holes so that baby can still breathe if it manages to suck the pacifier into its mouth (it happens). The international standard DIN EN ISO 20126 assures that every bristle of your toothbrush can withhold a pull of at least 15 Newton (“Büschelauszugskraftprüfung” bristle-pull-off-force-test as the Germans call it). A lot of standards are dedicated to hardware supply and electronic appliances; they make sure that the nuts fit the bolts, the plugs fit the outlets, and the fuses blow when they should.

DIN EN 45020 is the European Standard for standards.

Where standards are lacking, life becomes cumbersome. Imagine every time you bought envelopes or folders you’d have to check they will actually fit to the paper you have. The Swedes have a different standard for paper punching than the Germans, neither of which is identical to the US American one. Filing cross-country taxes is painful for many reasons, but the punch issue is the straw that makes my camel go nuts. And let me not even get started about certain nations who don’t even use the ISO paper sizes because international is just the rest of the world.

Standards are important for consumer safety and convenience, but they have another important role which is to benefit the economic infrastructure by making reuse and adaptation dramatically easier. The mechanical engineers have figured that out a century ago, why haven’t the physicists?

During the summer I read a textbook on in-medium electrodynamics, a topic I was honestly hoping I’d never again have anything to do with, but unfortunately it was relevant for my recent paper. I went and flipped over the first 6 chapters or so because they covered the basics that I thought I know, just to then find that the later chapters didn’t make any sense. They gradually started making sense after I figured out that q wasn’t the charge and η not the viscosity.

Anybody who often works with physics textbooks will have encountered this problem before. Even after adjusting for unit and sign conventions, each author has their own notation.

Needless to say this isn’t a problem of textbooks only. I quite frequently read papers that are not directly in my research area, and it is terribly annoying having to waste time trying to decode the nomenclature. In one instance I recall being very confused about an astrophysics paper until it occurred to me that M probably wasn’t the mass of the galaxy. Yeah, haha, how funny.

I’m one of these terrible referees who will insist that every variable, constant, and parameter is introduced in the text. If you write p, I expect you to explain that it’s the momentum. (Or is it a pressure?) If you write g, I expect you to explain it’s the metric determinant. (Or is it a coupling constant? And what again is your sign convention?) If you write S, I expect you to explain it’s the action. (Or is it the entropy?)

I’m doing this mostly because if you read papers dating back to the turn of the last century it is very apparent that what was common notation then isn’t common notation any more. If somebody in a hundred years downloads today’s papers, I still want them to be able to figure out what the papers are about. Another reason I insist on this is that not explaining the notation can add substantial interpretational fog. One of my pet peeves is to ask whether x denotes a position operator or a coordinate. You can build whole theories of mixing these up.

You may wnat to dsicard this as some German maknig am eelphnat out of a muose, but think twice. You almots certainly have seen tihs adn smiliar memes that supposedly show how amazingly well the human brain is at sense-making and error correction. If we can do this, certainly we are able to sort out the nomenclature used in scientific papers. Yes, we are able to do this like you are able to decipher my garbled up English. But would you want to raed a whoel essay liek this?

The extra effort it takes to figure out somebody else’s nomenclature, even if it isn’t all that big a hurdle, creates friction that makes interdisciplinary work, even collaboration within one discipline, harder and thus discourages it. Researchers within one area often settle on a common or at least similar nomenclature, but this happens typically within groups that are already very specialized, and the nomenclature hurdle further supports this overspecialization. Imagine how much easier it would be to learn about a new subject if each paper used a standard notation or at least had a list of used notation added at the end, or in a supplement.

There aren’t all that many letters in the alphabets we commonly use, and we’d run out of letters quickly would we try to keep them all different. But they don’t need to be all different – more practical would be palettes for certain disciplines. And of course one doesn’t really have to fix each and every twiddle or index if it is explained in the text. Just the most important variables, constants, and observables would already be a great improvement. Say, that T that you are using there, does or doesn’t that include complex conjugation? And the D, is that the number of spatial coordinates only, or does it include the time-coordinate? Oh, and N isn’t a normalization but an integer, how stupid of me.

In fact, I think that the benefit, especially for students who haven’t yet seen all that many papers, would be so large that we will almost certainly sooner or later see such a nomenclature standard. And all it really takes is for somebody to set up a wiki and collect entries, then for authors to add a note that they used a certain notation standard. This might be a good starting point.

Of course a physics notation standard will only work if sufficient people come to see the benefit. I don’t think we’re quite there yet, but I am pretty sure that the day will come when some nation expects a certain standard for lecture notes and textbooks, and that day isn’t too far into the future.

Pages