Dancing David
Penultimate Amazing
Hopefull gudielines for this thread.
1. Please try to stay on topic, if you bring in material it should be directly related to the topic at hand. ( I know I am great derailer!)
2. Please do not spam the thread with multiple links that are unrelated to the topic, discussions of Dark Matter, Electric Universe, Plasma Cosmology or attacks on the same should be limited and relevamt to the thread.
3. Please do not use ad homs, character slurs or accuse people of reading comprehension problems, or attack spelling and grammar...
I would prefer to keep this thread open and unmoderated, I will request splits as needed. I also encourage people to notify mods of any posts that are rude or disruptive.
Thanks!
Now I want to say that I feel that Halton Arp is a great man, he has presented his ideas in a coherent fashion and at great odds, he is a respected astronomer who has gone against the grain and there is a chance he is right and will be vindicated. He made a great survey of galaxies that we now think to be tidally disrupted galaxies and they are some very cool objects.
However he asserts repeatedly that there is a correlation between his Arp catalog objects and QSOs , he is the one that made the claim and it is up to him and his supporters to explain and defend it.
Arp catalog, QSO association and Statistics
Halton Arp was looking at tidally distorted galaxies and felt that he noticed a correlation between these ‘Arp galaxies’ and QSO more commonly called quasars.
Here is a Google search for “Arp QSO” and many articles on this subject:
http://www.google.com/search?source=ig&hl=en&rlz=1G1GGLQ_ENUS265&q=Arp+QSO&btnG=Google+Search
or
http://www.google.com/search?hl=en&rlz=1G1GGLQ_ENUS265&q=QSO+Arp&btnG=Search
It is my contention that when the number of QSOs known was very small that these samples may have had vague meaning. However the number of QSOs now known is much much larger.
Here is a basic introduction to statistics and the concepts:
http://www.statsoft.com/textbook/stathome.html?sttable.html&1
and here is a quote:
I added the italics to the sentence that is relevant to my argument.
At the time Arp started his study there were very few on QSOs and the Arp catalog objects very few QSOs were known. ( I may try to address Poisson statistics later,
http://en.wikipedia.org/wiki/Poisson_distribution
because they used this statistical format simply because of the low sample of QSO objects at the time.)
However there are now a huge number of known QSOs. So a more simple and more accurate sample method may be used where there is the:
-QSO/Arp object group
-QSO/non Arp galaxy group
-QSO/random spot group.
So now there is the description of the strength of the correlation and the significance of the sample size.
At the time that Arp made these studies there was a very small sample size of QSOs and therefore there are a number of reasons there could be sampling error in the correlations that he said exist.
Therefore I contend that the significance of the correlation would only be meaningful when compared as a triple group study:
-QSOs/Arp objects
-QSOs/non Arp galaxies
-QSOs/random spot
to just say that there is significance from the single group of ‘QSO/Arp objects’ is not meaningful when the significance stands alone.
Now we get to why comparison between groups is important “Sampling Error”:
http://sestat.nsf.gov/docs/stderr10.html
Now wait you say, Arp did not ask the Arp catalog objects how they felt about the QSOs, this can’t be a survey!
But there is not an experimental group and a control group in this case as well. What Arp did was a survey none the less. He observed member of a population and claimed that there was a statistically relevant relation between QSOs and Arp catalog objects. However due to the knowledge constraints at the time, he used a very limited sample.
So I am not saying that Arp willfully made a sampling error but sampling error he did make, based upon the limited set of QSO available.
So to say that there is statistical significance for the Arp catalog and QSOs does not have meaning any more, it can be very subject sampling error. And that needs to be controlled for by taking a census:
-QSO/non Arp galaxies
-QSO/random spot
Then the correlation between QSO/Arp objects could have meaningful significance , if it rises above the level in the other two (census type) surveys at a rate higher than the standard deviation for the two census groups.
Possible objections:
1. This is not a peer reviewed paper, nor does it have to be. There is a serious methodological flaw in the assertion that the QSO/Arp objects correlation has meaning, it is the burden of the people making the assertion to address this flaw. Albeit it was not an intentional flaw but there is a he potential for sampling error. The burden of proof is on the one making the assertion.
2. That I should take the time to research this myself and publish the results, again not needed, I am not the one making the assertion, the burden of proof is on the one making the assertion.
3. I am not the one making the assertion that there is significance to the survey of QSOs/Arp objects, Arp, Burbidge and others are, there is a methodological flaw in the assertion that the correlation has meaning. It is upon them to address their methodology.
1. Please try to stay on topic, if you bring in material it should be directly related to the topic at hand. ( I know I am great derailer!)
2. Please do not spam the thread with multiple links that are unrelated to the topic, discussions of Dark Matter, Electric Universe, Plasma Cosmology or attacks on the same should be limited and relevamt to the thread.
3. Please do not use ad homs, character slurs or accuse people of reading comprehension problems, or attack spelling and grammar...
I would prefer to keep this thread open and unmoderated, I will request splits as needed. I also encourage people to notify mods of any posts that are rude or disruptive.
Thanks!

Now I want to say that I feel that Halton Arp is a great man, he has presented his ideas in a coherent fashion and at great odds, he is a respected astronomer who has gone against the grain and there is a chance he is right and will be vindicated. He made a great survey of galaxies that we now think to be tidally disrupted galaxies and they are some very cool objects.
However he asserts repeatedly that there is a correlation between his Arp catalog objects and QSOs , he is the one that made the claim and it is up to him and his supporters to explain and defend it.
Arp catalog, QSO association and Statistics
Halton Arp was looking at tidally distorted galaxies and felt that he noticed a correlation between these ‘Arp galaxies’ and QSO more commonly called quasars.
Here is a Google search for “Arp QSO” and many articles on this subject:
http://www.google.com/search?source=ig&hl=en&rlz=1G1GGLQ_ENUS265&q=Arp+QSO&btnG=Google+Search
or
http://www.google.com/search?hl=en&rlz=1G1GGLQ_ENUS265&q=QSO+Arp&btnG=Search
It is my contention that when the number of QSOs known was very small that these samples may have had vague meaning. However the number of QSOs now known is much much larger.
Here is a basic introduction to statistics and the concepts:
http://www.statsoft.com/textbook/stathome.html?sttable.html&1
and here is a quote:
Dependent vs. independent variables. Independent variables are those that are manipulated whereas dependent variables are only measured or registered. This distinction appears terminologically confusing to many because, as some students say, "all variables depend on something." However, once you get used to this distinction, it becomes indispensable. The terms dependent and independent variable apply mostly to experimental research where some variables are manipulated, and in this sense they are "independent" from the initial reaction patterns, features, intentions, etc. of the subjects. Some other variables are expected to be "dependent" on the manipulation or experimental conditions. That is to say, they depend on "what the subject will do" in response. Somewhat contrary to the nature of this distinction, these terms are also used in studies where we do not literally manipulate independent variables, but only assign subjects to "experimental groups" based on some pre-existing properties of the subjects. For example, if in an experiment, males are compared with females regarding their white cell count (WCC), Gender could be called the independent variable and WCC the dependent variable.
I added the italics to the sentence that is relevant to my argument.
At the time Arp started his study there were very few on QSOs and the Arp catalog objects very few QSOs were known. ( I may try to address Poisson statistics later,
http://en.wikipedia.org/wiki/Poisson_distribution
because they used this statistical format simply because of the low sample of QSO objects at the time.)
However there are now a huge number of known QSOs. So a more simple and more accurate sample method may be used where there is the:
-QSO/Arp object group
-QSO/non Arp galaxy group
-QSO/random spot group.
Why stronger relations between variables are more significant. Assuming that there is no relation between the respective variables in the population, the most likely outcome would be also finding no relation between those variables in the research sample. Thus, the stronger the relation found in the sample, the less likely it is that there is no corresponding relation in the population. As you see, the magnitude and significance of a relation appear to be closely related, and we could calculate the significance from the magnitude and vice-versa; however, this is true only if the sample size is kept constant, because the relation of a given strength could be either highly significant or not significant at all, depending on the sample size (see the next paragraph).
Why significance of a relation between variables depends on the size of the sample. If there are very few observations, then there are also respectively few possible combinations of the values of the variables, and thus the probability of obtaining by chance a combination of those values indicative of a strong relation is relatively high. Consider the following illustration. If we are interested in two variables (Gender: male/female and WCC: high/low) and there are only four subjects in our sample (two males and two females), then the probability that we will find, purely by chance, a 100% relation between the two variables can be as high as one-eighth. Specifically, there is a one-in-eight chance that both males will have a high WCC and both females a low WCC, or vice versa. Now consider the probability of obtaining such a perfect match by chance if our sample consisted of 100 subjects; the probability of obtaining such an outcome by chance would be practically zero. Let's look at a more general example. Imagine a theoretical population in which the average value of WCC in males and females is exactly the same. Needless to say, if we start replicating a simple experiment by drawing pairs of samples (of males and females) of a particular size from this population and calculating the difference between the average WCC in each pair of samples, most of the experiments will yield results close to 0. However, from time to time, a pair of samples will be drawn where the difference between males and females will be quite different from 0. How often will it happen? The smaller the sample size in each experiment, the more likely it is that we will obtain such erroneous results, which in this case would be results indicative of the existence of a relation between gender and WCC obtained from a population in which such a relation does not exist.
So now there is the description of the strength of the correlation and the significance of the sample size.
At the time that Arp made these studies there was a very small sample size of QSOs and therefore there are a number of reasons there could be sampling error in the correlations that he said exist.
Therefore I contend that the significance of the correlation would only be meaningful when compared as a triple group study:
-QSOs/Arp objects
-QSOs/non Arp galaxies
-QSOs/random spot
to just say that there is significance from the single group of ‘QSO/Arp objects’ is not meaningful when the significance stands alone.
What is "statistical significance" (p-value). The statistical significance of a result is the probability that the observed relationship (e.g., between variables) or a difference (e.g., between means) in a sample occurred by pure chance ("luck of the draw"), and that in the population from which the sample was drawn, no such relationship or differences exist. Using less technical terms, one could say that the statistical significance of a result tells us something about the degree to which the result is "true" (in the sense of being "representative of the population"). More technically, the value of the p-value represents a decreasing index of the reliability of a result (see Brownlee, 1960). The higher the p-value, the less we can believe that the observed relation between variables in the sample is a reliable indicator of the relation between the respective variables in the population. Specifically, the p-value represents the probability of error that is involved in accepting our observed result as valid, that is, as "representative of the population." For example, a p-value of .05 (i.e.,1/20) indicates that there is a 5% probability that the relation between the variables found in our sample is a "fluke." In other words, assuming that in the population there was no relation between those variables whatsoever, and we were repeating experiments like ours one after another, we could expect that approximately in every 20 replications of the experiment there would be one in which the relation between the variables in question would be equal or stronger than in ours. (Note that this is not the same as saying that, given that there IS a relationship between the variables, we can expect to replicate the results 5% of the time or 95% of the time; when there is a relationship between the variables in the population, the probability of replicating the study and finding that relationship is related to the statistical power of the design. See also, Power Analysis. In many areas of research, the p-value of .05 is customarily treated as a "border-line acceptable" error level.
Now we get to why comparison between groups is important “Sampling Error”:
http://sestat.nsf.gov/docs/stderr10.html
Sampling errors (the focus of this presentation) occur when estimates are derived from a sample rather than a census of the population. The sample used for a particular survey is only one of a large number of possible samples of the same size and design that could have been selected. Even if the same questionnaire and instructions were used, the estimates from each sample would differ from the others. This difference, termed sampling error, occurs by chance, and its variability is measured by the standard error associated with a particular survey.
Now wait you say, Arp did not ask the Arp catalog objects how they felt about the QSOs, this can’t be a survey!
But there is not an experimental group and a control group in this case as well. What Arp did was a survey none the less. He observed member of a population and claimed that there was a statistically relevant relation between QSOs and Arp catalog objects. However due to the knowledge constraints at the time, he used a very limited sample.
So I am not saying that Arp willfully made a sampling error but sampling error he did make, based upon the limited set of QSO available.
So to say that there is statistical significance for the Arp catalog and QSOs does not have meaning any more, it can be very subject sampling error. And that needs to be controlled for by taking a census:
-QSO/non Arp galaxies
-QSO/random spot
Then the correlation between QSO/Arp objects could have meaningful significance , if it rises above the level in the other two (census type) surveys at a rate higher than the standard deviation for the two census groups.
Possible objections:
1. This is not a peer reviewed paper, nor does it have to be. There is a serious methodological flaw in the assertion that the QSO/Arp objects correlation has meaning, it is the burden of the people making the assertion to address this flaw. Albeit it was not an intentional flaw but there is a he potential for sampling error. The burden of proof is on the one making the assertion.
2. That I should take the time to research this myself and publish the results, again not needed, I am not the one making the assertion, the burden of proof is on the one making the assertion.
3. I am not the one making the assertion that there is significance to the survey of QSOs/Arp objects, Arp, Burbidge and others are, there is a methodological flaw in the assertion that the correlation has meaning. It is upon them to address their methodology.
Last edited: