Showing posts with label Statistics. Show all posts
Showing posts with label Statistics. Show all posts

Thursday, 4 June 2015

The awkward question


Joe Vitruvius and his new history in Guangzhou: The awkward question.

Sometimes it's quite difficult to get a truthful answer from people in surveys. Fortunately, some mathematical methods have been created to overcome the reluctance of people to answer some awkward questions.


(This post participates on the 123th edition of the Carnival de Mathematics, hosted by the blog Mathematical mystery tour.)

FIRST HALF

Semifinal of the Champions League AFC between Guangzhou Evergrande and Kashiwa Reysol.
Next week there's an interesting semifinal of the AFC 2015 Champions League between the Chinese Guangzhou Evergrande and the Korean Kashiwa Reysol.

Fabio Cannavaro, coach of the Chinese club, has noticed that, in the last weeks, the level of play of his team has dropped a lot.

Some people say that some players of Guangzhou go out partying at night, and that's why they don't perform well in training sessions and matches. But nobody has proved it's true.

The club has hired some private investigators to follow them, but for now the players have always got rid of the detectives' persecution.

Badge of the Chinese team Guangzhou Evergrande. A curious Spanglish word.

Fabio is worried about it, and doesn't know what to do. He has called them one by one to his office, and has directly asked each one if rumours are true. But none of them has accepted that goes out at night. And they didn't want to say how many colleagues use to do it.

Guangzhou Evergrande players posing for the press. I don't know why the two players on the left are crouching.
It seems clear that they have lied, and that they form a very close group, and that they don't want to betray each other. But Fabio needs to know whether the problem of the trips at night is widespread or not among the team.


Tonight, while dinning at a restaurant in Hong Kong, he has met Joe Vitruvius, who has come to the city to take part in a Conference of Mathematics. Fabio has commented him on his problem, and Joe thinks that he should call his players again to his office.

Hong Kong restaurant where Fabio Cannavaro meets Joe Vitruvius. What a coincidence!

- But they will answer the same again, Joe.

- Not necessarily. You can get some of them to tell the truth.

- Well, I don't see how. I don't want to threaten them, nor to offer them any kind of reward to betray their teammates.

- Well, there's a way to get them to tell the truth, without threaten nor rewarding them...

Fabio Cannavaro, coach of Guangzhou Evergrande, who is closing his eyes like if he already had Chinese eyes.Joe Vitruvius, after having lost his luggage at the airport.

Can you imagine how they will know what's happening with the players?




I calculate, therefore I am


SECOND HALF


Guangzhou Evergrande's football stadium is wonderful.
- Let's see, Fabio, which were the questions you did?

- First, I asked: Do you go out for party at night? And everyone told me 'no'.

And then I asked:  Do you know how many playmates like partying all night? And they answered me that none.

- Well, I think you should call them back one at a time and ask them the same questions.

- I don't understand, Joe. If I ask them the same questions, I will get the same answers.

- No, because now we've got a magical coin.

50 yuan Chinese coin. I think that Joe Vitruvius's coin is counterfeit.

- I've already told you that you won't be able to bribe them. And even less with a simple coin, no matter how pretty or magical it is.

- I'm sure this coin will help us to know the truth, Fabio. 

- I can't see how.

- It's very simple. We'll call them one by one to your office, and we'll propose this: we'll give them this coin, and we'll tell them to toss it secretly. 

If they get a panda bear, they must answer truthfully about whether if they go out on a spree or not.

But if they get the snake, they always have to answer that they do go out at night, regardless of the truth.

It's clear: with the tender panda they will tell the whole truth; with the evil serpent they will say in all cases they like to party. Easy, right?

They'll flip the coin again. If they get a panda, they must tell us the number of players they know going out by night, while if they get a snake, they can invent the amount, and tell any figure between 0 and 20.

We will never know, anyway, if each individual player has got a panda or a snake, that is, we won't know if he's lying or not, so those who has got a panda will have no fear of telling us the truth. Do you think they will accept this deal?

- I think so. It seems that the method doesn't compromise them.

- Here we go. Tell the first player to come in...

Fabio Cannavaro office at Guangzhou Evergrande stadium. I mean, its door.

- ...we've finally completed the survey, Joe!

- Well, now we can get a clear idea about what's happening on your team.

- Oh, yes? You'll have to explain it to me. We don't know if each player has lied or has answered truthfully.

Results of the survey carried out among the players. I hope you haven't been confused with the snake and the panda.
- Let's see the results of our survey. Regarding the first question, in which we directly asked if they go out or not, we have a total of 12 positive answers and 8 negative ones

We have a 50-50 chance of getting a panda or a snake, so the most likely is that half of the players have told the truth and half the players have lied.

This means that, about 10 players have said that they go partying at night because they have got the snake. And among the other 10 players who got the panda, and therefore had to say the truth, 2 have answered that they go out, and 8 have answered that they don't.

They are 2 from 10, that's it, a 20% probability, so in the total of 20 players of your squad, we could think that there will be only 4 players go on a spree at night.

- Yes, but if it happens to get 20 pandas by chance, then all the answers would be true, and I'd have a problem with 12 players.

Probability of getting a certain number of faces and tails when tossing a coin 20 times, even in the case that the coin is Chinese. - It's true. But look at this other table. If we calculate the probability of getting a certain number of pandas, when we toss a coin 20 times, we can see that, first, the most likely option is that we get 10 pandas, a 17.62% of the times.

And, on second place, we notice that the chance of getting 20 pandas is about a 0.0001%, that is, once every million times we do the test. In fact, there is a probability of almost 98% that we get 14 pandas at maximum.

By the way, it seems a little risky to draw conclussions from a sample of only 10 players. So we should analyze the answers to the second question, in order to check if our assumptions are correct.

Granny, why we have ordered the survey data? Just to study them better... - But here, Joe, we find the same problem. We don't know which data are true and which data are invented. In fact, we have a set of answers, quite diverse. Some have answered that no player goes out, and other have said that there are 20 revellers.

- We can work on it, Fabio. If we calculate the average of the answers we have, we get that 5.25 people like partying. But the coefficient of variation, which measures the data dispersion, is enormous. We should work hard with sample data to obtain more acceptable statistical values.

We know that about half of the responses are invented, and therefore only half of the data are reliable. So we should try to eliminate some data, to get a more realistic average.

- But, how can we separate right data from invented data? If the second answers were related to the first responses, we would know that the data from those who said that they don't go out are correct, but they tossed the coin again before answering the second questions, so we can't get any help from this point.

- That's true. I've made them flip the coin twice just in order to avoid they lie when answering the second question for those who have answered truthfully the first one and have answered that they don't party late at night, because therefore we know that they got the panda and are obliged to tell the truth.

Fortunately, in Statistics there are some methods to eliminate invalid data that can distort the average.

There are data that are not real. How can we remove them?

Some statisticians remove those values furthest from the average Average, distanced from it by a certain multiple of the standard deviation Standard deviation, and keep only the data in the interval Data interval around the average

In other cases, we can sort the data from the lowest to the highest, and remove the first and the fourth quartile, leaving only the data of the two central quartiles, closer to the median.

With either of these two options, we can see how the standard deviation drops down considerably to a somewhat more acceptable value.

Although in our case, as we only have 20 data, we must be cautious when removing some of them. By the way, we have 3 values that we can reject because they're impossible.

- And, which are they?

- If there are 8 people who say that they don't go out, it's impossible that there are 17 or 20 lively players. And there's no way that 0 players go partying because, in that case, all the players who have got the panda should have answered that they don't know anyone who goes out, and there should be several zeros, not only one, except in the very unlikely event that they got 20 snakes.

- Yes, once in a million times that we did the survey, right?

More conclusions on the survey. I promise this is the last table of the story. - OK. We could go on the process by eliminating those less likely results, the furthest from the median or the average. Even in that case we could get confused, because not all the players know what they playmates do at night.

You know that not all of them are friends of each other, and that they don't have fun all together. That way, if we eliminate, for example, the datum of 2, maybe this number has been provided by a person who was telling the truth, but it's also incorrect, because he only knows 2 people who do it, but there are much more.

As the sample we have is very small, perhaps the bias that we generate when performing a screening of the remaining data would be more prejudicial than beneficial, so we will simply make the average of the data we have so far.

- Then, at the end, which of the methods of data cleaning will be use?

- If we eliminate some data, we can see that the method with a lowest coefficient of variation is that of the central quartiles. Anyway, the average of all the methods is close to the value of 4. And this figure coincides with the result of the first question, so we can conclude that around 20% of your players goes out at night.

Therefore, you can be happy, Fabio. It's very likely that only 4 players like to go for partying!

- That's right, and this means that the poor performance can be combated with more physical and tactical sessions. Now, the last work is to convince the 4 merry players to control themselves until we have won the championship...

Chinese team players celebrating the championship. Tonight nobody will tell them not to go partying.
And, are you sure that this method we have employed is reliable?

- Of course. This ingenious method is attributed to Eduardo Cattani, an Argentinian professor of Mathematics and Statistics at the University of Massachusetts, as Adrián Paenza refers in his book “Matemática... ¿estás ahí?". Morevoer, Stanley L. Warner, an American mathematician, published in March of 1965 an article about randomized responses and survey techniques for eliminating evasive answer bias in the Journal of the American Statistical Association.

Clearly, there are certain questions on sensible iussues like drugs consumption, sexual behaviour, illegal or forbidden topics, violence, bullying, socially frowned conducts, etc., in which respondents use to reply with incorrect answers.

Joe Vitruvius in front of the Shangxiajiu pedestrian street, wondering if he'd rather dinner a hamburger or a spring roll. The one way to ensure the anonymity and the privacy, and therefore to win the confidence of the respondents, is using these techniques of randomized answers, although they don't always work, sometimes because respondents just don't understand the mechanism, sometimes because they don't trust in all the procedure, and sometimes because, despite everything, they don't respond truthfully.

In our case, as we have done two questions on the same matter, we can be more sure of the result of our enquiry.

- Great, Joe. Many thanks for all. Have a good time in China!

- Sure, Now, I go for a walk through the center of Guangzhou (Canton) to see if I meet any of my Chinese friends or perhaps one of your players.

I hope you have good luck on your next matches, Fabio. Bye!






Below this lines  you will find other links, for if you liked this story and you want to share it with your friends.

And don't forget to take a walk by the 123th Carnival of Mathematics. There you'll find lots of excellent math posts that you'll surely like too.


Tuesday, 30 September 2014

An impossible transfer

A football agent burdened by failing to make any transfer during the summer, suddenly comes across a unique opportunity. The problem is that the acquiring team and the selling team have seemingly opposing interests. Will he be able to please both sides? The Will Rogers phenomenon and the game theory.
FIRST HALF

Will Rogers phenomenon
It's been an awful summer for Klaus Händler.

Klaus is a football agent, and during the last months none of the transfers that he has negotiated has come to a successful conclusion.

Some clubs he works with have no money for signing new players because of the economic crisis, and moreover, some of the players he represents are injured.

He's thinking he'll have to sell some of his dearest Lamborghini cars to afford the expenses of his comfortable life style during this winter, while, suddenly, the phone rings.

Selection of wonderful luxury car of the marca Lamborghini

FC Schalke 04 badge
It's Josef, chairman of the FC Schalke 04:

- Klaus, just now we've got 5 forwards in our team, and we want to transfer one of them. Could you take charge of it?

Just after hanging up the phone, it rings again. This time it's Michael, chairman of the Bayer 04 Leverkusen:

Bayer 04 Leverkussen badge
- Klaus, how are you? I call you because today is the transfer deadline day, and we want to sign one additional forward...

It looks like Klaus won't have to sell his luxury cars...

But there's a problem. Both teams are interested in increasing the average goal-scoring ability of their forward line and, at the same time, reducing the average wage of their players.

Increasing the average scoring ability and reducing the average wages at a time

Klaus phones some teams, to see if it's possible any agreement, but all the teams have completed their playing staffs.

It seems clear that it's not possible to transfer a forward from Schalke to Bayer, because of the imposed conditions by both teams. So, what can Klaus do? Could we help him in some way?




SECOND HALF

Joe Vitruvius
Klaus has decided to call his friend Joe Vitruvius, to see if he can help with this apparently unsolvable problem.

- Joe, I think that if we transfer one player from Schalke 04 to Bayer Leverkussen, the goal-scoring average of Bayer will rise at the same rate as Schalke's average decreases. And, in same way, the average wage of Bayer will go up at the same rate as the Schalke's average goes down.

Everything a team loses, the other wins it

- So, you think that everything a team gets, the other loses it. That's what we know as zero-sum game in mathematical game theory. But it's not always like this. As in real life, we can find situations in which all parts win or lose (non-zero-sum games).

- Yes, but I don't think this is one of the times it works.

- Well, we can try it. You told me that both teams want to increase the average goal-scoring ability of their forward line, and at the same time, reduce their average wage, didn't you?

Increase the average goal-scoring level and reduce the average wage at the same time.

- Yes, that's right.

- if we talk in absolute values, there's no way of doing it. But as we talk about averages, it may be possible. Can you give me the data of goals and salaries of the forwards of the two teams?

- Yes, of course. Here you are.

Salaries and goals of Schalke 04's forwards

Salaries and goals of Bayer 04 Leverkussen's forwards

- Well, let's calculate the means of goals and wages of both forward lines.

Data and averages of wages and goals of the Schalke 04's forwards
Data and averages of wages and goals of the Bayer 04 Leverkussen's forwards
- Now we can verify if there's a solution for your problem. We only have to apply the Will Rogers phenomenon.
Will Rogers, an American humorist
- And... what's that about?

- This paradox is related to a comment that Will Rogers, an American artist, said in the early 20th century: “When the Okies left Oklahoma and moved to California, they raised the average intelligence level in both states".

- He meant that when the citizens from Oklahoma move to California are the less intelligent persons of their state, even so they're more intelligent than California inhabitants, right?

Moving from Oklahoma to California

- That's right. This way, any inhabitant from Oklahoma whose intelligence is under the state average, that moves to California, makes that the average intelligence levels of both states experience an increment.

- And how can we apply it to our case?

- Well. Let's look at the Schalke 04 player, the Peruvian Jefferson Farfán. His goal-scoring ability is smaller than the team's average, but his salary is higher than his teammates average. However, he scores more goals than Bayer Leverkussen's forwards, and earns less money than them.

Comparison of the salary, and goal-scoring capacity of Jefferson Farfán vs. Schalke 04 levels
Comparison of the salary, and goal-scoring capacity of Jefferson Farfán vs. Schalke 04 Bayer Leverkussen levels

- I understand. If we transfer the player from one team to the other, assuming that he will earn the same, and that he will score the same goals, both teams will improve their statistics.

- That's right. Let's see how it works on the table.

Average wages and goal-scorings of Schalke 04's forwards after the transfer of Jefferson Farfán
Average wages and goal-scorings of Bayer 1904 Leverkussen's forwards after the transfer of Jefferson Farfán

- Indeed, both teams have improved their average goal-scoring level, and have reduced their average wages. And does this happen always?
Succulent and tipical German food
- No, not ever. But sometimes it does, like in this case.

- Well, you've just solved my economics this winter. I would have to invite you to a good meal

- I think this will cost you more than a simple meal. I prefer we do the following deal: you'll pay me in function of the goals that Jefferson scores this season. If he scores only one goal, you'll pay me 2 bitcoins. If he scores 2, you'll double the amount, 4 bitcoins. If he scores 3 goals, I will receive the double, that is, 8 bitcoins. And so on.

- That seems a fair deal. I agree.

Jefferson Farfán scoring a goal for Bayer Leverkussen.

The transfer was done. Schalke sold one of its excedent players, Bayer signed an extraordinary forward, and Klaus didn't have to get rid of his precious Lamborghini automobiles. At least during this winter, because Farfán scored 25 goles for his new team…

Can you calculate how many bitcoins Joe Vitruvius received from Klaus Händler at the end of the football season?

Some bitcoins, one of the internet virtual coins

If you're interested in learning more about this topic of the Will Rogers phenomenon and about the game theory, you can visit any of this magnific articles: The "Will Rogers Phenomenon" lets you save lives by doing nothingThe Will Rogers phenomenonGame Theory and the Nash EquilibriumEvolutionary Game TheoryAdvanced Game Theory Overview,

Below this lines  you will find other links, for if you liked this story and you want to share it with your friends.

And don't forget to take a walk by the Carnival of Mathematics. There you'll find lots of excellent math posts that you'll surely like too.