While researching on the current massive meteorite strike in Russia this morning, one deep web resource has shown how useful it is for extracting out videos for local OSInt.
It is being added to the deep web list
While researching on the current massive meteorite strike in Russia this morning, one deep web resource has shown how useful it is for extracting out videos for local OSInt.
It is being added to the deep web list
The title says it all. We’re going to try to give a meaningful probability of the Turks imposing a No Fly zone for the Syrian Air Force over part or all of Syria.
Now usually, when people here the word statistics, they seek out the nearest bathroom and try to hang themselves. But hold on, my curious friend, this is going to be an interesting trip…
Shallow and Deep Web research pulls up these current factors-
After each item item I have 2 numbers – P(e|h) = the event happens given Turkey plans a no flight zone & P(e|nh) that the event happens given Turkey does not plan a no-fly zone
Factoring all of these into an intuitive estimate I currently give the estimate to be a 40% probability of Turkey imposing a military no fly zone for the Syrian AF in the next 60 days. This is just a gut instinct number. We’ll compare it to real math later.
I’ve also got to be careful not to fall under the intel analyst fallacy of ‘recency’ which is to overweight recent evidence like the civil air no-fly ban. Which I really really think should be heavily weighed.
Again, this part is where an good analysts expert opinion really helps the accuracy. I’m not an intel analysis, I just play one on TV.
>> What we seek! – P(h|e) The probability that h (our hypothesis) is true GIVEN (the | means ‘given’) that the evidence/event has happened.
P(h)= That probability the Turks will impose a no fly zone on the Syrian AF in the next 90 days.
P(e) = The probability that the event/evidence would happen whether the hypothesis is true or not.
P(e|h) = The probability that the evidence would happen given the hypothesis is true.
I’m going to put my neck out on a limb and put down an estimate of P(h) of the probability Turkey would make a no fly zone in any conflict. This is without knowing any current events, just that a conflict is underway. If I had time, I’d go back and research all conflicts Turkey has fought since 1960. Then basically count the times it has done a no-fly zone divided by conflicts fought. In this case, I’ll just throw down 10% as a SWAG. Thus P(h)=0.10
Basic Bayes theorem is
P(h|e)= (P(h) * P(e|h)) / P(e)
For prediction purposes and ease of use, the intel community has reordered the thing for easy updating to:
P(h|e)/P(nh|e)=[P(h)/P(nh) x P(e|h)/P(e|nh)]
and replace the awkward equation with symbols to get
R = P x L
So P is merely P(h)/P(nh) = 0.1/ (1-0.1) = 0.1111
Just to be clear here, my estimation of a no fly during a conflict without any events is a swag of 0.1 — therefore the P(nh) is (1-0.1)=0.9 because they either DO impose a no-fly or they DON’T. It has to add up to 1.0.
where L= P(e|h)/P(e|nh) which, as you notice is a ratio (Called the Likelihood). And you may notice that those numbers are estimated up top under my section called Factors.
Now here we go. I’m going to compute all the ratios under the section Factors in order from #1 – #7
0.8/0.85 x 0.2/0.2 x 0.7/0.5 x 0.8/0.4 x 0.85/0.8 x 0.85/0.65 x 0.8/0.5 = 5.858
So back to R = P x L
R = 0.1111 x 5.858
R = 0.6508
Remember R = P(h|e)/P(nh|e) is just a ratio aka Odds. This is saying there is a ~2/3:1 chance that Turkey will put the kabosh on Assad.
So don’t think this is a percentage aka 65%. Nope, that is wrong. The answer could just as easy been 2.23 and you can’t say 223% (not only a no-fly zone over Syria but Iran, too!). Therefore, we must convert to a percentage
Percentage = odds/ (odds +1)
= 0.6508/(0.6508 +1)
= 39.4233 % and to compensate for significant figures (we’re making guesses, duh) we round to 1 decimal.
= 40 % chance that Turkey will impose a no-fly zone inside Syria on Syria’s air force. Take note that my intuitive estimate was 40% so it passed my reality check. But the cool thing here, is analyst can argue not about the final number, but the P(e|h) & P(e|nh) given to each factor. It becomes a sort of accounting and audit method for predictions.
The cool thing about Bayes, is if more Factors come to light we can update our Ratio.
So, I reach out to all interested parties. If you think we should change some of our probabilities on the Factors, then put a comment below. If a new factor arises, post your P(e|h) & P(e|nh) for that factor.
+++++++++++++++
I’ve been give lots of feedback to take in new factors. Main flaw is that I have ignored negative factors, so I will update my Likelihood factor.
Last left at L=5.858
new R= 0.11 x 5.858x .12/.4 x .4/.6
R=0.21479
Percentage = .21470/(1+.21479) = 17%
With 1 sig fig it rounds to 20% which I think is more believable.
++++++++++++++++++++
Last R=0.21479 call it R1
R = R1 x .4/.5 x .4/.45 x .8/.85
= 0.14375
Percentage R/(1+R) = 12.56% and for 1 sig fig becomes…
= 10% chance Turkey will impose no-fly zone on Syria in the next 90 days.
FTR – Now would be a good time to review all the factors and rank the ones that affect R the most to the least. In this way, I could update my percentage of P() to more valid numbers and reduce the bias caused by recency on some events. ie. Syria closing it’s airspace to Turkish flights was given P(e|h) = .80 & P(e|nh)=.40 which I believe now to be too much weight on that event.
Now I see that my original gut instinct was way to high. But this one seems a bit low. However, Bayesian forecasting is proven to be better than the expert who gives the P() for factors. And I am definitely no expert on Syria.
Some brave, bold academic decided to stand up to a commercial pay-fee deep website that holds academic articles. The result you will read about below, but in essence, bad things happened.
This highlights a moral battle. There is a a great deal of financial gain to the gatekeepers of wonderful data behind fee-pay portals of Deep Web sites. Controversy erupts when the data can be argued to be public domain.
From Aljazeera News –
On July 19, 2011, Aaron Swartz, a computer programmer and activist, was arrested for downloading 4.8 million academic articles. The articles constituted nearly the entire catalogue of JSTOR, a scholarly research database. Universities that want to use JSTOR are charged as much as $50,000 in annual subscription fees.
Individuals who want to use JSTOR must shell out an average of $19 per article. The academics who write the articles are not paid for their work, nor are the academics who review it. The only people who profit are the 211 employees of JSTOR.
Swartz thought this was wrong. The paywall, he argued, constituted “private theft of public culture”. It hurt not only the greater public, but also academics who must “pay money to read the work of their colleagues”.
For attempting to make scholarship accessible to people who cannot afford it, Swartz is facing a $1 million fine and up to 35 years in prison. The severity of the charges shocked activists fighting for open access publication. But it shocked academics too, for different reasons.
http://www.aljazeera.com/indepth/opinion/2012/10/20121017558785551.html
http://www.strategypage.com/htmw/htintel/articles/20120917.aspx
From Strategypage – September 17, 2012: Last July the U.S. Army issued a manual, Army Techniques Publication 2-22.9, on using open source (mainly searching the Internet) intelligence. Also called OSINT, the troops have been using the Internet for intelligence work for over a decade. The publication of ATP 22.9 is a way for the senior army leadership to say, “message received and understood.” ATP 22.9, despite all the useful tips it contains, won’t go far in helping the many soldiers already using the Internet but it will be useful in convincing their bosses that much useful stuff can be obtained from the Internet.
While the U.S. intelligence community officially recognized the importance of OSINT (Open Source Intelligence) back in 2004, it was years before there was a lot of enthusiasm from the top brass for using this growing source of information.
The Internet has made OSINT a really, really huge source of useful intelligence. It’s not just the millions of gigabytes of information that is placed on the net but the even more voluminous masses of message board postings, blogs, emails, and IMs (instant messaging) that reveal what the culture is currently thinking. It was corporate intelligence practitioners who alerted the government intel people to the growing usefulness of Internet based data. Corporations have developed, over the last few decades, a keen interest in gathering intel on competitors, new markets, and all manner of things that might affect them. This “competitive intelligence” (or corporate spying) became big business. The Internet has made this a much more useful exercise.
However, corporate intel specialists were concerned that government agencies, especially the CIA, were not taking sufficient advantage of OSINT. Part of the problem was cultural. The intelligence agencies have always been proud of their special intel tools, like spy satellites, electronic listening stations, and spy networks. Most of these things are unique to government intelligence operations. People who use this stuff tend to look down on a bunch of geeks who simply troll the web. Even when the geeks keep coming up with valuable stuff, they don’t get any respect. The fear was that some foreign countries were exploiting OSINT more effectively than the United States. No foreign intel agency will admit to this but there are indications that some nations are mining the Internet quite intensively and effectively.
This fear grew as China, Russia, and other nations were caught using the Internet for direct espionage (hacking into other nations’ networks). While examining that threat it was discovered that the heavy use of OSINT was part of the hacking operations. Thus over the last five years the CIA and other major intel agencies got more enthusiastic about OSINT, and this made it easier for Internet-savvy army leaders to get ATP 22.9 into print.
j
Ok, so maybe I missed a few lectures in college. Perhaps quite a few. But via deep web research, I’ve found an excellent deep web science lecture resource for high authority, peer reviewed, lectures covering science and technology. Known at videolectures.net and provided free of charge to the public. The library contains 15,000 lectures of which 66% are in English. Provided to us for free courtesy of the Center for Knowledge Transfer subsidized by Slovenia. Hats off to those guys.