Pentagon Cache of Over 1.8 Billion Scraped Social Media Posts Left Unsecured on Amazon Server
The data archive was
According to UpGuard's report, the data appeared to have "an emphasis on Arabic, Farsi (spoken in Iran and Afghanistan), and a number of Central and South Asian dialects spoken in Afghanistan and Pakistan," suggesting that the program was related to U.S. military and intelligence operations in Central Asia. However, as seen below, the Pentagon program apparently also swept up public posts by Americans, including posts airing their political views.
As noted by Ars Technica, the data may have been used to fuel Outpost, which posts by former VendorX employees described as a "multi-lingual platform designed to positively influence change in high-risk youth in unstable regions of the world."
Military officials say the program in question was conducted using "commercial off-the-shelf programs" and downplayed its collection of posts by U.S. citizens, a practice that is potentially illegal and yet another reminder of just how casually authorities can spy on the public. The government also disputed that the program was particularly interesting or scandalous.
"Once alerted to the unauthorized access, CENTCOM implemented additional security measures to prevent unauthorized access," CENTCOM spokesman Major Josh Jacques told PC Mag . "... The information you are asking about is not sensitive information. It is not collected nor processed for any intelligence purposes."
Large data archives leaking online because of substandard security have become a major point of concern in 2017, especially following the recent leak of over 145 million Americans' sensitive personal information from credit history company Equifax. In this case, changing a simple privacy setting would have kept the files hidden from public view. UpGuard has previously found data caches from Viacom, Verizon, Tigerswan, Dow Jones, Deep Root Analytics, and Booz Allen similarly unsecured.
[ PC Mag ]