www射-国产免费一级-欧美福利-亚洲成人福利-成人一区在线观看-亚州成人

US EUROPE AFRICA ASIA 中文
China / View

Better manage risks inherent in Big Data

By Ernest Davis (China Daily) Updated: 2017-02-13 08:36

In the last 15 years, we have witnessed an explosion in the amount of digital data available - from the Internet, social media, scientific equipment, smart phones, surveillance cameras, and many other sources - and in the computer technologies used to process it. "Big Data", as it is known, will undoubtedly deliver important scientific, technological, and medical advances. But Big Data also poses serious risks if it is misused or abused.

But having more data is no substitute for having high-quality data. For example, a recent article in Nature reports that election pollsters in the United States are struggling to obtain representative samples of the population, because they are legally permitted to call only landline telephones, whereas Americans increasingly rely on cellphones. And while one can find countless political opinions on social media, these aren't reliably representative of voters, either. In fact, a substantial share of tweets and Facebook posts about politics are computer-generated.

A Big Data program that used this search result to evaluate hiring and promotion decisions might penalize black candidates who resembled the pictures in the results for "unprofessional hairstyles," thereby perpetuating traditional social biases. And this isn't just a hypothetical possibility. Last year, a ProPublica investigation of "recidivism risk models" demonstrated that a widely used methodology to determine sentences for convicted criminals systematically overestimates the likelihood that black defendants will commit crimes in the future, and underestimates the risk that white defendants will do so.

Another hazard of Big Data is that it can be gamed. When people know that a data set is being used to make important decisions that will affect them, they have an incentive to tip the scales in their favor. For example, teachers who are judged according to their students' test scores may be more likely to "teach to the test," or even to cheat.

Similarly, college administrators who want to move their institutions up in the US News and World Reports rankings have made unwise decisions, such as investing in extravagant gyms at the expense of academics. Worse, they have made grotesquely unethical decisions, such as the effort by Mount Saint Mary's University to boost its "retention rate" by identifying and expelling weaker students in the first few weeks of school.

A third hazard is privacy violations, because so much of the data now available contains personal information. In recent years, enormous collections of confidential data have been stolen from commercial and government sites; and researchers have shown how people's political opinions or even sexual preferences can be accurately gleaned from seemingly innocuous online postings, such as movie reviews - even when they are published pseudonymously.

Finally, Big Data poses a challenge for accountability. Someone who feels that he or she has been treated unfairly by an algorithm's decision often has no way to appeal it, either because specific results cannot be interpreted, or because the people who have written the algorithm refuse to provide details about how it works. And while governments or corporations might intimidate anyone who objects by describing their algorithms as "mathematical" or "scientific," they, too, are often awed by their creations' behavior. The European Union recently adopted a measure guaranteeing people affected by algorithms a "right to an explanation"; but only time will tell how this will work in practice.

When people who are harmed by Big Data have no avenues for recourse, the results can be toxic and far-reaching, as data scientist Cathy O'Neil demonstrates in her recent book Weapons of Math Destruction.

The good news is that the hazards of Big Data can be largely avoided. But they won't be unless we zealously protect people's privacy, detect and correct unfairness, use algorithmic recommendations prudently, and maintain a rigorous understanding of algorithms' inner workings and the data that informs their decisions.

The author is a professor of computer science at the Courant Institute of Mathematical Sciences, New York University.

Project Syndicate

Highlights
Hot Topics

...
主站蜘蛛池模板: 日韩一区二区三区在线视频 | 黄 色 三 级 网站 | 亚洲精品久久久久久久福利 | 欧美日韩精品高清一区二区 | 国产香蕉在线视频一级毛片 | 九九免费精品视频在这里 | 美国一级毛片免费 | se94se最新网站| 大狠狠大臿蕉香蕉大视频 | 亚洲成人天堂 | 久久久久国产精品免费看 | 性视频网站在线 | 亚洲天堂2016 | 久久道| 毛片免费在线观看网址 | 久久久综合视频 | 久久久影院亚洲精品 | 4438全国最大成人网视频 | 亚洲无线一二三区2021 | 中国一级特黄剌激爽毛片 | 亚洲欧美日韩成人一区在线 | 免费观看欧美一级片 | 欧美午夜免费一级毛片 | 黄色片亚洲 | xh98hx国产免费 | 国产精品吹潮在线播放 | 1717she国产精品免费视频 | 三级视频网站在线观看 | 午夜剧场成年 | 亚洲欧美日韩久久精品第一区 | 午夜影院黄 | 亚洲精品一区二区三区在线观看 | 五月久久亚洲七七综合中文网 | 老色歌uuu26 老师张开腿让我爽了一夜视频 | 91精品国产高清久久久久 | 九九视频在线观看视频 | 九九视频在线播放 | 免费在线看a| 日本www色视频成人免费网站 | 欧美日韩一区二区三区在线观看 | 国产亚洲一欧美一区二区三区 |