티스토리 뷰
[Annotated] Advances in AI are used to spot signs of sexuality
af334 2017. 9. 11. 01:45Machines that read faces are coming
Modern artificial intelligence is much feted 1. But its talents boil down to 2 a superhuman ability to spot patterns in large volumes of data 3. Facebook has used this ability to produce maps of poor regions in unprecedented detail, with an AI system that has learned what human settlements 4 look like from satellite pictures 5. Medical researchers have trained AI in 6smartphones to detect cancerous lesions 7 8; a Google system can make precise guesses about the year a photograph was taken, simply because it has seen more photos than a human could ever inspect 9, and has spotted patterns that no human could.
AI’s power to pick out patterns 10 is now turning to more intimate matters 11. Research at Stanford University by Michal Kosinski and Yilun Wang has shown that machine vision can infer sexual orientation 12 13 by analyzing people’s faces. The researchers suggest the software does this by picking up on subtle differences in facial structure 14 15. With the right data sets 16, Dr Kosinski says, similar AI systems might be trained to spot other intimate traits 17 18, such as IQ or political views 19. Just because humans are unable to see the signs in faces does not mean that machines cannot do so.
The researchers’ program, details of which are soon to be published in the Journal of Personality and Social Psychology, relied on 20130,741 images of 36,630 men and 170,360 images of 38,593 women downloaded from 21a popular American dating website, which makes its profiles public. Basic facial-detection technology 22 was used to select all images which showed a single face of sufficient size and clarity to subject to analysis 23. This left 2435,326 pictures of 14,776 people, with gay and straight, male and female, all represented evenly 25 26.
Out of the numbers
The images were then fed into a different piece of software 27 called VGG-Face, which spits out a long string of numbers to represent each person; their “faceprint 28”. The next step was to use a simple predictive model 29, known as logistic regression 30, to find correlations between 31the features of those faceprints and their owners’ sexuality (as declared on the dating website). When the resulting model was run on data which it had not seen before, it far outperformed humans at distinguishing between 32 33 gay and straight faces.
When shown one photo each of a gay and straight man, both chosen at random 34, the model distinguished between them correctly 81% of the time. When shown five photos of each man, it attributed sexuality correctly 91% of the time. The model 35performed worse with women, telling gay and straight apart with 71% accuracy after 36 looking at one photo, and 83% accuracy after five. In both cases the level of performance far outstrips human ability to 37make this distinction 38. Using the same images, people could tell gay from 39straight 61% of the time for men, and 54% of the time for women. This aligns with 40research which suggests humans can determine sexuality from faces at only just better than chance.
Dr Kosinski and Mr Wang offer a possible explanation for 41 their model’s performance. As fetuses develop in the womb 42, they are exposed to various levels of hormones, in particular testosterone. These are known to play a role in developing facial structures 43 44, and may similarly be involved in determining sexuality. The researchers suggest their system can pick up subtle signals of the latter 45 from the former 46. Using other techniques, the program was found to pay most attention to the nose, eyes, eyebrows, cheeks, hairline 47 and chin for determining male sexuality; the nose, mouth corners, hair and neckline 48 were more important for women.
The study has limitations. Firstly, images from a dating site are likely to be particularly revealing of sexual orientation. The 91% accuracy rate 49 only applies 50 when one of the two men whose images are shown is known to be gay. Outside the lab the accuracy rate would be much lower. To demonstrate this weakness, the researchers selected 1,000 men at random with at least five photographs, but in a ratio of 51 gay to straight that more accurately reflects the real world 52; approximately seven in every 100. When asked to select the 100 males most likely to be gay, only 47 of those chosen by the system actually were, meaning that the system ranked some straight men as 53 more likely to be gay than men who actually are.
However, when asked to pick out 54the ten faces it was most confident about, nine of the chosen were in fact gay. If the goal is to pick a small number of people who are very likely to be gay out of a large group, the system appears able to do so. The point is not that Dr Kosinski and Mr Wang have created software which can reliably determine 55gay from straight. That was not their goal. Rather, they have demonstrated that such software is possible.
To calculate the selves of 56 others 57
Dr Kosinski is no stranger to controversial research. He invented psychometric profiling 58 59using Facebook data, which relies upon information in 60 a person’s profile to model their personality 61. The Trump campaign used similar models during last year’s presidential campaign to target voters 62, an approach which has generated criticism 63.
Dr Kosinski says he conducted the research as a demonstration, and to warn policymakers of the power of machine vision 64. It makes further erosion of privacy “inevitable”; the dangers must be understood, he adds. Spouses 65 might seek to know what 66sexuality-inferring 67 software says about their partner (the word “gay” is 10% more likely to complete searches that begin “Is my husband…” than the word “cheating”). In parts of the world where being gay is socially unacceptable, or illegal, such software could pose a serious threat to safety 68 69. Dr Kosinski is at pains to make clear that 70he has invented no new technology, merely bolted together software and data that are 71readily available to 72 anyone with an internet connection. He has asked The Economist not to reveal the identity of the dating website he used, in order to discourage copycats 73.
It is true that anyone wishing to replicate 74 Dr Kosinski’s work to determine intimate traits from faces will face significant challenges in applying laboratory science to the outside world 75. But they will be helped by ever-growing volumes of data 76 and improving algorithms. “The latter, over time, inevitably win,” says Alessandro Acquisti of Carnegie Mellon University, who has shown that an individual’s social security number can be discovered using face recognition and online information. For those with secrets to keep, all this is bad news.
- fete ; [타동사][VN] [주로 수동태로] (격식) (공식적으로) 환영[환대]하다 [본문으로]
- boil down to ; (진행형으로는 쓰이지 않음) 핵심[본질]이 …이다, 결국 …이 되다, …으로 요약하다. [본문으로]
- spot ; (-tt-) 1. [진행형으로는 쓰이지 않음] 발견하다, 찾다, 알아채다(특히 갑자기 또는 쉽지 않은 상황에서 그렇게 함을 나타냄) [본문으로]
- human settlement ; (건축용어) 거주지(居住地) [본문으로]
- satellite pictures ; 위성(으로 보내오는) 화면들 [본문으로]
- train ; 1. ~ (sb) (as/in/for sth) (특정한 직업・일을 위해) 교육[훈련]시키다; 교육[훈련]받다 [본문으로]
- cancerous ; [형용사] 암의; 암에 걸린; 불치의; 독성의 [본문으로]
- lesion ; [명사] (의학) 병변, 병소 ;; US·UK [|li:Ʒn] [본문으로]
- inspect ; 1. ~ sth/sb (for sth) (특히 모든 것이 제대로 되어 있는지 확인하기 위해) 점검[검사]하다 [본문으로]
- pick out ; [동사] 선택하다; 뽑아[쪼아] 내다; 듣고 분간하다; 가려내다; 장식하다. ;; 동의어 ; choose; extract; recognize; discriminate; deck out. [본문으로]
- intimate ; 2. (흔히 성생활과 관련된) 사적인[은밀한] [본문으로]
- infer ; (-rr-) 1. ~ sth (from sth) 추론하다 [본문으로]
- sexual orientation ; [명사] 완곡적 성적지향; (특히) 동성애(gay). (또는 sexual preference) [본문으로]
- pick up on sth ; 1. ~을 이해하다[알아차리다] [본문으로]
- facial structure ; 안면 구조, 얼굴 구조 [본문으로]
- data set ; [명사] (컴퓨터) 데이터 세트(컴퓨터상의 데이터 처리에서 한 개의 단위로 취급하는 데이터의 집합) [본문으로]
- be trained to ; ~하도록 훈련받다 [본문으로]
- trait ; [명사] (성격상의) 특성 [본문으로]
- political view ; 정치관, 출처관, 정관 [본문으로]
- rely on ; 1. ~에 의지[의존]하다, ~을 필요로 하다 [본문으로]
- download ; [타동사][VN] (컴퓨터) (데이터를) 다운로드하다[내려받다] ;; 참고 ; load [본문으로]
- detection ; [U] 발견, 간파, 탐지 [본문으로]
- clarity ; 3. (사진・소리・물질의) 선명도[투명도] [본문으로]
- leave ; 6. SB/STH IN CONDITION/PLACE | (어떤 결과를) 남기다[(남겨) 주다] [본문으로]
- represent ; 6. IN PICTURE | ~ sb/sth (as sb/sth) (특히 그림으로) 보여주다[제시하다] [본문으로]
- evenly ; 2. 균등하게, 고르게; 대등하게 [본문으로]
- be fed into ; ~로 들어가다, 넣다, 반영되다, 먹이다 [본문으로]
- faceprint ; [명사] 페이스프린트 ((신원 확인을 위해 데이터베이스화한 사람 얼굴의 디지털 사진)) [본문으로]
- predictive model ; 예측 모델, 예측 모형 [본문으로]
- logistic regression ; In statistics, logistic regression, or logit regression, or logit model is a regression model where the dependent variable (DV) is categorical. This article covers the case of a binary dependent variable—that is, where it can take only two values, "0" and "1", which represent outcomes such as pass/fail, win/lose, alive/dead or healthy/sick. Cases where the dependent variable has more than two outcome categories may be analysed in multinomial logistic regression, or, if the multiple categories are ordered, in ordinal logistic regression. In the terminology of economics, logistic regression is an example of a qualitative response/discrete choice model. [본문으로]
- correlation ; [C , U] ~ (between A and B) | ~ (of A with B) 연관성, 상관관계 [본문으로]
- outperform ; [타동사][VN] 더 나은 결과를 내다, 능가하다 [본문으로]
- distinguish between ; …을 구별하다. [본문으로]
- at random ; 무작위로[임의로/마구잡이로] ;; [명사] (수학) 임의로 [본문으로]
- attribute ; 2. ~ sth (to sb) (특히 말・글・그림 등을) …것[탓/책임]이라고 보다[말하다] ;; Verb [ə|trɪbju:t] Noun [|ӕtrɪbju:t] [본문으로]
- tell apart ; 구별하다, 분간하다. [본문으로]
- outstrip ; 2. (경쟁 상대를) 능가하다[앞서다] [본문으로]
- make a distinction ; 차별(구분)하다. [본문으로]
- tell ... from ; …를 구분하다(=differentiate). ;; to distinguish somebody/something from another person or thing [본문으로]
- align with ; 부합하다, 공조하다 [본문으로]
- offer[provide, give] a possible explanation ; ...에 대한 가능한 설명을 하다. [본문으로]
- fetus ; [명사] (임신 9주 후의) 태아(cf. EMBRYO) ;; US·UK [fí:təs] [본문으로]
- play a role ; ~에서 역할을 하다 [본문으로]
- facial structure ; 안면, 얼굴 구조 [본문으로]
- pick up ; (어떤 정보를) 듣게[알게] 되다; (습관·재주 등을) 들이게[익히게] 되다 [본문으로]
- latter ; 2. (기간・시기의) 후반의[끝 무렵의] ;; US.UK [|lӕtə(r)] [본문으로]
- hairline ; 1. 머리 선(특히 앞이마에 머리카락이 난 부분) [본문으로]
- neckline ; [명사] (특히 여성용 옷의) 목둘레선[네크라인] [본문으로]
- accurate rate ; 명중률, 적중률 [본문으로]
- apply ; 4. BE RELEVANT | [진행형으로는 쓰이지 않음] ~ (to sb/sth) 적용되다, 해당되다 [본문으로]
- ratio ; [명사] pl. -os ~ (of A to B) 비율, 비(比) [본문으로]
- reflect ; 3. [타동사][VN] (사물의 속성・사람의 태도・감정을) 나타내다[반영하다] [본문으로]
- rank ; [진행형으로는 쓰이지 않음] 1. GIVE POSITION | ~ (sb) (as sth) (등급・등위・순위를) 매기다[평가하다]; (등급・등위・순위를) 차지하다 [본문으로]
- pick out ; [동사] 선택하다; 뽑아[쪼아] 내다; 듣고 분간하다; 가려내다; 장식하다. ;; 동의어 ; choose; extract; recognize; discriminate; deck out. [본문으로]
- reliably ; [부사] 신뢰[신용]할 수 있어; 확실하게. [본문으로]
- calculate ; 2. 추정하다, 추산하다 [본문으로]
- selves ; SELF의 복수 ;; 2. [U] (the self [sing.]) (또한 격식) 자아, 자신 [본문으로]
- psychometric ; [형용사] (명사 앞에만 씀) (전문 용어) 정신력[정신 작용] 측정용의 [본문으로]
- profiling ; [U] (개요 작성을 위한) 자료[정보] 수집, 프로파일링 ;; 참고 ; racial profiling [본문으로]
- rely upon ; …에게 의존, 의지하다. [본문으로]
- model ; 3. CREATE COPY | [타동사][VN] 모형[견본]을 만들다 [본문으로]
- presidential campaign ; 대선 선거 운동 [본문으로]
- criticism ; 1. [U , C] ~ (of sb/sth) | ~ (that…) (좋지 못한 점을 지적하는) 비판, 비난 [본문으로]
- warn ... of ; ...에게 ~에 대해 경고하다, 충고하다, 주의를 주다 [본문으로]
- spouse ; [명사] (격식 또는 법률) 배우자 ;; US·UK [spaʊs;spaʊz] [본문으로]
- seek to ; ~하도록 시도, 추구하다 [본문으로]
- infer ; (-rr-) 1. ~ sth (from sth) 추론하다 [본문으로]
- pose a threat to ; ~에게 위협이 되다, 위협을 가하다 [본문으로]
- safety ; (pl. -ies) 1. [U] 안전(함) [본문으로]
- be at pains to do ; …하고자 애쓰다, 피나는 노력을 하다 [본문으로]
- bolt together ; 2. [타동사][VN] ~ A to B | ~ A and B together A와 B를 볼트로 접합하다 [본문으로]
- readily[easily] available ; 쉽게[간단히] 입수할 수 있는. [본문으로]
- copycat ; [명사] (비격식 못마땅함) 모방하는 사람, 흉내쟁이 [본문으로]
- replicate ; 1. [타동사][VN] (격식) (정확히) 모사[복제]하다 [본문으로]
- laboratory science ; 실험 과학 [본문으로]
- ever-growing ; 계속 늘어나는 [본문으로]
'Articles > Annotated' 카테고리의 다른 글
[Annotated] Why companies don't want you to take their brand names in vain (0) | 2017.09.13 |
---|---|
[Annotated] Why the world has so many Guineas (0) | 2017.09.13 |
[Annotated] The three pillars of Merkelism (0) | 2017.09.10 |
[Annotated] North Korea's nuclear test has left the South wanting nukes too (0) | 2017.09.09 |
[Annotated] President Trump winds down DACA (0) | 2017.09.06 |