To Split or not to Split: The Impact of Disparate Treatment in Classification

نویسندگان

چکیده

Disparate treatment occurs when a machine learning model produces different decisions for individuals based on legally protected or sensitive attribute (e.g., age, sex). In domains where prediction accuracy is paramount, it could potentially be acceptable to fit which exhibits disparate treatment. To evaluate the effect of treatment, we compare performance split classifiers (i.e., trained and deployed separately each group) with group-blind do not use attribute). We introduce benefit-of-splitting quantifying improvement by splitting classifiers. Computing directly from its definition intractable since involves solving optimization problems over an infinite-dimensional functional space. Under measures, (i) prove equivalent expression can efficiently computed small-scale convex programs; (ii) provide sharp upper lower bounds reveal precise conditions classifier will always suffer non-trivial gap finite sample regime, necessarily beneficial data-dependent understand this effect. Finally, validate our theoretical results through numerical experiments both synthetic real-world datasets.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

“ TCP Over OBS : To Split or Not To Split ? ”

Internet technology has advanced significantly over last decade. Now Internet is used not only to check emails or access information. Today’s Internet demands services such as video on demand, grid computing and very high data send rates which are bursty in nature. Current technology is unable to service such high bandwidth demands. Optical Burst Switching (OBS) technology shows huge potential ...

متن کامل

To split or not to split: Selecting the right server with batch arrivals

We consider a dispatching system, where jobs, arriving in batches, are assigned to single-server FCFS queues. Batches can be split to different queues on per job basis. However, the holding costs are batch-specific and incurred until the last member of the batch completes the service. By using the first policy improvement step of the MDP framework, we are able to derive robust dispatching polic...

متن کامل

from linguistics to literature: a linguistic approach to the study of linguistic deviations in the turkish divan of shahriar

chapter i provides an overview of structural linguistics and touches upon the saussurean dichotomies with the final goal of exploring their relevance to the stylistic studies of literature. to provide evidence for the singificance of the study, chapter ii deals with the controversial issue of linguistics and literature, and presents opposing views which, at the same time, have been central to t...

15 صفحه اول

the role of russia in transmission of energy from central asia and caucuses to european union

پس ازفروپاشی شوروی،رشد منابع نفت و گاز، آسیای میانه و قفقاز را در یک بازی ژئوپلتیکی انرژی قرار داده است. با در نظر گرفتن این منابع هیدروکربنی، این منطقه به یک میدانجنگ و رقابت تجاری برای بازی های ژئوپلتیکی قدرت های بزرگ جهانی تبدیل شده است. روسیه منطقه را به عنوان حیات خلوت خود تلقی نموده و علاقمند به حفظ حضورش می باشد تا همانند گذشته گاز طبیعی را به وسیله خط لوله مرکزی دریافت و به عنوان یک واس...

15 صفحه اول

Seronegative spondyloarthropathies: to lump or split?

The advent of novel biological therapies for the treatment of rheumatic disease has renewed interest in the seronegative spondyloarthropathies (SpAs). International efforts are redefining disease classification and measures of disease activity, outcome, metrology, and imaging. However, opinion is divided between those who propose that the SpA group represents the same disease with variable expr...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: IEEE Transactions on Information Theory

سال: 2021

ISSN: ['0018-9448', '1557-9654']

DOI: https://doi.org/10.1109/tit.2021.3075415