paint-brush
Maitiro ekushandisa Propensity Score Matching: A Danho-ne-Nhanho Gwaroby@ngl21
438 kuverenga
438 kuverenga

Maitiro ekushandisa Propensity Score Matching: A Danho-ne-Nhanho Gwaro

by Angela Nedopekina9m2024/11/04
Read on Terminal Reader

Kurebesa; Kuverenga

Propensity Score Matching imwe yekuwedzera A/B bvunzo nzira inoshandiswa kana sampuli randomisation ikasashanda. Propensity mamakisi (mukana wekupihwa kuboka rekuyedza) reboka rekurapa rinoverengerwa kune wese mushandisi uye ipapo mushandisi anofananidzwa nemumwe mushandisi zvichibva pane zvakaitika kare zvekushandiswa kwechigadzirwa kuumba boka rinodzora. Mushure mezvo, mhedzisiro yemapoka maviri inofananidzwa uchishandisa bvunzo yenhamba uye chiitiko chekuyedza chinoyerwa. Iri pasi apa hurongwa hwePSM: 1) Unganidza iyo data pane iyo propensity mamakisi inofungidzirwa uye anofananidzwa mushandisi anowanikwa. 2) Fungidzira chibodzwa uchishandisa nzira, senge logistic regression, uye dzidzisa pane dataset kufanotaura kana mushandisi achapihwa kuboka rebvunzo. Kune wese mushandisi, modhi yakadzidziswa inoburitsa mukana wekuve muboka rebvunzo. 3) Mechi yakavakirwa pane propensity mamaki, apo nzira dzakasiyana dzekufananidza dzinoyedzwa, senge muvakidzani wepedyo.  4) Chiyero che covariates pakati pekurapa uye mapoka ekutonga anotariswa nekuverenga chiyero chekuenzanisa uye kugadzira zvirongwa. Chiyero chisina kunaka chinoratidza kuti modhi inofungidzira propensity mamakisi inoda kutsanangurwa. 5) Mhedzisiro yemuyedzo inofungidzirwa uchishandisa data rakafanana uye bvunzo yehuwandu inoitwa. Pane zvimwe zvipimo pakushandisa iyo PSM.
featured image - Maitiro ekushandisa Propensity Score Matching: A Danho-ne-Nhanho Gwaro
Angela Nedopekina HackerNoon profile picture
0-item


Munguva ichangopfuura ndanga ndichishanda pakuedza kwakavakirwa paPropensity Score Matching uye ndichiri kutsvaga ruzivo ndakasangana nekushaikwa kwezvinhu pamusoro penyaya. Zvizhinji zvezvinyorwa zvandakawana zviri pamusoro pekubudirira kweiyo nzira uye hazvina kunyatsotsanangurwa maererano nedzidziso. Naizvozvo, ndafunga kugovera newe gwara rakazara pane Propensity Score Matching chimiro nematanho ayo.

Chii chinonzi Propensity Score Matching uye nei uchiishandisa?

"Propensity mamakirwo ekufananidza kunosanganisira kuumba seti dzakafanana dzezvidzidzo zvakarapwa uye zvisina kurapwa zvinogovana kukosha kwakafanana kweiyo propensity mamaki. Kana sampu inofananidzwa yaumbwa, mhedzisiro yekurapa inogona kufungidzirwa nekuenzanisa zvakananga mhedzisiro. "


Tsanangudzo iyi yakatanga kupihwa naRosenbaum PR, Rubin DB muchinyorwa "Kuongorora kunzwisiswa kune isina kucherechedzwa bhinari covariate muchidzidzo chekucherechedza nemhedzisiro" ye1983.


Kuzvitaura zviri nyore, iyi imwe yekuwedzera A/B bvunzo nzira inoshandiswa kana sampuro randomisation ikasashanda . Propensity mamakisi (mukana wekupihwa kuboka rekuyedza) reboka rekurapa rinoverengerwa kune wese mushandisi uyezve mushandisi anofananidzwa nemumwe mushandisi zvichibva pane zvakaitika kare zvekushandiswa kwechigadzirwa kuumba boka rinodzora. Mushure mezvo, mhedzisiro yemapoka maviri inofananidzwa uchishandisa bvunzo yenhamba uye chiitiko chekuyedza chinoyerwa.


Mufananidzo 1. Propensity Score Matching Illustration


Asi sei uchishandisa nzira yakaoma yekutsvaga boka rekutonga kana A / B chikuva ichigona kuzviita panzvimbo? Mune zvimwe zviitiko hazvigoneke kushandisa A/B papuratifomu ine yakavakirwa-mukati kupatsanura basa. Heano maitiro anogona kuitika:


  1. Zvinodhura uye zvinopedza nguva kuti bhizinesi riite bvunzo yeA/B nekuda kweakasiyana siyana anoshanda basa uye kutaurirana (hongu, zvinogona kuitika).
  2. Kambani inogona kushaya purofiti ichiri kuyedza kuita mari kana kushambadzira-ane chekuita nemabasa / hurongwa kana ikaita yekirasi bvunzo / yekudzora kuyedza.
  3. Kazhinji hazvigoneke kuve nesampling isina kurongeka mukuedza pasina Indaneti inodiwa pabvunzo yeA/B.
  4. Izvo zvakare hazvishande kuisa bvunzo yeA/B mukuedza kwepamhepo kana chinhu chakayedzwa chisina kugovaniswa.


Ndakanga ndine nyaya yechina mukuita kwangu uye zvakaitika ndichishanda ne e-commerce chigadzirwa. Chikwata chechigadzirwa chaigadzirira kuyedza basa rekupa mabhonasi kune vashandisi mushure mekuisa yekutanga odha. Dambudziko nderekuti basa racho rakanga richishanda kwete kune vese vashandisi vaiisa yekutanga odha. Mimwe mamiriro ezvinhu, akadai sekukosha kwehurongwa, nezvimwewo, zvaifanira kuzadzikiswa. Muchiitiko ichi, zviri kupfuura miganhu yeA/B bvunzo papuratifomu kupatsanura traffic pakati pebvunzo nemapoka ekudzora. Heino chikonzero nei Propensity Score Matching yaive sarudzo.

Framework yePropensity Score Matching

A c yakakwana sisitimu yakavakirwa pachinyorwa " Propensity mamaki inofananidzwa neR: yakajairwa nzira uye maficha matsva "uye ine matanho mashanu (Mufananidzo 2).


Danho rekutanga nderekuunganidza iyo data painofungidzirwa chibodzwa chechimiro uye mushandisi anofananidzwa anowanikwa.


Danho rechipiri ndere kufungidzira zvibodzwa uchishandisa nzira, senge logistic regression, uye kudzidzisa pane dataset kufanotaura kana mushandisi achapihwa kuboka rebvunzo. Kune wese mushandisi, modhi yakadzidziswa inoburitsa mukana wekuve muboka rebvunzo.


Nhanho yechitatu inoreva kuenzanirana kwakavakirwa pane propensity mamaki, apo nzira dzakasiyana dzekufananidza dzinoyedzwa, semuvakidzani wepedyo.


Muchinhanho chechina, kuenzana kwema covariates pakati pekurapa uye mapoka ekudzora anotariswa nekuverenga kuenzanisa nhamba uye kugadzira zvirongwa. Chiyero chisina kunaka chinoratidza kuti modhi inofungidzira propensity mamakisi inoda kutsanangurwa.


Muchishanu chekupedzisira nhanho, mhedzisiro yebvunzo inofungidzirwa kushandisa inofananidzwa data uye bvunzo yehuwandu inoitwa.


Mufananidzo 2. Propensity Score Matching Framework

Data Collection

Iyi nhanho ndeyekuunganidza zvinodiwa zvinosiyana, covariates uye confounders. Covariate (X) ishanduko yakazvimirira inogona kukanganisa mhedzisiro yekuyedza (Y), asi iyo isiri yekufarira zvakananga. Confounder chimwe chinhu chisiri icho chiri kudzidzwa chinobatanidzwa zvese nekugoverwa kuboka rebvunzo (W) uye nemhedzisiro yekuyedza (Y).


Girafu iri pazasi inoratidza hukama hwezvakasiyana. X ndeye covariate, W chiratidzo chekupihwa kurapwa, uye Y ndiyo mhedzisiro. Girafu riri kuruboshwe rinoratidza hukama hwe confounder uye iyo iri kurudyi inoratidza yakazvimirira kubatana kwecovariate kune mhedzisiro yekuedza (Y) uye yekuyedza kugoverwa kweboka (W).


Mufananidzo 3. Ukama hwe covariate, test assignment uye mugumisiro. 


Pano zvakakosha kudonhedzera pasi kuti hazvikurudzirwe kusarudza chete mavheti ane hukama nekupihwa kwevashandisi kuboka rebvunzo (W) nekuti zvinogona kudzikisa iko chaiko mukuongororwa kwemutsauko weboka pasina kudzikira ( https://www.ncbi .nlm.nih.gov/pmc/articles/PMC1513192/ ).


Iwe unogona kubvunza kuti mangani emhando dzandinofanira kusarudza? Mhinduro iri nyore - zvakanyanya, zvirinani kuitira kuti uwane fungidziro yepamusoro yezvabuda uye kuderedza kurerekera pakudzidza . Uye pano ndiri kutaura nezve nhamba huru se 20-50 kana kutopfuura.

Propensity Score Estimation

Kuenderera mberi kune nhanho inotevera, inodiwa kuunganidza data uye kuseta mureza wekuve weboka rekurapa. Vamwe vese vashandisi vanogona kuumba boka rekutonga. Mushure mezvo propensity mamakisi inofungidzirwa uchishandisa nzira dzakasiyana, senge logistic regression kana masango asina kurongeka.


Zvizhinji zvezvinyorwa zvandakaverenga zvinokurudzira kuomerera kune logistic regression uye kusashandisa mamwe mamodheru akaomarara sezvo kurongeka kwepamusoro hakusi crucia l. Asi, kubudirira kwekufananidza nzira inotarisana nekururama.


Mushure mekusarudza nzira, muenzaniso wekufungidzira unodzidziswa pane data uchishandisa covariates yakasarudzwa kufanotaura kana mushandisi ari weboka rekuyedza. Chekupedzisira, iyo modhi inofanotaura kune wega mushandisi, uye propensity mamaki, mukana wekuve muboka rebvunzo, inoverengerwa. Panyaya yemasoftwares, muPython unogona kushandisa chero raibhurari yekufungidzira kutanga kubva kune basic scikit-dzidza uye kutamira kuMuporofita.

Data Matching

Chiitiko chinotevera ndechekushandisa nzira yekufananidza kutsvaga mushandisi anoenderana nemushandisi kubva muboka rekuyedza. Nokudaro, boka rekutonga rinoumbwa.


Pane nzira dzakasiyana dzekufananidza dzekusarudza kubva, semuenzaniso kunyatsofananidza kana Mahalanobis chinhambwe chinofananidzwa. Muchikamu chino ini ndichanyanya kukurukura nzira yakajairika yekufananidza muvakidzani wepedyo uye nekusiyana kwayo.


Muvakidzani wepedyo anofananidza (NNM) inoumbwa nezvikamu zviviri. Kutanga, iyo algorithm inotora vashandisi, mumwe nemumwe kubva kuboka rekurapa, mune yakatarwa. Zvadaro, kune wega wega mushandisi weboka rekuyedza, iyo algorithm inowana mushandisi muboka rekutonga ane iri pedyo propensity mamakisi. Aya matanho anodzokororwa kusvika pasina vashandisi vasara muyedzo kana mapoka ekutonga. MuPython, kune chaiwo maraibhurari ePSM sePyTorch, Psmpy , causallib . Kana kuti iwe unogara uchigona kunamatira kune chero yekare raibhurari ine anofananidza algorithms.


Izvo zvakakosha kuti uise pasi pasi kuti kana ukagadzira boka rekutonga rakafanana neyekare A/B bvunzo, apo vashandisi vari muboka vakasiyana uye saizi dzemuenzaniso dzakaenzana, NNM isina nzira yekutsiva inofanira kuitwa. Iyo nzira inoreva kuti mushure mekufananidza, iyo inofananidzwa pair ichabviswa, kuitira kuti mushandisi muboka rekutonga ashandiswe kamwe chete.


Pane zvakare sarudzo yekusarudza modhi yeNNM ine kana isina caliper. A caliper inoisa muganho wepamusoro wechinhambwe chezvibodzwa zvepropensity mupeya inofananidzwa. Nekudaro, mushandisi wega wega anogona chete kuenzaniswa nevashandisi veiyo propensity mamaki mukati mechikamu chidiki. Kana vashandisi vanokodzera vasingakwanisi kufananidzwa, mushandisi anoraswa.


Sei ndichifanira kushandisa caliper? Zvinokurudzirwa kuishandisa kana chinhambwe chezvibodzwa muzvikamu zvakafananidzwa chingave chakakura. Paunenge uchifunga nezve saizi yecaliper, funga zvinotevera: kana kuenzanisa kuita kusingagutsi, kuenzanisa kunogona kuitwa neyakaomesesa caliper uye kana kuenzanisa kuchibudirira asi huwandu hwemapeya akafananidzwa idiki, caliper inogona kukwidziridzwa ( https:/ /www.ncbi.nlm.nih.gov/pmc/articles/PMC8246231/ ).

Balance Diagnostics

Munguva iyi inotariswa kana covariates ebvunzo uye akafananidzwa ekudzora mapoka akaenzana, nekudaro, inoti kana mutambo wakarurama.

Iyo inhanho yakakosha sezvo isina kuenzanisa covariates inotungamira kune isiriyo A / B bvunzo mhinduro kuenzanisa.

Pane nzira nhatu dzekuenzanisa diagnostics:


-Nhamba dzinotsanangura: yakamisikidzwa zvinoreva musiyano (SMD) kana musiyano reshiyo (VR)

- nhamba miedzo

- kuona: qq-plot, histogram kana rudo chiitiko


Muchinyorwa ini ndinonyanya kuisa pfungwa pane yekutanga uye yechitatu sarudzo.


Chekutanga, ngatikurukurei yakamisikidzwa kureva musiyano uye musiyano reshiyo. Ndeapi maitiro anoratidza kuti covariate yakaenzana? Ini ndinokurudzira kuti kukosha kweSMD kuri pazasi 0.1 Panyaya yeVR, kukosha kuri pedyo ne1.0 kunoratidza chiyero .


Chechipiri, maererano nemaitiro ekuona, imwe yenhamba inotsanangura iri pamusoro inoverengerwa kune yega yega covariate uye inoratidzwa nemifananidzo. Ini pachangu ndinofarira chirongwa cherudo sezvo vese covariates vanogona kuiswa mugirafu rimwe uye covariates pamberi uye mushure mekufananidza zvinogona kuenzaniswa zviri nyore. Ndinoisa muenzaniso wegirafu pazasi.


Mufananidzo 4. Rudo chirongwa che covariates SMD pamberi uye mushure mekugadzirisa


Ko kana ma covariates achiri asina kugadzikana mushure mekufananidza? Kuenzanisira, yakajairwa mutsauko (SMD) yecovariates frequency yekutenga uye AOV inotenderedza 0.5, iyo iri pamusoro inodiwa 0.1. Zvinoreva kuti ma covariate haana kuenzana uye kudzokorora kunodiwa.

Imbalanced covariates chiratidzo chePSM modhi haishande uye inoda kuvakwa patsva. Naizvozvo, zvinofanirwa kuenda nhanho shoma kumashure uye kudzokorora kufananidza.

Pane nzira ina dzekuita patsva kuenzanisa:


1. Wedzera covariates itsva

2. Ingoshandura nzira yekufananidza sezvo pane zvakawanda zvavo

3. Batanidza Propensity Score Matching nechaiyo yekufananidza nzira

4. Wedzera saizi yemuenzaniso

Estimate of Treatment Effects

Chekupedzisira, tave kusvika padanho rekupedzisira apo mhedzisiro yekuyedza inofungidzirwa. Kune kunyanya matatu marudzi efungidziro yemhedzisiro: avhareji yekurapa maitiro (ATE), avhareji kurapwa maitiro pane akarapwa (ATT), uye avhareji kurapwa maitiro pakutonga (ATC). Chaizvoizvo, ATE musiyano wakaverengerwa mune kiyi metric pakati pebvunzo nemapoka ekudzora (zvakafanana kuyera metric hombe muyedzo yeA/B). Inoverengerwa senzira yekurapa maitiro, ATE = avg (Y1 - Y1) sezvakaratidzwa pazasi mumufananidzo.


Mufananidzo 5.Formulas uye mienzaniso yekuverenga yemigumisiro yekurapa 


Nepo ATT neATC iri avhareji yekurapa maitiro ebvunzo uye yekudzora boka, zvichiteerana. Yese inzira dzakananga uye dzinonzwisisika dzekufungidzira.


ATE ndiyo yakajairika mhando uye inoshandiswa kana kudzora uye mapoka makuru metric akaenzaniswa uye akaedzwa maitiro anoyerwa. Nepo ATT neATC zvichisarudzwa kana absolute metrics achidikanwa kuboka rega rega. Pakupedzisira, kuongororwa kwenhamba kwakakodzera kunoitwa kutarisa kukosha kwehuwandu hwemhedzisiro.

Kuganhurirwa kwePropensity Score Matching

Mushure mekutsanangurwa kwakadzama kweiyo Propensity Score Matching nzira, ingave nguva yekutanga kuishandisa mubasa rako, asi pane zvimwe zvinogumira zvinofanirwa kutariswa.


1. Bootstrap haikurudzirwe kuti ishandiswe nePropensity Score Matching sezvo ichiwedzera musiyano. ( https://economics.mit.edu/sites/default/files/publications/ON THE FAILURE OF THE BOOTSTRAP FOR.pdf )

 2. Stable unit treatment value assumption (SUTVA) principle must be met. 3. Propensity Score Matching implies using two machine learning algorithms (one for propensity score calculations and the second one for matching), which can be a pricy method to use for a company. On that account, it's advisable to negotiate with your team on A/B test conduction. 4. Finally, as discussed above, a big number of covariates are suggested to be used in the models. Thus, it requires a high-powered machine(-s) to calculate the results of the models. Again, it's a costly method to implement.

Nekudaro, kana zvichikwanisika kuita Propensity Score Matching , zviite uye usazeze kuwedzera ruzivo rwako uye ruzivo runoshanda. Rombo rakanaka nemiedzo yako yeramangwana uye kuwana muchina kudzidza



Ungada here kupindura mimwe yemibvunzo iyi? Iyo link ye template ndeye HERE . Unofarira kuverenga zvirimo kubva kune zvese zvatinokurudzira kunyora? Dzvanya HERE .