From: Murphy Choy on
Hi,

Can I propose that you take a look at roc curve and the gini coefficient?

If you see they are good, it may be just a case that one or two variables are bad.

------Original Message------
From: Alexis Lelex
Sender: SAS(r) Discussion
To: SAS-L(a)LISTSERV.UGA.EDU
ReplyTo: Alexis Lelex
Subject: Quality of logistic regression model
Sent: Oct 22, 2009 8:52 PM

Hi,

This is my first post here and my english is not well... so i'll do my
best to make me understand.
I'm modelling a logistic regression on more than 120 000 individuals, and
i get some very interesting results with my odds ratios, and all p-values
are <0,0001.
But some figures in the SAS output make look bad quality of the model:
high AIC and SC, low R2 and Tau-a, Hosmer and Lemeshow telling a lack of
fit...
Here's some part of my output:

Statistiques d'ajustement du modèle

Coordonnées Coordonnées � l'origine
Critères A l'origine Avec Covariables
AIC 91616.461 83858.175
SC 91626.215 84043.487
-2 Log L 91614.461 83820.175


R-Square 0.0595 Max-rescaled R-Square 0.1158


Association des probabilités prédites et des réponses observées

Percent Concordant 71.2 Somers' D 0.431
Percent Discordant 28.1 Gamma 0.434
Percent Tied 0.7 Tau-a 0.089
Pairs 1666460055 c 0.715


Test d'adéquation d'Hosmer et de Lemeshow

Khi 2 DF Pr > Khi 2

78.2312 8 <.0001


Is it possible to make the interpretation of the odds ratios, even though
there's a lack of fit and the model isn't predictive ?
In other words what conclusion can we take (or not) from a model like this
one ?

If someone can help me on this one it'll be really great !

Thanks

PS: by the way very good SAS forum, i learn a lots of things reading you
peoples !


Sent from my BlackBerry Wireless Handheld

--
Regards,
Murphy Choy

Certified Advanced Programmer for SAS V9
Certified Basic Programmer for SAS V9
From: Alexis Lelex on
Yes i put out the roc curves like this:
proc gplot data=roc1;
title 'ROC Curve';
plot _sensit_*_1mspec_=1 / vaxis=0 to 1 by .1 cframe=ligr;
run;

But i have no idea to telle something about it.

I read the Gini coefficient can be obtained like this:
Gini = 2*C statistics - 1
so in my case Gini=0,43
is it good or not ? i can't tell...
From: Alexis Lelex on
Thanks,

i heard the c statistics correspond to the area under the Roc curb, and
can variate between 0,5 and 1.

I already try to leave some covariates out of the model, but i still got
the same problem.
From: Murphy Choy on
Hi,

To understand roc curve, you can read the information from sas papers on roc.

Gini looks ok.

------Original Message------
From: Alexis Lelex
Sender: SAS(r) Discussion
To: SAS-L(a)LISTSERV.UGA.EDU
ReplyTo: Alexis Lelex
Subject: Re: Quality of logistic regression model
Sent: Oct 22, 2009 10:04 PM

Yes i put out the roc curves like this:
proc gplot data=roc1;
title 'ROC Curve';
plot _sensit_*_1mspec_=1 / vaxis=0 to 1 by .1 cframe=ligr;
run;

But i have no idea to telle something about it.

I read the Gini coefficient can be obtained like this:
Gini = 2*C statistics - 1
so in my case Gini=0,43
is it good or not ? i can't tell...


Sent from my BlackBerry Wireless Handheld

--
Regards,
Murphy Choy

Certified Advanced Programmer for SAS V9
Certified Basic Programmer for SAS V9
From: Murphy Choy on
Hi,

What abt the lift?

------Original Message------
From: Alexis Lelex
Sender: SAS(r) Discussion
To: SAS-L(a)LISTSERV.UGA.EDU
ReplyTo: Alexis Lelex
Subject: Re: Quality of logistic regression model
Sent: Oct 22, 2009 10:28 PM

Thanks,

i heard the c statistics correspond to the area under the Roc curb, and
can variate between 0,5 and 1.

I already try to leave some covariates out of the model, but i still got
the same problem.


Sent from my BlackBerry Wireless Handheld

--
Regards,
Murphy Choy

Certified Advanced Programmer for SAS V9
Certified Basic Programmer for SAS V9
 |  Next  |  Last
Pages: 1 2 3
Prev: logistics model.
Next: data manipulation problem