any longer. The research paper is describing how we calculate
the split score. In terms of split method, we have three different
methods implemented; simple BINARY, COMPLETE, BOTH.
Hobby=tennis, and so on. BOTH will take the best out of the
two methods for each split. The split method can be specified
Continuous inputs are handled differently from discrete.
continuous inputs and find the best cut-points in some way.
Let me know if you have more questions.
"Peter" <pwu_1024@yahoo.com> wrote in message
news:1cbe401c38882$6a7ed990$a601280a@phx.gbl...
> It is now working! Thank you all, Peter, Jamie and Raman.
>
> I checked out the two papers (shown in the following)
> listed on the FAQ in answering the question "Where do I
> get the details of the two algorithms? " However, they
> don't seem to address very clearly what the creteria is
> used to stop the splitting, which node to split and the
> discretization of continuous values. Is there any other
> document better addressing these issues? I know there are
> many academic papers talking about these issues, but my
> main concern is how they are handled in Microsoft Decision
> Tree.
>
> Papers I read:
> =====================================================
> - Correlation counting:
> Surajit Chaudhuri, Usama M. Fayyad, Jeff Bernhardt,
> Scalable Classification over SQL Databases. ICDE 1999: 470-
> 479
> Found in
>
http://ftp.research.microsoft.com/Users/surajitc/icde99.pdf
>
> - The default scoring methods (Bayesian Dirichlet
> Equivalent with Uniform prior):
> David M. Chickering; Dan Geiger; David Heckerman,
> Learning Bayesian Networks: The Combination of Knowledge
> and
> Statistical Data, MSR-TR-94-09, 1994
> Found in
>
http://www.research.microsoft.com/scripts/pubdb/pubsasp.asp
> ?recordID=81
> =======================================================
>
> Another thing is that when I try to get information on how
> to plug-in third party's algorithm from the following link
>
>
http://www.microsoft.com/sql/techinfo/BI/2000/dmproviderswp > .asp
>
> I got the "Page not found" error. Did I miss something?
>
> Thanks for any input.
>
> Peter
>
> >-----Original Message-----
> >You also need to register the DataMiningAddIns.dll.
> >
> >--
> >Raman Iyer
> >SQL Server Data Mining
> >[Please do not send email directly to this alias. This
> alias is for
> >newsgroup purposes and is intended to prevent automated
> spam. This posting
> >is provided "AS IS" with no warranties, and confers no
> rights.]
> >..
> >
> >"Peter" <pwu_1024@yahoo.com> wrote in message
> >news:0e5201c3885b$7b4e5b00$a101280a@phx.gbl...
> >> Jamie and Peter,
> >>
> >> Thanks a lot for the reply. The link works just fine.
> >>
> >> I have downloaded the following package
> >>
> >> DataMiningAddIns.exe
> >>
> >> and unzipped it. According to the readme.txt, I closed
> the
> >> running application Analysis Manager and ran the
> >>
> >> DataMiningAddIn.reg
> >>
> >> After that, I got the message saying that the
> registration
> >> is successful.
> >>
> >> Then, I started the Analysis Manager again. But here is
> >> the problem, when I right click the mining models of one
> >> database, say "Mushrooms", I can not see the "Advanced
> >> Model Properties" from the list.
> >>
> >> When I right click on server name and select properties,
> >> the "Add-ins" tab shows that "Mining model properties"
> as
> >> available Add-ins but there is also a yellow sign
> >> saying "this tab applies to the local computer only".
> >>
> >> I am not sure where went wrong. Any suggestions?
> >>
> >> I am using SQL Server 2000 (SP3), Analysis Manger SP3 on
> >> Windows 2000, FYI.
> >>
> >> Regards,
> >>
> >> Peter
> >>
> >> >-----Original Message-----
> >> >Yes, I just tried and it still there.
> >> >
> >>
> >
http://www.msnusers.com/AnalysisServicesDataMining/Documen > >> ts/Files%2FFAQ.htm
> >> >or
> >>
> >
http://www.msnusers.com/AnalysisServicesDataMining/faq.msn > >> w
> >> >
> >> >--
> >> >Peter Kim
> >> >This posting is provided "AS IS" with no warranties,
> and
> >> confers no rights.
> >> >
> >> >"Peter" <pwu_1024@yahoo.com> wrote in message
> >> >news:0e2501c386d6$955b5560$a401280a@phx.gbl...
> >> >> Jamie,
> >> >>
> >> >> Thanks for the reply. I am ware that website. But
> when I
> >> >> wend there, the webpage just can not be accessed. Can
> >> you
> >> >> confirm that it is still there?
> >> >>
> >> >>
> >>
>
http://groups.msn.com/AnalysisServicesDataMining/Documents/ > >> >> Files/FAQ%2Ehtm
> >> >>
> >> >> Thanks,
> >> >>
> >> >> Peter
> >> >>
> >> >> >-----Original Message-----
> >> >> >You can find many of your answers in the FAQ at
> >> >> >
http://groups.msn.com/AnalysisServicesDataMining > >> >> >
> >> >> >In particular you can control the way a tree splits
> and
> >> >> how deep it is with
> >> >> >the SPLIT_METHOD and COMPLEXITY_PENALTY
> parameters. At
> >> >> the above website,
> >> >> >there is a sample AM plug-in that provides a user-
> >> >> interface for setting
> >> >> >algorithm parameters.
> >> >> >
> >> >> >I believe you can also find the link to the sample
> >> OLEDB
> >> >> for Data Mining
> >> >> >provider that you can use as a basis for your own
> >> >> algorithms
> >> >> >
> >> >> >--
> >> >> >Jamie MacLennan
> >> >> >SQL Server Data Mining
> >> >> >
> >> >> >This posting is provided "AS IS" with no warranties,
> >> and
> >> >> confers no rights.
> >> >> >----------------------------------------------------
> ---
> >> ---
> >> >> -------
> >> >> >
> >> >> >"Peter" <pwu_1024@yahoo.com> wrote in message
> >> >> >news:0a1401c3846d$3b01eef0$a401280a@phx.gbl...
> >> >> >> Hi,
> >> >> >>
> >> >> >> I am using the built in Microsoft Decision Tree to
> >> >> perform
> >> >> >> some data mining task on my Analysis Server
> (SP3). I
> >> >> have
> >> >> >> some difficulty to understand how it pickes the
> node
> >> and
> >> >> >> how it splits and terminates, etc.. I'd really
> like
> >> to
> >> >> >> know how the algorithm works.
> >> >> >>
> >> >> >> Another question is: Is there anyway to control
> the
> >> >> >> splitting and terminating conditions from the
> >> Analysis
> >> >> >> Manager?
> >> >> >>
> >> >> >> The last question is that I read the Analysis
> Manager
> >> >> >> (SP3) supports the third party's algorithm plug
> in.