How Can We Verify The Clustering Algorithm Models?
Dec 5, 2006
Hi, all here,
Since we are not able to use accuracy chart for Clustering algorithms there. So how can we verify the accuracy of clustering algorithm models here in terms of its classification and regression tasks?
Thank you very much in advance for your guidance and advices for that.
Another tricky confusion to me is that: many algorithms settings for the native algorithms in SQL Server 2005 Data Mining do not really significantly improve the results of those mining models with settings changes? (Apart from clustering algorithm setting of cluster number, by setting 0 as the number of clusters, the system will automatically cluster the data into clusters which I assume is the best way of mining the model with this method).
Any good advcies on this will be a lot appreciated.
I am looking forward to hearing from you shortly for this confusion and thanks a lot in advance.
I have few questions regarding Clustering algorithm.
If I process the clustering model with Ks (K is number of clusters) from 2 to n how to find a measure of variation and loss of information in each model (any kind of measure)? (Purpose would be decision which K to take.)
Which clustering method is better to use when segmenting data K-means or EM?
I have been asked by developers if there is any advantage in processing multiple clustering models simultaneously by using AMO and multiple threads as against processing one after another.
I have limited experience with Analysis Services but based on my reading I don't see this method providing any advantage.
Does anyone have any recommendations or advice? The system Enterprise Edition running on an x86 Server with 2 dual core processors and 4GB of RAM. Would the answer alter if the server running x64 version of SQL Server and Windows.
I have a question on sequence clustering algorithm. As generally it is used for sequence analysis especially for web path visiting analysis. Besides that, what else scenarios could we apply this algorithm as well?
Thanks a lot in advance and I am looking forward to hearing from you shortly.
I am having a question about the node_distribution.PRABABILITY. Some of the attribute values though have a small number of support for the specific node, but why it has a big node_distribution.probability even greater than 1? How can the node_distribution.PROBABILITY be greater than 1? How dose SQL Server 2005 data mining engine calculate the node_distribution.PRPBABILITY for its Clustering algorithm? Really confused and need guidance for that.
We have 2 env. : Testing and Production, both are running Windows 2003 Enterprise Server with SQL Server 2005. The difference is Testing is NOT running Windows cluster but Production do so, what is the best way to transfer a database from testing to production?
We have another systems that both testing and production are running on NON-cluster and we use backup/restore to transfer the database, can it apply in this case.
And I found that there are a tools called DTC, which can transfer all DB objects from one DB to another, is it a best way to transfer between non-cluster and cluster env.?
Hi all , this is my first post here hope i will find help. I'm facing problems in understanding some topics .. and i searched on the net but every time i read an article i find new things and topics. i think i need just a good way to start from the following topics.
-recovery models (full,simple,..). -fullback and differntial back up.
waiting for your help or even links to external articles may also help.
Hello Everybody , I have Some questions and I am sure that this is the best place for them to not be questions any more. Here are My Questions: What is the difference between full recovery model and simple recovery model?
My database has an mdf file about 400mb and a log file about 5GB. Is this caused because of full recovery model?
What can I do in order to make my log file smaller?
Which recovery Model Do you Suggest for Use and Why?
Hi.Does anyone know any web site with common database models presented andexplained.Database models like calendar, adressbook, storehouse...Thnx in advance
I am using the Excel 2007 data mining add-in, and don't have write perms on the Analysis Services server. Can I store datamining models in an alternate location?
I have a Sharepoint integrated Reporting Services server. I have created Report models based on Data Sources. I then created Report Builder Reports to make some BI Charts and Graphs. They look great and work fine for a few days then all of a sudden I get a rsItemNotFound error saying it cannot find the Report Model which is clearly there.
An error has occurred during report processing. (rsProcessingAborted)
The item 'IT Reports/PWPDB_Prod_Model.smdl' cannot be found. (rsItemNotFound)
If I open the Report Model from Report Builder it loads the data fields, but also fails to run reports with the same error
If I try to regenerate the model from the library I get an error stating that the model needs to be a generated model.
The model specified must be a generated model.
(rsModelNotGenerated)
Even after rebooting all of the servers in the farm I get the same messages. I can regenerate a model from the data source and it doesn't work. I am seeing no errors in the logs of my app server at all regarding reporting services.
It looks as though I will have to de-integrate Reporting Services as it is not viable for a business critical application.
Need a little help here. I would like to utilize the ad-hoc capabilities of Reporting Services to grant the users of one of my web apps the ability to create all sorts of crazy reports that I don't have to develop.
Basically, this web app is a flexible survey engine. My vision is to have the users select questions from a survey that they wish to generate statistics on. My application would create a view in the database that transposes the data into a tabular format. They would then get kicked over to the reporting services web client, where they can regenerate a report model, and then fire up report builder to create their crazy statiscal abomination. (Aren't users grand!)
This process sounds wonderful, but I'm not aware of any way to get the report model to regenerate and pickup any views. I can get it to pick up any new tables that are added as long as a primary key is created for them, but I want the automated dynamic data rendering that comes with a view. Hopefully, some of you out there have tried to do this already and have some ideas that may help me. Thanks!
I've created models with Decision Tree and Neural Network algorithms that predict continous target. But I don't know how to interpret scores that occure under scatter accuracy plot. How should I interpret scores under scatter accuracy plot? How can I estimate occuracy of model created with Time Series? How can I compare accuracy of model created with Time Series with models created with Decision Trees and Neural Network algorithms?
I haven't written any .net database updates before. I wrote the following code for a visual web developer application and it appears to work. There is no close function or anything. Will the adapter, builder, dataset, and row objects be cleaned up and disposed of properly? Please tell me if I need to include anything else. The connection object is made elsewhere. ThanksDim adapter As New Data.SqlClient.SqlDataAdapter("SELECT * FROM Inventory WHERE InUnique=" & ListValue.Value, connect) Dim builder As New Data.SqlClient.SqlCommandBuilder(adapter)Dim dataset As Data.DataSet = New Data.DataSet() adapter.Fill(dataset, "Inventory")Dim datarow As Data.DataRow = dataset.Tables(0).Rows(0) datarow("InAgUnique") = Val(InAgUnique.Text)datarow("InTyUnique") = Val(InTyUnique.Text) datarow("InName") = InName.Textdatarow("InPnUniquePrimary") = Val(InPnUniquePrimary.Text) datarow("InPnUniqueSecondary") = Val(InPnUniqueSecondary.Text)datarow("InPnUniqueTertiary") = Val(InPnUniqueTertiary.Text) datarow("InPnUniqueSupervisor") = Val(InPnUniqueSupervisor.Text)datarow("InDescription") = InDescription.Text datarow("InProblemInstructions") = InProblemInstructions.Text adapter.Update(dataset, "Inventory")
Hi, all! I'm really confused about how to run a query on the database to check to see the CustomerID filed value already exists in the database. If it’s true, I want to display a message: “Valid� the user to proceed with the next steps Else Display the customer number doesn’t exist in the database, and cancel. I tried all sorts of things and I just can't get it to do it. Hopefully you can help. Thank you
Hi .Net Guru’s,I have an urgent requirement for my project; the issue is mentioned below;Using .Net(C#/VB.Net) I need to generate/created Database objects from XML schemas.I don't have any sample xml schema file to give you. You just imagine you have a sample .xsd file and this .xsd file will be used to create database tables.Please let me know if you have any queries. Thanks,nick
Just want to make things perfectly work and make the most of our fantastic SQL Server 2005 Data Mining Engine. Can any of you here give me some super advices on the validation of the mining models. As we always see, the 3 aspects of a mining model are: Score, Population correct, and Predict Probability. So the question is: How can we combine these three aspects to best judge the mining models by being able to tell which model is the best one? And to what extent can we really trust these mining models?
These are very important before we can actually bring the models into work to convince other people who have no ideas what are going on with these models. Yes, we just want to convince them with the results of these models and make the most of them and best help them getting the most from their business operations etc.
By the way please can you explain a bit details on each of these aspects? Thanks again.
I am looking forward to hearing from you shortly and thanks bunch for your help.
I am having a problem deploying or manually uploading smdl files to reporting services. I can upload anyother type of file without a problem. (dsv, ds, etc.) However, when i try to upload a smdl file, I get
"The permissions granted to user '<me>' are insufficient for performing this operation. (rsAccessDenied)"
I've been experimenting with the algorithm parameters for a market basket association model. The default MINIMUM_ITEMSET_SIZE is 1. This doesn't seem to make sense: what is the point of a single-member itemset? However changing the value to 2 substantially reduces the proportion of good recommendations obtained (which I'm testing via a holdout approach).
So I'm obviously misunderstanding what the parameter means. Can someone explain it please, and also explain the observation above)?
Using SQL Server 2005 Business Intelligence Studio, I created a Data Source (Test.ds), Data Source View (Test.dsv), and a Report Model (Test.smdl). It is very easy to deploy this model into a Report Server, from the Business Intelligence Studio, by right clicking the Report Model Project and choosing 'Deploy'.
I am have a few reports built in Report Builder and obviously using Report Models. All these report models are using a share data source. When I try enable to enable chache or creating subscription I get and error that "Credentials used not stored".
I thought well let me store the credentials for the data source used by the model used by the report on the server. I still get the same error.
I tried to create a custom data source for the report but there is not option for connection string to connect to Report Server Model. We have "Microsoft SQL SEVER", "OLE DB", "Microsoft SQL Analysis Services", "ORACLE", "ODBC", "XML", "SQP Netweaver BI" and "Hyperion Essabe"
if I understand this right to create cache or subscription I must store the credentials for each report making shared Data source concept redundant. Also I cannot create a connection string to connect to Report Server Model.
I would greatly appreciate if anyone can tell me how to enable cache or subscriptions for report that are built on report models using shared data source.
I need to SELECT something from a database that has a UNIQUEIDENTIFIER (GUID) field,
If the number is wrong (has some other than A-Z 0-9) than the ASP page just freaks out and gets "error converting from a character string to uniqueidentifier"
How can I check that the GUID is OK before I SELECT ? this is the number format: {7A9B5F81-4936-4A31-B4E2-9168AAB75A0}
I tried to cast this "error" number with no successs: "WHERE Deceased_ID = cast('"& "---------4936-4A31-B4E2-9168AAB75A0" &"' as uniqueidentifier)"
I am working on sql integrating with Hybris. Most of you are not aware of hybris as it was a new technology.
In hybris we will create some classes and in those classes we will insert some tables. These tables will be automatically inserted into the sql server. No need to manually insert them the hybris structure will insert it. So my doubt is, how can we find out whether all the records have been inserted into sql database or not how we can check that.
Is there any way?? We have those classes in wich we can see which tables are there. Or else from the data model itself we can see what are the tables avilable or shud be inserted into sql thru hybris.
Can we check by giving all the table names in a single query?? or is there any other way to find out. If we can check by giving all the table names in single query how we need to give it.
Hi,Is there a simple way to verify if a database exists?I'm writing a stored procedure that will accept a database name as an inputparameter,and create the database if it does't already exist.--Message posted via http://www.sqlmonster.com