kddata
Questions: 170

Resolved: 170

Answers: 263

Views: 90054

Avg time to answer: 11.3 days
38710
views
1149
days
1
answers
Getting an error when running KNN algorithm on Project
844
views
1171
days
6
answers
Not Enough Distinct Predictions to compute area under ROC.
684
views
1211
days
4
answers
Error: undefined columns selected, when running dummy(x)
674
views
1182
days
3
answers
R error: infinite or missing values in 'x' when running svd(subs_mat)
618
views
1149
days
1
answers
Error from AUC function on KNN algorithm
591
views
1155
days
1
answers
How to manually compute KNN
585
views
1130
days
1
answers
Random Forest error on calibration function
568
views
1150
days
10
answers
Error running LR with glmnet
498
views
1203
days
1
answers
Large Graded Partner Assignment Question
467
views
1149
days
1
answers
GLM/GLMNET Multinomial/Binomial error
457
views
1179
days
3
answers
Question about the NYSE dataset for Large Assignment: heavily skewed dependent variable
440
views
1129
days
1
answers
WHAT DATA
422
views
1143
days
6
answers
Error Received in Decision Tree
419
views
1211
days
1
answers
Can you clarify the following statement: Compute the sum by both ID1 and ID2? HW1 Exercise 4 (due 2/1)
414
views
1157
days
7
answers
Euclidean Distance for K-Nearest Neighbors
395
views
1211
days
4
answers
How to keep the same structure of a data frame when applying functions to it?
380
views
1130
days
1
answers
Text mining after SVD
368
views
1180
days
1
answers
How to use your newly computed v and d to calculate u on the second half of the data?
366
views
1176
days
1
answers
Meaning of negative AUC (in Round 1 results)
364
views
1128
days
7
answers
predKNN Error: "Error in matrix(data = predKNN, ncol = k, nrow = nrow(testKNN)) : nonnumeric matrix extent"
357
views
1206
days
1
answers
How to compute a lag on a variable?
356
views
1208
days
1
answers
How to compute the time window in model deployment?
356
views
1207
days
1
answers
What is the difference between characters, numerics, factors, and integers?
354
views
1206
days
1
answers
Best Way to Attempt Homework
354
views
1177
days
3
answers
HW#2: Calculating Second Half of SUBS dataset
348
views
1180
days
1
answers
Output of the KNN algorithm question > Prediction or probability
348
views
1157
days
4
answers
Create a bagged tree and tune the ensemble size
347
views
1181
days
3
answers
Warning in install.packages : package ‘impute’ is not available (for R version 3.2.3)
346
views
1179
days
1
answers
Prediction Algorithm Output and what it means: If all my predNB values are under 0.50, is it ever guessing "YES?"
345
views
1142
days
1
answers
RandomForest Prediction in Project
341
views
1207
days
1
answers
Did you mean to say flag or lag in the exercise?
339
views
1170
days
1
answers
Lead variable for predictions
334
views
1158
days
2
answers
2.8.4 Exercises for Subsection 2.5.5
333
views
1206
days
1
answers
Can you please clarify what you are asking for when you want predictors from 5 columns?
333
views
1128
days
1
answers
Output for 5x2fcv function?
332
views
1115
days
1
answers
Logistic Regression Log Lambda / Coefficients Graph Explanation
329
views
1157
days
3
answers
Max AUC (bagged trees)
328
views
1179
days
1
answers
Large Assignment: Input / Output and function deliverables
328
views
1156
days
1
answers
K-Nearest Neighbors "for loop"
326
views
1177
days
1
answers
How do we write the predict function to give predictions for specific stocks?
323
views
1131
days
1
answers
Optimizing bins for calibration
323
views
1184
days
1
answers
How to manually compute proportions of 1s using k nearest neighbors in R?
320
views
1172
days
1
answers
Coaching Marketplace: Can coaches meet with different teams?
318
views
1170
days
2
answers
Threading in R
318
views
1181
days
1
answers
What is the purpose of svd?
317
views
1157
days
1
answers
kNN Calculated Proportions
317
views
1170
days
3
answers
Predicting Using Naive Bayes
314
views
1185
days
1
answers
What is the response variable for the Group Stock Price Project?
310
views
1123
days
1
answers
Passing variables between functions
309
views
1138
days
1
answers
Big Project Articles Text mining: Merging 2 data sets
309
views
1154
days
1
answers
Leading variable
307
views
1172
days
1
answers
Question about sapply
306
views
1170
days
2
answers
AUC
305
views
1144
days
1
answers
Bagged Trees Prediction
305
views
1157
days
1
answers
taking Y of the nearest neighbors
303
views
1115
days
1
answers
Exam length
301
views
1120
days
1
answers
Summarizing CrossValidation Performance Values
301
views
1157
days
1
answers
What is meant by "Estimation phase: Create a big ensemble with many trees only once"
301
views
1123
days
2
answers
DATA_GRADING data from 2002 missing
299
views
1137
days
2
answers
How to execute R code inside a string variable?
296
views
1171
days
5
answers
Reading .csv files as Dataframe
295
views
1184
days
1
answers
Breaking a "tie" for K-nearest neighbors
291
views
1121
days
1
answers
Comprising a 5x2fcv
290
views
1154
days
1
answers
Why use AUC?
290
views
1172
days
1
answers
What to do if a rm command was used when it shouldn't have been?
289
views
1184
days
1
answers
Why do we sometimes use the transpose function back to back, and how do we recognize when it is appropriate to do so?
289
views
1150
days
1
answers
Difference between BasetableTRAIN, BasetableVAL, BasetableTEST, and BasetableTRAINbig
288
views
1154
days
1
answers
Dependent Variable
286
views
1172
days
1
answers
kNN.index vs. knn
286
views
1177
days
1
answers
How to use validation method on two separate data sets (test data= jan2nd & train data=prev.yr data+jan1st data)
284
views
1130
days
1
answers
How to select number of dates in test set
283
views
1116
days
1
answers
Explain node impurity further
282
views
1156
days
1
answers
Using NaiveBayes() function that does not have the right criteria
282
views
1207
days
1
answers
What does putting "as." in front of numeric or factor accomplish?
281
views
1141
days
1
answers
Large assignment question about merging
281
views
1147
days
1
answers
Text mining: how to aggregate unstructured data in Large Assignment News Articles
280
views
1206
days
1
answers
How to combine two functions with differing outputs and equations?
280
views
1170
days
2
answers
Sorting by stock symbol
280
views
1135
days
1
answers
Is it better to delete row of NA's or impute mode?
279
views
975
days
1
answers
Test Material Usage
279
views
1172
days
1
answers
kNN in class dependent variable
279
views
1129
days
3
answers
Not enough memory to run RF on the Stocks data
277
views
1157
days
2
answers
HW#3 Question 1: Medium Tenure, High Spend
277
views
1206
days
1
answers
Homework due dates
276
views
1158
days
1
answers
Second intermediate deliverable: predictions
275
views
1130
days
1
answers
Choosing a good title
274
views
1131
days
1
answers
Does the WilcoxonRanks Test function need to only work for 10 AUC scores at the .05 level?
274
views
1121
days
1
answers
AUC ROC calculation explored
274
views
998
days
2
answers
Viewing Prep Quiz answers
274
views
1171
days
3
answers
Coaching Marketplace
273
views
1149
days
1
answers
submitting LR
273
views
1158
days
1
answers
Finding the Optimal # of Trees
270
views
1143
days
1
answers
Using the mapping.csv file with Article Data
270
views
1132
days
1
answers
Using Random Forest
269
views
1124
days
3
answers
Reading in new data
269
views
1000
days
1
answers
Viewing of the Solutions to the In-Class Gradable
268
views
1126
days
1
answers
What does it mean: return indicators
268
views
1155
days
1
answers
Discriminating between neighbors in K-nearest neighbors where distance measures are equal
267
views
1172
days
1
answers
First Prediction Wednesday Model
265
views
1161
days
1
answers
Predicting Probabilities of New (X,Y) Using Naive Bayes
265
views
976
days
1
answers
Defining the classes of columns while reading in data
265
views
1128
days
1
answers
Number of bins for plotting Calibration and bin classifier scores
264
views
1154
days
1
answers
Independent and dependent variables for predictions
264
views
1143
days
3
answers
Tuning in Decision Trees
263
views
1123
days
2
answers
How to combine multiple models for final submission
262
views
1123
days
1
answers
When to install.packages for grader
262
views
1142
days
1
answers
Round 3 Data
262
views
1126
days
1
answers
Clarification on Final Code Deliverable
261
views
1143
days
1
answers
Round 4 Why are there so many unmatched Symbols?
261
views
1123
days
2
answers
New data passed with ensemble of models?
261
views
1135
days
1
answers
yTEST
260
views
1118
days
2
answers
Requesting a summary of results from Stock Prediction final submission
260
views
1156
days
1
answers
Computing Probabilities manually with Bayes Theorem.
260
views
1185
days
1
answers
What is the significance of document vector weighting?
260
views
1123
days
1
answers
Matching symbols with predictions in the predict function
259
views
1127
days
1
answers
What do we do if our calibrated predictions are incorrect?
259
views
1145
days
3
answers
Determining optimal number of trees: Exercise 2.8.6
259
views
1135
days
1
answers
Manually Evaluate Neural Network
258
views
945
days
1
answers
Second Test Allowed Material Usage
256
views
1154
days
1
answers
How to manually calculate AUC?
255
views
1122
days
1
answers
Should I subtract colMeans from my DTM matrix when I am recreating it in the predict function?
255
views
1132
days
2
answers
The datasets.Rdata file location
254
views
1145
days
1
answers
Neural Network Calculation: Entropy Term, etc.
254
views
1131
days
1
answers
What's the difference between Predictions and Predictions 2
254
views
1122
days
1
answers
Splitting a Table Without Hardcoding
253
views
1153
days
1
answers
Loop with letters instead of numbers
253
views
1143
days
1
answers
Optimal Ensemble Size Exercise 2.8.6 for section 2.5.7
251
views
1129
days
1
answers
How do we structure a data frame for stacking models?
251
views
1143
days
1
answers
Defining Validation data vs training data
249
views
1005
days
5
answers
Computer Error on the First Prep Quiz
248
views
1127
days
1
answers
Getting a better smoothed fit for plot
247
views
1110
days
1
answers
Cross Validation Optimal Parameter Consistency
246
views
820
days
3
answers
What do you mean when you use the word 'handling'?
243
views
1142
days
1
answers
Inner merging DF with cardinality of many-to-many
241
views
1123
days
1
answers
columns.txt file in Data Grading folder
240
views
1124
days
1
answers
MERGE() command Techniques
239
views
1126
days
1
answers
Wilcoxon assignment
239
views
1110
days
2
answers
Cross Validation & Wilcoxon Solutions
239
views
1135
days
1
answers
NAs in Dataset from Lead Variable
236
views
940
days
1
answers
"idf" function
231
views
1126
days
1
answers
Wilcoxon signed-ranks test: Value of parameter "p" in the critical values table
229
views
983
days
1
answers
How would && operators increase efficiency?
227
views
1127
days
1
answers
graphing function takes labels as argument
225
views
1126
days
1
answers
Deploying the model given by the calibrate function on unbinned or binned data?
219
views
914
days
1
answers
When do we perform variable selection outside of regression?
202
views
897
days
1
answers
How to select the sequence parameter to use in a loop while trying to determine the optimal K for a KNN model
202
views
747
days
1
answers
Support Vector Machines and figure 2.15
194
views
807
days
1
answers
Figure 2.8 and equation 2.24
192
views
854
days
1
answers
Data Frame subsetted with logical types FALSE, TRUE
190
views
774
days
1
answers
Features in Support Vector Machines
187
views
789
days
3
answers
Do we need to train the model on train+test set again after finding optimal ensemble size?
185
views
807
days
2
answers
Good Predictions
183
views
809
days
2
answers
In binomial likelihood function why is number of trials equal to 1
181
views
787
days
1
answers
Bagged Decision Tree vs. Random Forest
180
views
809
days
1
answers
The difference between using predict() on BasetableVAL and BasetableTEST
180
views
813
days
3
answers
Naive Bayes: How can we tell when to tune data?
180
views
816
days
1
answers
Viewing files before reading into R using notepad
179
views
807
days
1
answers
Double colon?
176
views
812
days
1
answers
Creating identical percentages of 1's and 0's efficiently
174
views
778
days
2
answers
ICA #19 Solution
173
views
817
days
1
answers
Combining rbind and lapply: the reasoning behind not using them together
172
views
782
days
1
answers
Something forgotten before an important deadline
170
views
760
days
1
answers
Understanding classifier performance
158
views
768
days
1
answers
AUACC vs AUROC curves
67
views
294
days
0
answers
67
views
294
days
0
answers
47
views
294
days
0
answers
45
views
294
days
0
answers
43
views
294
days
0
answers
41
views
294
days
0
answers