SIGKDD (KDD) 2012 Paper List
The 18th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, KDD '12, Beijing, China, August 12-16, 2012.
PubMed search and exploration with real-time semantic network construction.
EvaPlanner: an evacuation planner with social-based flocking kinetics.
EventSearch: a system for event discovery and retrieval on multi-type historical data.
A system for extracting top-K lists from the web.
VOXSUP: a social engagement framework.
HeteRecom: a semantic-based recommendation system in heterogeneous networks.
Navigating information facets on twitter (NIF-T).
Siren: an interactive tool for mining and visualizing geospatial redescriptions.
GeoSearch: georeferenced video retrieval system.
AssocExplorer: an association rule visualization system for exploratory data analysis.
Intelligent advertising framework for digital signage.
MoodLens: an emoticon-based sentiment analysis system for chinese tweets.
Information propagation game: a tool to acquire human playing data for multiplayer influence maximization on social networks.
D-INDEX: a web environment for analyzing dependences among scientific collaborators.
TourViz: interactive visualization of connection pathways in large graphs.
Visual exploration of collaboration networks based on graph degeneracy.
UFIMT: an uncertain frequent itemset mining toolbox.
DAGger: clustering correlated uncertain data (to predict asset failure in energy networks).
Query-driven discovery of semantically similar substructures in heterogeneous networks.
BC-PDM: data mining, social network analysis and text mining system based on cloud computing.
On "one of the few" objects.
Fast algorithms for comprehensive n-point correlation estimates.
Mining discriminative components with low-rank and sparsity constraints for face recognition.
On nested palindromes in clickstream data.
Efficient frequent item counting in multi-core hardware.
An enhanced relevance criterion for more concise supervised pattern discovery.
Automatic taxonomy construction from keywords.
LIEGE: link entities in web lists with knowledge base.
Latent association analysis of document pairs.
Large-scale learning of word relatedness with constraints.
Similarity search in real world networks.
Understanding users' satisfaction for search engine evaluation.
Information processing in social networks.
Social media data analysis for revealing collective behaviors.
SmartDispatch: enabling efficient ticket dispatch in an IT service environment.
A framework for robust discovery of entity synonyms.
Storytelling in entity networks to support intelligence analysts.
PatentMiner: topic-driven patent analysis and mining.
Design principles of massive, robust prediction systems.
Integrating meta-path selection with user-guided object clustering in heterogeneous information networks.
Active spectral clustering via iterative uncertainty reduction.
Locally-scaled spectral clustering using empty region graphs.
Chromatic correlation clustering.
Two approaches to understanding when constraints help clustering.
Learning personal + social latent factor model for social recommendation.
RecMax: exploiting recommender systems for fun and profit.
Cross-domain collaboration recommendation.
Incorporating heterogeneous information for personalized tag recommendation in social tagging systems.
Circle-based recommendation in online social networks.
Mining coherent subgraphs in multi-layer graphs with edge labels.
Summarization-based mining bipartite graphs.
Fast algorithms for maximal clique enumeration with limited memory.
RolX: structural role extraction & mining in large graphs.
Streaming graph partitioning for large distributed graphs.
Online allocation of display ads with smooth delivery.
Factoring past exposure in display advertising targeting.
SHALE: an efficient algorithm for allocation of guaranteed display advertising.
The untold story of the clones: content-agnostic factors that impact YouTube video popularity.
Joint optimization of bid and budget allocation in sponsored search.
Experiences and lessons in developing industry-strength machine learning and data mining software.
SympGraph: a framework for mining clinical notes through symptom relation graphs.
RainMon: an integrated approach to mining bursty timeseries monitoring data.
Multi-source learning for joint analysis of incomplete multi-modality neuroimaging data.
An integrated data mining approach to real-time clinical monitoring and deterioration warning.
Active sampling for entity matching.
Metro maps of science.
Stratified k-means clustering over a deep web data source.
Open domain event extraction from twitter.
Modeling disease progression via fused sparse group lasso.
Multi-domain active learning for text classification.
Transductive multi-label ensemble classification for protein function prediction.
Web image prediction using multivariate point processes.
Adversarial support vector machine learning.
Anonymizing set-valued data by nonreciprocal recoding.
Differential identifiability.
Event-based social networks: linking the online and offline social worlds.
Towards social user profiling: unified and discriminative influence model for inferring home locations.
Finding trendsetters in information networks.
Capacitated team formation problem on social networks.
Leveraging predictive modeling to reduce signal theft in a multi-service organization environment.
Ensembles and model delivery for tax compliance.
Following the electrons: methods for power management in commercial buildings.
HySAD: a semi-supervised hybrid shilling attack detector for trustworthy product recommendation.
Coupled behavior analysis for capturing coupling relationships in group-based market manipulations.
Empowering authors to diagnose comprehension burden in textbooks.
Random forests for metric learning with implicit pairwise position dependence.
On socio-spatial group query for location-based social networks.
A probabilistic model for multimodal hash function learning.
Maximum inner-product search using cone trees.
Feature grouping and selection over an undirected graph.
Model mining for robust feature selection.
Unsupervised feature selection for linked social media data.
Robust multi-task feature learning.
Intrusion as (anti)social communication: characterization and detection.
A near-linear time approximation algorithm for angle-based outlier detection in high-dimensional data.
Different slopes for different folks: mining for exceptional regression models with cook's distance.
Integrating community matching and outlier detection for mining evolutionary community outliers.
Discovering value from community activity on focused question answering sites: a case study of stack overflow.
Mining contentions from discussions and debates.
Selecting a characteristic set of reviews.
Review spam detection via temporal pattern discovery.
Cross-media knowledge discovery.
Bayesian relational data analysis.
Experience with discovering knowledge by acquiring it.
Algorithms for mining uncertain graph data.
Bid optimizing and inventory scoring in targeted online advertising.
Position-normalized click prediction in search advertising.
Trustworthy online controlled experiments: five puzzling outcomes explained.
Multimedia features for click prediction of new ads in display advertising.
Estimating conversion rate in display advertising from past performance data.
Efficient evaluation of large sequence kernels.
SPG-GMKL: generalized multiple kernel learning with a million kernels.
Batch mode active sampling based on marginal probability distribution matching.
Semi-supervised learning with mixed knowledge information.
Parallel field ranking.
Playlist prediction via metric embedding.
Online learning to diversify from implicit feedback.
ComSoc: adaptive transfer of user behaviors over composite social network.
Estimating entity importance via counting set covers.
Transparent user models for personalization.
Mining large-scale, sparse GPS traces for map inference: comparison of approaches.
USpan: an efficient algorithm for mining high utility sequential patterns.
SeqiBloc: mining multi-time spanning blockmodels in dynamic graphs.
Testing the significance of spatio-temporal teleconnection patterns.
Discovering lag intervals for temporal dependencies.
On the separability of structural classes of communities.
DEMON: a local-first discovery method for overlapping communities.
Overlapping community detection via bounded nonnegative matrix tri-factorization.
Vertex neighborhoods, low conductance cuts, and good seeds for local community methods.
Magnet community identification on social networks.
Key lessons learned building recommender systems for large-scale social networks.
Semantic search and a new moore's law effect in knowledge engineering.
Bootstrapped language identification for multi-site internet domains.
Harnessing the wisdom of the crowds for accurate web page clipping.
Keyword-propagation-based information enriching and noise removal for web news videos.
Scalable misbehavior detection in online video chat services.
Inductive multi-task learning with multiple view data.
Rank-loss support instance machines for MIML instance annotation.
Multi-label hypothesis reuse.
A structural cluster kernel for learning on graphs.
Low rank modeling of signed networks.
Learning binary codes for collaborative filtering.
Large-scale distributed non-negative sparse coding and sparse dictionary learning.
Optimal exact least squares rank minimization.
Efficient event pattern matching with match windows.
The long and the short of it: summarising event sequences with serial episodes.
Towards heterogeneous temporal clinical event pattern discovery: a convolutional approach.
Mining event periodicity from incomplete observations.
Aggregating web offers to determine product prices.
Interacting viruses in networks: can both survive?
Discriminative clustering for market segmentation.
Efficient and domain-invariant competitor mining.
Maximizing return and minimizing cost with the right decision management systems.
China's national personal credit scoring system: a real-life intelligent knowledge application.
Finding trending local topics in search queries for personalization of a recommendation system.
Community discovery and profiling with social messages.
Entity-centric topic-oriented opinion summarization in twitter.
A framework for summarizing and analyzing twitter feeds.
Dependency clustering across measurement scales.
Subspace correlation clustering: finding locally correlated dimensions in subspace projections of the data.
Detecting changes of clustering structures using normalized maximum likelihood coding.
A sparsity-inducing formulation for evolutionary co-clustering.
Active learning for online bayesian matrix factorization.
GigaTensor: scaling tensor analysis up by 100 times - algorithms and discoveries.
Fast bregman divergence NMF using taylor expansion and coordinate descent.
Accelerated singular value thresholding for matrix completion.
A shapelet transform for time series classification.
Mining recent temporal patterns for event detection in multivariate time series data.
Fast mining and forecasting of complex time-stamped events.
Searching and mining trillions of time series subsequences under dynamic time warping.
eTrust: understanding trust evolution in an online world.
From user comments to on-line conversations.
Social sampling.
Learning from crowds in the presence of schools of thought.
Developing data mining applications.
A new challenge of information processing under the 21st century.
Building an engine for big data.
Interaction and collective intelligence in internet computing.
Differentially private transit data publication: a case study on the montreal transportation system.
GetJar mobile application recommendations with very sparse datasets.
Constructing popular routes from uncertain trajectories.
Discovering regions of different functions in a city using human mobility and POIs.
Linear support vector machines via dual cached loops.
Learning in non-stationary environments with class imbalance.
NASA: achieving lower regrets and faster rates via adaptive stepsizes.
Intelligible models for classification and regression.
A simple methodology for soft cost-sensitive classification.
Multi-view clustering using mixture models in subspace projections.
TM-LDA: efficient online modeling of latent topic transitions in social media.
Overlapping decomposition for causal graphical modeling.
Practical collapsed variational bayes inference for hierarchical dirichlet process.
The contextual focused topic model.
Sampling minimal frequent boolean (DNF) patterns.
Mining top-K high utility itemsets.
Linear space direct pattern sampling using coupling from the past.
Mining emerging patterns by streaming feature selection.
Finding minimum representative pattern sets.
The missing models: a data-driven approach for learning how networks grow.
Information diffusion and external influence in networks.
PageRank on an evolving graph.
Efficient personalized pagerank with accuracy assurance.
Rise and fall patterns of information diffusion: model and implications.
Experiments in social computation: (and the data they generate).
Divide-and-conquer and statistical inference for big data.
Mining heterogeneous information networks: the next frontier.
Nine real hard problems we'd like you to solve.