cikm61

cikm 2012 论文列表

21st ACM International Conference on Information and Knowledge Management, CIKM'12, Maui, HI, USA, October 29 - November 02, 2012.

DOLAP 2012 workshop summary.
WIDM 2012: the 12th international workshop on web information and data management.
PIKM 2012: 5th ACM workshop for PhD students in information and knowledge management.
First international workshop on information and knowledge management for developing region.
Fifth workshop on exploiting semantic annotations in information retrieval: ESAIR"12).
Workshop on multimodal crowd sensing (CrowdSens 2012).
PLEAD 2012: politics, elections and data.
DTMBIO 2012: international workshop on data and text mining in biomedical informatics.
Booksonline'12: 5th workshop on online books, complementary social media and their impact.
SHB 2012: international workshop on smart health and wellbeing.
The 2012 international workshop on web-scale knowledge representation, retrieval, and reasoning.
Managing interoperability and compleXity in health systems - MIXHS'12.
CDMW 2012 - city data management workshop: workshop summary.
CloudDB 2012: fourth international workshop on cloud data management.
DUBMMSM'12: international workshop on data-driven user behavioral modeling and mining from social media.
AMADA: web data repositories in the amazon cloud.
Primates: a privacy management system for social networks.
STFMap: query- and feature-driven visualization of large time series data sets.
MADden: query-driven statistical text analytics.
HadoopXML: a suite for parallel processing of massive XML data with multiple twig pattern queries.
Demonstrating ProApproX 2.0: a predictive query engine for probabilistic XML.
The nautilus analyzer: understanding and debugging data transformations.
Exploration of monte-carlo based probabilistic query processing in uncertain graphs.
MAGIK: managing completeness of data.
MOUNA: mining opinions to unveil neglected arguments.
Simultaneous realization of page-centric communication and search.
Gumshoe quality toolkit: administering programmable search.
TASE: a time-aware search engine.
PicAlert!: a system for privacy-aware image classification and retrieval.
Mixed-initiative conversational system using question-answer pairs mined from the web.
Cager: a framework for cross-page search.
ESA: emergency situation awareness via microbloggers.
CrowdTiles: presenting crowd-based information for event-driven information needs.
A summarization tool for time-sensitive social media.
A tool for automated evaluation of algorithms.
InCaToMi: integrative causal topic miner between textual and non-textual time series data.
Supporting temporal analytics for health-related events in microblogs.
CarbonDB: a semantic life cycle inventory database.
Lonomics Atlas: a tool to explore interconnected ionomic, genomic and environmental data.
4Is of social bully filtering: identity, inference, influence, and intervention.
PRAVDA-live: interactive knowledge harvesting.
LUKe and MIKe: learning from user knowledge and managing interactive knowledge extraction.
Fast and accurate incremental entity resolution relative to an entity knowledge base.
Latent topics in graph-structured data.
Continuous top-k query for graph streams.
Similarity search in 3D object-based video data.
Enabling ontology based semantic queries in biomedical database systems.
Probabilistic ranking in fuzzy object databases.
Spatial-aware interest group queries in location-based social networks.
Information-complete and redundancy-free keyword search over large data graphs.
Contextual evaluation of query reformulations in a search session by user simulation.
Evaluating reward and risk for vertical selection.
Recency-sensitive model of web page authority.
Estimating query difficulty for news prediction retrieval.
Exploring simultaneous keyword and key sentence extraction: improve graph-based ranking using wikipedia.
SRGSIS: a novel framework based on social relationship graph for social image search.
Finding food entity relationships using user-generated data in recipe service.
Is wikipedia too difficult?: comparative analysis of readability of wikipedia, simple wikipedia and britannica.
A scalable approach for performing proximal search for verbose patent search queries.
Learning to recommend with social relation ensemble.
Where do the query terms come from?: an analysis of query reformulation in collaborative web search.
Predicting primary categories of business listings for local search.
Data filtering in humor generation: comparative analysis of hit rate and co-occurrence rankings as a method to choose usable pun candidates.
The face of quality in crowdsourcing relevance labels: demographics, personality and labeling accuracy.
RESQ: rank-energy selective query forwarding for distributed search systems.
PhotoFall: discovering weblog stories through photographs.
Topic based pose relevance learning in dance archives.
A latent pairwise preference learning approach for recommendation from implicit feedback.
Session-based query performance prediction.
On the usefulness of query features for learning to rank.
Demographic context in web search re-ranking.
An examination of content farms in web search using crowdsourcing.
Predicting CTR of new ads via click prediction.
Extracting interesting association rules from toolbar data.
Concavity in IR models.
Twitter hyperlink recommendation with user-tweet-hyperlink three-way clustering.
TwiSent: a multistage system for analyzing sentiment in twitter.
Climbing the app wall: enabling mobile app discovery through context-aware recommendations.
Large scale analysis of changes in english vocabulary over recent time.
A new probabilistic model for top-k ranking problem.
Short-text domain specific key terms/phrases extraction using an n-gram model with wikipedia.
A picture paints a thousand words: a method of generating image-text timelines.
Exploring the cluster hypothesis, and cluster-based retrieval, over the web.
Relation regularized subspace recommending for related scientific articles.
Improving the performance of the reinforcement learning model for answering complex questions.
I want what i need!: analyzing subjectivity of online forum threads.
Temporal models for microblogs.
Information preservation in static index pruning.
Survival analysis for freshness in microblogging search.
Enhancing product search by best-selling prediction in e-commerce.
How do humans distinguish different people with identical names on the web?
Question-answer topic model for question retrieval in community question answering.
On active learning in hierarchical classification.
Learning to rank search results for time-sensitive queries.
Query-performance prediction and cluster ranking: two sides of the same coin.
Coarse-to-fine sentence-level emotion classification based on the intra-sentence features and sentential context.
Predicting the performance of passage retrieval for question answering.
Bridging offline and online social graph dynamics.
A constraint to automatically regulate document-length normalisation.
An evaluation of corpus-driven measures of medical concept similarity for information retrieval.
On the inference of average precision from score distributions.
Hierarchical image annotation using semantic hierarchies.
Language processing for arabic microblog retrieval.
Semantically coherent image annotation with a learning-based keyword propagation strategy.
Fast candidate generation for two-phase document ranking: postings list intersection with bloom filters.
Towards measuring the visualness of a concept.
Serial position effects of clicking behavior on result pages returned by search engines.
Mathematical equation retrieval using plain words as a query.
An evaluation and enhancement of densitometric fragmentation for content slicing reuse.
Selecting expansion terms as a set via integer linear programming.
Dictionary based sparse representation for domain adaptation.
Hierarchical target type identification for entity-oriented queries.
An unsupervised method for author extraction from web pages containing user-generated content.
Automatic labeling hierarchical topics.
A co-training based method for chinese patent semantic annotation.
Composing activity groups in social networks.
Scalable collaborative filtering using incremental update and local link prediction.
Tweet classification based on their lifetime duration.
Entity resolution using search engine results.
Finding influential products on social domination game.
On using category experts for improving the performance and accuracy in recommender systems.
Parallel proximal support vector machine for high-dimensional pattern classification.
Mining advices from weblogs.
Top-N recommendation through belief propagation.
An efficient and simple under-sampling technique for imbalanced time series classification.
Prediction of retweet cascade size over time.
Tracing clusters in evolving graphs with node attributes.
A word-order based graph representation for relevance identification.
Graph-based collective classification for tweets.
On compressing weighted time-evolving graphs.
Text classification with relatively small positive documents and unlabeled data.
Time feature selection for identifying active household members.
Infobox suggestion for Wikipedia entities.
An interaction framework of service-oriented ontology learning.
On empirical tradeoffs in large scale hierarchical classification.
Dual word and document seed selection for semi-supervised sentiment classification.
Learning to predict the cost-per-click for your ad words.
Weighted linear kernel with tree transformed features for malware detection.
Maximizing revenue from strategic recommendations under decaying trust.
Information propagation in social rating networks.
The twitaholic next door.: scalable friend recommender system using a concept-sensitive hash function.
Accelerating locality preserving nonnegative matrix factorization.
A tensor encoding model for semantic processing.
Polygene-based evolution: a novel framework for evolutionary algorithms.
Clustering short text using Ncut-weighted non-negative matrix factorization.
An effective category classification method based on a language model for question category recommendation on a cQA service.
Outlier detection using centrality and center-proximity.
A tag-centric discriminative model for web objects classification.
Importance weighted passive learning.
Learning to rank for hybrid recommendation.
On bundle configuration for viral marketing in social networks.
A probabilistic approach to correlation queries in uncertain time series data.
Scaling multiple-source entity resolution using statistically efficient transfer learning.
Fast PCA computation in a DBMS with aggregate UDFs and LAPACK.
A new tool for multi-level partitioning in teradata.
Applying weighted queries on probabilistic databases.
Optimizing data migration for cloud-based key-value stores.
Adapt: adaptive database schema design for multi-tenant applications.
Star-Join: spatio-textual similarity join.
Loyalty-based selection: retrieving objects that persistently satisfy criteria.
Impact neighborhood indexing (INI) in diffusion graphs.
Author-conference topic-connection model for academic network search.
Efficient distributed locality sensitive hashing.
Real-time aggregate monitoring with differential privacy.
A positional access method for relational databases.
Efficient estimation of dynamic density functions with an application to outlier detection.
Location selection for utility maximization with capacity constraints.
Credibility-based product ranking for C2C transactions.
Keyword-based k-nearest neighbor search in spatial databases.
CloST: a hadoop-based storage system for big spatio-temporal data analytics.
Clustering Wikipedia infoboxes to discover their types.
An efficient index for massive IOT data in cloud environment.
Finding the optimal path over multi-cost graphs.
On skyline groups.
Efficient buffer management for piecewise linear representation of multiple data streams.
SliceSort: efficient sorting of hierarchical data.
LINDA: distributed web-of-data-scale entity matching.
Diversifying query results on semi-structured data.
Discovering conditional inclusion dependencies.
Schema-free structured querying of DBpedia data.
Efficient logging for enterprise workloads on column-oriented in-memory databases.
Sort-based query-adaptive loading of R-trees.
Top-k retrieval using conditional preference networks.
Fast top-k similarity queries via matrix compression.
Theme chronicle model: chronicle consists of timestamp and topical words over each theme.
Mining sentiment terminology through time.
Multi-session re-search: in pursuit of repetition and diversification.
Predicting web search success with fine-grained interaction data.
SonetRank: leveraging social networks to personalize search.
Stochastic simulation of time-biased gain.
GTE: a distributional second-order co-occurrence approach to improve the identification of top relevant dates in web snippets.
User activity profiling with multi-layer analysis.
User guided entity similarity search using meta-path selection in heterogeneous information networks.
Sentiment-focused web crawling.
Modeling browsing behavior for click analysis in sponsored search.
Query recommendation for children.
A unified optimization framework for auction and guaranteed delivery in online advertising.
Characterizing web search queries that match very few or no results.
You should read this! let me explain you why: explaining news recommendations to users.
The downside of markup: examining the harmful effects of CSS and javascript on indexing today's web.
Automatic query expansion based on tag recommendation.
Detecting offensive tweets via topical feature discovery over a large scale twitter corpus.
Full-text citation analysis: enhancing bibliometric and scientific publication ranking.
Map to humans and reduce error: crowdsourcing for deduplication applied to digital libraries.
Differences in effectiveness across sub-collections.
Location-sensitive resources recommendation in social tagging systems.
Entity centric query expansion for enterprise search.
A comprehensive analysis of parameter settings for novelty-biased cumulative gain.
PolariCQ: polarity classification of political quotations.
Search result presentation based on faceted clustering.
CONSENTO: a new framework for opinion based entity search and summarization.
Learning from mistakes: towards a correctable learning algorithm.
Mining noisy tagging from multi-label space.
Discovering logical knowledge for deep question answering.
Query-biased learning to rank for real-time twitter search.
Recommending citations: translating papers into references.
BiasTrust: teaching biased users about controversial topics.
Collaborative ranking: improving the relevance for tail queries.
Towards jointly extracting aspects and aspect-specific sentiment knowledge.
Structured query reformulations in commerce search.
Task tours: helping users tackle complex search tasks.
From sBoW to dCoT marginalized encoders for text representation.
Federated search in the wild: the combined power of over a hundred search engines.
Interactive and context-aware tag spell check and correction.
Sketch-based indexing of n-words.
Semantic context learning with large-scale weakly-labeled image set.
Leveraging tagging for neighborhood-aware probabilistic matrix factorization.
Ranking news events by influence decay and information fusion for media and users.
Exploiting concept hierarchy for result diversification.
Do ads compete or collaborate?: designing click models with full relationship incorporated.
Quality models for microblog retrieval.
Customizing search results for non-native speakers.
Interest-matching information propagation in multiple online social networks.
Finding nuggets in IP portfolios: core patent mining through textual temporal analysis.
More than relevance: high utility query recommendation by mining users' search behaviors.
Variance maximization via noise injection for active sampling in learning to rank.
On the connections between explicit semantic analysis and latent semantic analysis.
Query likelihood with negative query generation.
Discover breaking events with popular hashtags in twitter.
Diversionary comments under political blog posts.
Automatic image annotation using tag-related random search over visual neighbors.
Estimating interleaved comparison outcomes from historical click data.
Trust prediction via aggregating heterogeneous social networks.
Content-based relevance estimation on the web using inter-document similarities.
A hybrid approach for efficient provenance storage.
Dynamic effects of ad impressions on commercial actions in display advertising.
Balanced coverage of aspects for text summarization.
Web-scale multi-task feature selection for behavioral targeting.
Using program synthesis for social recommendations.
Joint bilingual name tagging for parallel corpora.
Providing grades and feedback for student summaries by ontology-based information extraction.
A probabilistic approach to mining geospatial knowledge from social annotations.
Degree relations of triangles in real-world networks and graph models.
Real-time bid optimization for group-buying ads.
Community-based classification of noun phrases in twitter.
Measuring website similarity using an entity-aware click graph.
SemaFor: semantic document indexing using semantic forests.
Relational co-clustering via manifold ensemble learning.
The early-adopter graph and its application to web-page recommendation.
Shaping communities out of triangles.
WiSeNet: building a wikipedia-based semantic network with ontologized relations.
iSampling: framework for developing sampling methods considering user's interest.
Topic-sensitive probabilistic model for expert finding in question answer communities.
Time-aware topic recommendation based on micro-blogs.
Query-focused multi-document summarization based on query-sensitive feature space.
Exploring the existing category hierarchy to automatically label the newly-arising topics in cQA.
Unsupervised discovery of opposing opinion networks from forum discussions.
PathRank: a novel node ranking measure on a heterogeneous graph for recommender systems.
Preprocessing of informal mathematical discourse in context ofcontrolled natural language.
PriSM: discovering and prioritizing severe technical issues from product discussion forums.
Incorporating word correlation into tag-topic model for semantic knowledge acquisition.
Exploiting enriched contextual information for mobile app classification.
Hierarchical topic integration through semi-supervised hierarchical topic modeling.
PRemiSE: personalized news recommendation via implicit social experts.
If you are happy and you know it... tweet.
Frequent grams based embedding for privacy preserving record linkage.
What is happening right now ... that interests me?: online topic discovery and recommendation in twitter.
Swimming against the streamz: search and analytics over the enterprise activity stream.
gSCorr: modeling geo-social correlations for new check-ins on location-based social networks.
Empirical validation of the buckley-osthus model for the web host graph: degree and edge distributions.
Discretionary social network data revelation with a user-centric utility guarantee.
Meta path-based collective classification in heterogeneous information networks.
Mining topic-level opinion influence in microblog.
Mining coherent anomaly collections on web data.
Discovering personally semantic places from GPS trajectories.
Exploiting latent relevance for relational learning of ubiquitous things.
Effective and efficient?: bilingual sentiment lexicon extraction using collocation alignment.
Efficient extraction of ontologies from domain specific text corpora.
Reconciling ontologies and the web of data.
Graph-based workflow recommendation: on improving business process modeling.
Extraction of topic evolutions from references in scientific articles and its GPU acceleration.
A simple approach to the design of site-level extractors using domain-centric principles.
Measuring robustness of complex networks under MVC attack.
Learning spectral embedding via iterative eigenvalue thresholding.
Automatically embedding newsworthy links to articles.
Fast approximation of steiner trees in large graphs.
Joint relevance and answer quality learning for question routing in community QA.
Adapting vector space model to ranking-based collaborative filtering.
Feature selection based on term frequency and T-test for text categorization.
Mining long-lasting exploratory user interests from search history.
Hierarchical co-clustering based on entropy splitting.
GRAFT: an approximate graphlet counting algorithm for large graph analysis.
Influence and similarity on heterogeneous networks.
The walls have ears: optimize sharing for visibility and privacy in online social networks.
Evaluating geo-social influence in location-based social networks.
Mining competitive relationships by learning across heterogeneous networks.
Social recommendation across multiple relational domains.
Knowing where and how criminal organizations operate using web content.
Efficient jaccard-based diversity analysis of large document collections.
A unified learning framework for auto face annotation by mining web facial images.
Model the complex dependence structures of financial variables by using canonical vine.
Authentication of moving range queries.
Generically extending anonymization algorithms to deal with successive queries.
Efficient provenance storage for relational queries.
Robust distributed indexing for locality-skewed workloads.
A practical concurrent index for solid-state drives.
Iterative relevance feedback with adaptive exploration/exploitation trade-off.
Exploring and predicting search task difficulty.
Improving bag-of-visual-words model with spatial-temporal correlation for video retrieval.
The effect of aggregated search coherence on search behavior.
Generating facets for phone-based navigation of structured data.
CoNet: feature generation for multi-view semi-supervised learning with partially observed views.
Modeling semantic relations between visual attributes and object categories via dirichlet forest prior.
Learning to discover complex mappings from web forms to ontologies.
Automated feature weighting in naive bayes for high-dimensional data classification.
A novel local patch framework for fixing supervised learning models.
You can stop early with COLA: online processing of aggregate queries in the cloud.
Predicting the effectiveness of keyword queries on databases.
Deco: declarative crowdsourcing.
Efficient influence-based processing of market research queries.
CGStream: continuous correlated graph query for data streams.
Joint topic modeling for event summarization across news and social media streams.
Role-explicit query identification and intent role annotation.
Supporting factual statements with evidence from the web.
G-WSTD: a framework for geographic web search topic discovery.
Labeling by landscaping: classifying tokens in context by pruning and decorating trees.
Crosslingual distant supervision for extracting relations of different complexity.
Segmenting web-domains and hashtags using length specific models.
Active learning for relation type extension with local and global data views.
Non-stationary bayesian networks based on perfect simulation.
An effective rule miner for instance matching in a web of data.
Comprehension-based result snippets.
Processing continuous text queries featuring non-homogeneous scoring functions.
An automatic blocking mechanism for large-scale de-duplication tasks.
Domain dependent query reformulation for web search.
Click patterns: an empirical representation of complex query intents.
Leaving so soon?: understanding and predicting web search abandonment rationales.
Towards optimum query segmentation: in doubt without.
Acquiring temporal constraints between relations.
Predicting aggregate social activities using continuous-time stochastic process.
TUT: a statistical model for detecting trends, topics and user interests in social media.
Spatial influence vs. community influence: modeling the global spread of social media.
Pay-as-you-go maintenance of precomputed nearest neighbors in large graphs.
Monochromatic and bichromatic reverse nearest neighbor queries on land surfaces.
Efficient safe-region construction for moving top-K spatial keyword queries.
Finding top k most influential spatial facilities over uncertain objects.
Being picky: processing top-k queries with set-defined selections.
Completeness of queries over SQL databases.
GPU acceleration of probabilistic frequent itemset mining from uncertain databases.
On the foundations of probabilistic information integration.
What is the IQ of your data transformation system?
A model-based approach for RFID data stream cleansing.
Learning to rank duplicate bug reports.
Learning to rank by aggregating expert preferences.
Learning to rank for robust question answering.
Back to the roots: a probabilistic framework for query-performance prediction.
Predicting query performance for fusion-based retrieval.
On the design of LDA models for aspect-based opinion mining.
Two-part segmentation of text documents.
Modeling topic hierarchies with the recursive chinese restaurant process.
The generalized dirichlet distribution in enhanced topic detection.
TCSST: transfer classification of short & sparse text using external data.
Temporal corpus summarization using submodular word coverage.
Understanding book search behavior on the web.
Contextualization using hyperlinks and internal hierarchical structure of Wikipedia documents.
A math-aware search engine for math question answering system.
Towards an effective and unbiased ranking of scientific literature through mutual reinforcement.
A decentralized recommender system for effective web credibility assessment.
Multi-faceted ranking of news articles using post-read actions.
The efficient imputation method for neighborhood-based collaborative filtering.
Exploring personal impact for group recommendation.
Metaphor: a system for related search recommendations.
Right-protected data publishing with hierarchical clustering preservation.
Improving document clustering using automated machine translation.
Document-topic hierarchies from document graphs.
Maximum margin clustering on evolutionary data.
Scalable clustering of signed networks using balance normalized cut.
Matching product titles using web-based enrichment.
Large-scale item categorization for e-commerce.
Influence propagation in adversarial setting: how to defeat competition with least amount of investment.
Enabling direct interest-aware audience selection.
Daily-deal selection for revenue maximization.
Shard ranking and cutoff estimation for topically partitioned collections.
KORE: keyphrase overlap relatedness for entity disambiguation.
Efficient retrieval of recommendations in a matrix factorization framework.
Diversity in blog feed retrieval.
Sequential selection of correlated ads by POMDPs.
The wisdom of advertisers: mining subgoals via query clustering.
Visual appearance of display ads and its effect on click through rate.
Multiview hierarchical bayesian regression model andapplication to online advertising.
Delineating social network data anonymization via random edge perturbation.
From face-to-face gathering to social structure.
Collective intelligence in the online social network of yahoo!answers and its implications.
Predicting emerging social conventions in online social networks.
Utilizing common substructures to speedup tensor factorization for mining dynamic graphs.
TALMUD: transfer learning for multiple domains.
Fast and reliable anomaly detection in categorical data.
Local anomaly descriptor: a robust unsupervised algorithm for anomaly detection based on diffusion space.
Indexing uncertain spatio-temporal data.
Location-aware instant search.
Leveraging read rates of passive RFID tags for real-time indoor location tracking.
A filter-based protocol for continuous queries over imprecise location data.
Decomposition-by-normalization (DBN): leveraging approximate functional dependencies for efficient tensor decomposition.
A graph-based approach for ontology population with named entities.
G-SPARQL: a hybrid engine for querying large attributed graphs.
Efficient algorithms for generalized subgraph query processing.
RDF pattern matching using sortable views.
Interpreting keyword queries over web knowledge bases.
Cross-argument inference for implicit discourse relation recognition.
Fast multi-task learning for query spelling correction.
Visualizing timelines: evolutionary summarization via iterative reinforcement between text and image streams.
Topic-driven reader comments summarization.
One seed to find them all: mining opinion features via association.
Gelling, and melting, large graphs by edge manipulation.
Density index and proximity search in large graphs.
An analysis of how ensembles of collective classifiers improve predictions in graphs.
Multi-scale link prediction.
Graph classification: a diversified discriminative feature selection approach.
Content-based crowd retrieval on the real-time web.
Social book search: comparing topical relevance judgements and book suggestions for evaluation.
Generating event storylines from microblogs.
Making your interests follow you on twitter.
Twevent: segment-based event detection from tweets.
Constructing test collections by inferring document relevance via extracted relevant information.
Incorporating variability in user behavior into systems based evaluation.
Alternative assessor disagreement and retrieval depth.
On caption bias in interleaving experiments.
An analysis of systematic judging errors in information retrieval.
Interactive pattern mining on hidden data: a sampling-based solution.
PARMA: a parallel randomized algorithm for approximate association rules mining in MapReduce.
Incorporating occupancy into frequent pattern mining for high quality pattern recommendation.
A general framework to encode heterogeneous information sources for contextual pattern mining.
Mining high utility itemsets without candidate generation.
Social contextual recommendation.
MEET: a generalized framework for reciprocal recommender systems.
Dynamic covering for recommendation systems.
DQR: a probabilistic approach to diversified query recommendation.
LogUCB: an explore-exploit algorithm for comments recommendation.
Compressed data structures with relevance.
Learning similarity measures based on random walks.
User engagement: the network effect matters!