
CIKM 2011 paper list

Proceedings of the 20th ACM Conference on Information and Knowledge Management, CIKM 2011, Glasgow, United Kingdom, October 24-28, 2011.

Social and collaborative information seeking: panel.
DOLAP 2011: overview of the 14th international workshop on data warehousing and OLAP.
LSDS-IR'11: the 9th workshop on large-scale and distributed systems for information retrieval.
Fourth workshop on exploiting semantic annotations in information retrieval (ESAIR).
Search and mining entity-relationship data.
Report on the third international workshop on cloud data management (CloudDB 2011).
Managing interoperability and complexity in health systems: MIXHS'11 workshop summary.
PIKM 2011: the 4th ACM workshop for Ph.D. students in information and knowledge management.
DESIRE 2011: first international workshop on data infrastructures for supporting information retrieval evaluation.
3rd international workshop on collaborative information retrieval (CIR2011).
Web science and information exchange in the medical web.
Overview of the third international workshop on search and mining user-generated contents.
4th international workshop on patent information retrieval (PaIR'11).
Detect'11: international workshop on DETecting and Exploiting Cultural diversiTy on the social web.
BooksOnline'11: 4th workshop on online books, complementary social media, and crowdsourcing.
DTMBIO 2011: international workshop on data and text mining in biomedical informatics.
Uncertain schema matching: the power of not knowing.
Object ranking.
Information retrieval challenges in computational advertising.
Information diffusion in social networks: observing and affecting what society cares about.
Advances in data stream mining for mobile and ubiquitous environments.
Web-based open-domain information extraction.
Statistical information retrieval modelling: from the probability ranking principle to recent advances in diversity, portfolio theory, and beyond.
Large-scale information retrieval experimentation with Terrier.
Large-scale array analytics: taming the data tsunami.
Computational geography.
P2Prec: a social-based P2P recommendation system.
PICASSO: automated soundtrack suggestion for multi-modal data.
Entity timelines: visual analytics and named entity evolution.
Annotating knowledge work lifelog: term extraction from sensor and operation history.
Health conversational system based on contextual matching of community-driven question-answer pairs.
H-DB: a hybrid quantitative-structural SQL optimizer.
MEMSCALE: in-cluster-memory databases.
PDFMeat: managing publications on the semantic desktop.
Fu-Finder: a game for studying querying behaviours.
Interactive reasoning in uncertain RDF knowledge bases.
Conkar: constraint keyword-based association discovery.
RoSeS: a continuous query processor for large-scale RSS filtering and aggregation.
Scalable similarity search of timeseries with variable dimensionality.
Jasmine: a real-time local-event detection system based on geolocation information propagated to microblogs.
Marco Polo: a system for brand-based shopping and exploration.
Editing knowledge resources: the wiki way.
An integrated environment for semantic knowledge work.
Data-thirsty business analysts need SODA: search over data warehouse.
A data mining system based on SQL queries and UDFs for relational databases.
Black swan: augmenting statistics with event data.
Exploratory search over social-medical data.
Coarse-to-fine classification via parametric and nonparametric models for computer-aided diagnosis.
Simultaneously improving CSAT and profit in a retail banking organization.
A machine-learned proactive moderation system for auction fraud detection.
Accurate information extraction for quantitative financial events.
Domain customization for aspect-oriented opinion analysis with multi-level latent sentiment clues.
Sentiment classification via l2-norm deep belief network.
Predicting the uncertainty of sentiment adjectives in indirect answers.
OpinioNetIt: understanding the opinions-people network for politically controversial topics.
Question identification on Twitter.
The where in the tweet.
Imbalanced sentiment classification.
Enhancing accessibility of microblogging messages using semantic knowledge.
Classifying trending topics: a typology of conversation triggers on Twitter.
Leveraging web 2.0 data for scalable semi-supervised learning of domain-specific sentiment lexicons.
A pretopological framework for the automatic construction of lexical-semantic structures from texts.
Insert-friendly XML containment labeling scheme.
AWETO: efficient incremental update and querying in RDF storage system.
Efficient association discovery with keyword-based constraints on large graph data.
Folksonomy-based term extraction for word cloud generation.
An algorithm for axiom pinpointing in EL+ and its incremental variant.
ONTOCUBE: efficient ontology extraction using OLAP cubes.
A taxonomy of local search: semi-supervised query classification driven by information needs.
Rule-based construction of matching processes.
Efficient query rewrite for structured web queries.
k-Nearest neighbor query processing method based on distance relation pattern.
Subject-oriented top-k hot region queries in spatial dataset.
A continuous query evaluation scheme for a detection-only query over data streams.
PCMLogging: reducing transaction logging overhead with PCM.
Block-based load balancing for entity resolution with MapReduce.
A cluster based mobile peer to peer architecture in wireless ad hoc networks.
Continuous data stream query in the cloud.
On the elasticity of NoSQL databases over cloud management platforms.
Defining isochrones in multimodal spatial networks.
Top-k most influential locations selection.
Processing the signature quadratic form distance on many-core GPU architectures.
Integrating and querying web databases and documents.
A robust index for regular expression queries.
Collection-based compression using discovered long matching strings.
Predicting the optimal ad-hoc index for reachability queries on graph databases.
Scalable entity matching computation with materialization.
Probabilistic model for discovering topic based communities in social networks.
Towards noise-resilient document modeling.
WikiLabel: an encyclopedic approach to labeling documents en masse.
Named entity recognition using a modified Pegasos algorithm.
A geographic study of tie strength in social media.
SILA: a spatial instance learning approach for deep webpages.
Mining frequent patterns across multiple data streams.
More or better: on trade-offs in compacting textual problem solution repositories.
Two birds with one stone: learning semantic models for text categorization and word sense disambiguation.
Detection of text quality flaws as a one-class classification problem.
Examining the "leftness" property of Wikipedia categories.
Suggesting ghost edges for a smaller world.
On selection of objective functions in multi-objective community detection.
DIGRank: using global degree to facilitate ranking in an incomplete graph.
Authormagic: an approach to author disambiguation in large-scale digital libraries.
LSH based outlier detection and its application in distributed setting.
Switch detector: an activity spotting system for desktop.
Privacy preserving feature selection for distributed data using virtual dimension.
Utility-driven anonymization in data publishing.
More influence means less work: fast latent Dirichlet allocation by influence scheduling.
YANA: an efficient privacy-preserving recommender system for online social communities.
A semi-supervised hybrid system to enhance the recommendation of channels in terms of campaign ROI.
User oriented tweet ranking: a filtering approach to microblogs.
Structured collaborative filtering.
Improving k-nearest neighbors algorithms: practical application of dataset analysis.
Review recommendation: personalized prediction of the quality of online reviews.
Discovering trending phrases on information streams.
CoRankBayes: Bayesian learning to rank under the co-training framework and its application in keyphrase extraction.
Constructing efficient information extraction pipelines.
Fast supervised feature extraction by term discrimination information pooling.
Building a generic debugger for information extraction pipelines.
Joint inference for cross-document information extraction.
Towards expert finding by leveraging relevant categories in authority ranking.
Mining query structure from click data: a case study of product queries.
A diversity measure leveraging domain specific auxiliary information.
KLEAP: an efficient cleaning method to remove cross-reads in RFID streams.
Learning kernels with upper bounds of leave-one-out error.
Latent feature encoding using dyadic and relational data.
Using random walks for multi-label classification.
Hierarchy evolution for improved classification.
A partitioning method for symbolic interval data based on kernelized metric.
Promotional subspace mining with EProbe framework.
Finding redundant and complementary communities in multidimensional networks.
Representing document as dependency graph for document clustering.
A probabilistic approach to nearest-neighbor classification: naive hubness bayesian kNN.
Transfer active learning.
Structured data classification by means of matrix factorization.
Do they belong to the same class: active learning by querying pairwise label homogeneity.
Attention prediction on social media brand pages.
Collaborative blacklist generation via searches-and-clicks.
Citation chain aggregation: an interaction model to support citation cycling.
Spectral analysis of a blogosphere.
Beyond precision@10: clustering the long tail of web search results.
Collaborative exploratory search in real-world context.
A personalized recommendation system on scholarly publications.
User action interpretation for personalized content optimization in recommender systems.
Supervised matching of comments with news article segments.
Advertiser-centric approach to understand user click behavior in sponsored search.
Inferring query aspects from reformulations using clustering.
Relative effect of spam and irrelevant documents on user interaction with search engines.
Beyond relevance in marketplace search.
Leveraging Wikipedia concept and category information to enhance contextual advertising.
Constructing seminal paper genealogy.
Efficient retrieval of 3D building models using embeddings of attributed subgraphs.
Image clustering fusion technique based on BFS.
Robust video fingerprinting based on hierarchical symmetric difference feature.
Tightly coupling visual and linguistic features for enriching audio-based web browsing experience.
Re-ranking by local re-scoring for video indexing and retrieval.
Efficient lp-norm multiple feature metric learning for image categorization.
Context-aware query recommendation by learning high-order relation in query logs.
Supervised language modeling for temporal resolution of texts.
Learning to rank categories for web queries.
Predicting document effectiveness in pseudo relevance feedback.
Learning to rank with cross entropy.
Smoothing NDCG metrics using tied scores.
CoDet: sentence-based containment detection in news corpora.
Fact-based question decomposition for candidate answer re-ranking.
Question routing in community question answering: putting category in its place.
Automatic query reformulation with syntactic operators to alleviate search difficulty.
CQC: classifying questions in CQA websites.
Learning to recommend questions based on public interest.
A novel framework of training hidden Markov support vector machines from lightly-annotated data.
Extracting adjective facets from community Q&A corpus.
Recommending citations with translation model.
Semantic convolution kernels over dependency trees: smoothed partial tree kernel.
Topic modeling for named entity queries.
Trained trigger language model for sentence retrieval in QA: bridging the vocabulary gap.
Efficient phrase querying with flat position index.
Index tuning for query-log based on-line index maintenance.
When close enough is good enough: approximate positional indexes for efficient ranked retrieval.
An unsupervised ranking method based on a technical difficulty terrain.
Adaptive term frequency normalization for BM25.
Hybrid models for future event prediction.
Diverse retrieval via greedy optimization of expected 1-call@k in a latent subtopic relevance model.
On relevance, time and query expansion.
Selecting related terms in query-logs using two-stage SimRank.
On bias problem in relevance feedback.
Insights into explicit semantic analysis.
Relevance feedback exploiting query-specific document manifolds.
Patent query reduction using pseudo relevance feedback.
Recency ranking by diversification of result set.
A nugget-based test collection construction paradigm.
Worker types and personality traits in crowdsourcing relevance labels.
Effectiveness beyond the first crawl tier.
Google, Bing and a new perspective on ranking similarity.
Understanding the types of information humans associate with geographic objects.
An efficient method for using machine translation technologies in cross-language patent search.
Item categorization in the e-commerce domain.
HealthTrust: trust-based retrieval of YouTube's diabetes channels.
RerankEverything: a reranking interface for exploring search results.
A peer's-eye view: network term clouds in a peer-to-peer system.
Diversification for multi-domain result sets.
Search result diversification for enterprise data.
Privacy protected knowledge management in services with emphasis on quality data.
Extract knowledge from semi-structured websites for search task simplification.
Information extraction from pathology reports in a hospital setting.
Generating links to background knowledge: a case study using narrative radiology reports.
Exploring the corporate ecosystem with a semi-supervised entity graph.
Enriching textbooks with images.
Effects of search success on search engine re-use.
Social ranking for spoken web search.
Evolving social search based on bookmarks and status messages from social networks.
Large-scale behavioral targeting with a social twist.
Learning to target: what works for behavioral targeting.
CP-index: on the efficient indexing of large graphs.
Fast fully dynamic landmark-based estimation of shortest path distances in very large graphs.
Skynets: searching for minimum trees in graphs with incomparable edge weights.
DELTA: indexing and querying multi-labeled graphs.
High efficiency and quality: large graphs matching.
Spreadsheet-based complex data transformation.
RFID data analysis using tensor calculus for supply chain management.
Approximate tensor decomposition within a tensor-relational algebraic framework.
Cost-efficient repair in inconsistent probabilistic databases.
Context-based entity description rule for entity resolution.
The quality of the XML web.
TEXplorer: keyword-based object search and exploration in multidimensional text databases.
Adding structure to top-k: from items to expansions.
Learning to rank results in relational keyword search.
Efficient similarity search: arbitrary similarity measures, arbitrary composition.
Ranking support for keyword search on structured data using relevance models.
Provenance-based refresh in data-oriented workflows.
Supporting queries spanning across phases of evolving artifacts using Steiner forests.
A parallel algorithm for computing borders.
Tractable XML data exchange via relations.
I/O-efficient algorithms for answering pattern-based aggregate queries in a sequence OLAP system.
On benchmarking data translation systems for semantic-web ontologies.
Context-based people search in labeled social networks.
The list Viterbi training algorithm and its application to keyword search over databases.
Answering label-constraint reachability in large graphs.
Matching query processing in high-dimensional space.
Authentication of location-based skyline queries.
Multiple keyword-based queries over XML streams.
Continuously monitoring the correlations of massive discrete streams.
Advancing the discovery of unique column combinations.
Semantic data markets: a flexible environment for knowledge management.
Information re-finding by context: a brain memory inspired approach.
ReDRIVE: result-driven database exploration through recommendations.
Categorising logical differences between OWL ontologies.
Learning-based relevance feedback for web-based relation completion.
XQuery optimization based on program slicing.
Optimized processing of multiple aggregate continuous queries.
Index structures and top-k join algorithms for native keyword search databases.
Evaluation of set-based queries with aggregation constraints.
Semi-indexing semi-structured data in tiny space.
Efficient methods for finding influential locations with adaptive grids.
Finding information nebula over large networks.
Effective stratification for low selectivity queries on deep web data sources.
Efficient resource attribute retrieval in RDF triple stores.
Estimating selectivity for joined RDF triple patterns.
Finding all justifications of OWL entailments using TMS and MapReduce.
Facilitating pattern discovery for relation extraction with semantic-signature-based clustering.
Filtering and clustering relations for unsupervised information extraction in open domain.
Automated feature generation from structured knowledge.
Sparse structured probabilistic projections for factorized latent spaces.
Discovering customer intent in real-time for streamlining service desk conversations.
Behavior-driven clustering of queries into topics.
External evaluation measures for subspace clustering.
Simultaneous joint and conditional modeling of documents tagged from two perspectives.
Asking what no one has asked before: using phrase similarities to generate synthetic web search queries.
Perspective hierarchical Dirichlet process for user-tagged image modeling.
Hierarchical tag visualization and application for tag recommendations.
Large-scale question classification in cQA by leveraging Wikipedia semantic knowledge.
Finding dimensions for queries.
Max margin learning on domain-independent web information extraction.
Mining entity translations from comparable corpora: a holistic graph mapping approach.
Enabling information extraction by inference of regular expressions from sample entities.
Fast metadata-driven multiresolution tensor decomposition.
Towards a unified solution: data record region detection and segmentation.
Extracting collective expectations about the future from large text collections.
Extracting cross references from life science databases for search result ranking.
Citation count prediction: learning to estimate future citations for literature.
Combining machine learning and human judgment in author disambiguation.
Studying how the past is remembered: towards computational history through large scale text mining.
Plagiarism detection based on structural information.
Classification and annotation in social corpora using multiple relations.
Distributed social graph embedding.
Extracting multi-dimensional relations: a generative model of groups of entities in a corpus.
Detecting anomalies in graphs with numeric labels.
Determining the diameter of small world networks.
Practical representations for web and social graphs.
Towards feature selection in network.
Temporal link prediction by integrating content and structure information.
Structural link analysis and prediction in microblogs.
Exploiting longer cycles for link prediction in signed networks.
Link prediction: the power of maximal entropy random walk.
Who will follow you back?: reciprocal relationship prediction.
Collective prediction with latent graphs.
Probabilistic near-duplicate detection using simhash.
MTopS: scalable processing of continuous top-k multi-query workloads.
Pattern change discovery between high dimensional data sets.
Correlated multi-label feature selection.
Scalable density-based subspace clustering.
A query-based multi-document sentiment summarizer.
Polarity analysis of texts using discourse structure.
Using games with a purpose and bootstrapping to create domain-specific sentiment lexicons.
A cross-domain adaptation method for sentiment classification using probabilistic latent analysis.
Language-independent sentiment classification using three common words.
Topic sentiment analysis in Twitter: a graph-based hashtag sentiment classification approach.
Do all birds tweet the same?: characterizing Twitter around the world.
Connecting users with similar interests via tag network inference.
Mining direct antagonistic communities in explicit trust networks.
CASINO: towards conformity-aware social influence analysis in online social networks.
Improving user interest inference from social neighbors.
Content based social behavior prediction: a multi-task learning approach.
Discovering top-k teams of experts with/without a leader in social networks.
Feature selection using hierarchical feature clustering.
Coupled nominal similarity in unsupervised learning.
Memory-less unsupervised clustering for data streaming by versatile ellipsoidal function.
Semi-supervised multi-task learning of structured prediction models for web information extraction.
Toward interactive training and evaluation.
Can irrelevant data help semi-supervised learning, why and how?
Privacy preservation by independent component analysis and variance control.
Recommendation in the end-to-end encrypted domain.
Privacy preserving indexing for eHealth information networks.
Privacy-aware querying over sensitive trajectory data.
Cloning for privacy protection in multiple independent data publications.
Summarizing web forum threads based on a latent topic propagation process.
Accounting for data dependencies within a hierarchical Dirichlet process mixture model.
Learning conditional random fields with latent sparse features for acronym expansion finding.
From names to entities using thematic context distance.
Towards a top-down and bottom-up bidirectional approach to joint information extraction.
Harvesting facts from textual web sources by constrained label propagation.
Optimising ontology stream reasoning with truth maintenance system.
e-NSP: efficient negative sequential pattern mining based on identified positive patterns without database rescanning.
CLUES: a unified framework supporting interactive exploration of density-based clusters in streams.
Toward traffic-driven location-based web search.
Coupling or decoupling for KNN search on road networks?: a hybrid framework on user query patterns.
LogSig: generating system events from raw textual logs.
Transferring topical knowledge from auxiliary long texts for short text clustering.
Natural event summarization.
Focusing on novelty: a crawling strategy to build diverse language models.
Emerging topic detection using dictionary learning.
Diversification and refinement in collaborative filtering recommender.
Modeling personalized email prioritization: classification-based and regression-based approaches.
Assisting web search users by destination reachability.
Timing when to buy.
Bayesian latent variable models for collaborative item rating prediction.
Designing an ensemble classifier over subspace classifiers using iterative convergence routine.
TAKES: a fast method to select features in the kernel space.
Robust nonnegative matrix factorization using L21-norm.
A pairwise ranking based approach to learning with positive and unlabeled examples.
Semi-supervised SVMs for classification with unknown class proportions and a small labeled dataset.
Evaluating an associative browsing model for personal information.
Prioritizing relevance judgments to improve the construction of IR test collections.
Local computation of PageRank: the ranking side.
Click the search button and be happy: evaluating direct and immediate information access.
Simulating simple user behavior for system effectiveness evaluation.
Learning to rank audience for behavioral targeting in display ads.
A language model approach to capture commercial intent and information relevance for sponsored search.
Retrieval models for audience selection in display advertising.
Using query log and social tagging to refine queries based on latent topics.
A framework for personalized and collaborative clustering of search results.
Context-aware search personalization with concept preference.
Exploring categorization property of social annotations for information retrieval.
Content-driven detection of campaigns in social media.
Effective retrieval of resources in folksonomies using a new tag similarity measure.
Workload-aware indexing for keyword search in social networks.
Building directories for social tagging systems.
Towards a framework for attribute retrieval.
A linear-time approximation of the earth mover's distance.
Adaptive parallel approximate similarity search for responsive multimedia retrieval.
Retrieving and ranking unannotated images through collaboratively mining online search results.
This image smells good: effects of image information scent in search engine results pages.
Partial duplicate detection for large book collections.
Indexes for highly repetitive document collections.
SISP: a new framework for searching the informative subgraph based on PSO.
Duplicate detection through structure optimization.
One is enough: distributed filtering for duplicate elimination.
Text vs. space: efficient geo-search query processing.
Location-aware click prediction in mobile local search.
Personalizing web search results by reading level.
What and how children search on the web.
Legal document clustering with built-in topic segmentation.
Sentiment classification based on supervised latent n-gram analysis.
Effective and efficient polarity estimation in blogs based on sentence-level evidence.
Passage retrieval for incorporating global evidence in sequence labeling.
Statistical source expansion for question answering.
Implementation techniques for large-scale latent semantic indexing applications.
TOPSIG: topology preserving document signatures.
Factorization-based lossless compression of inverted indices.
SIMD-based decoding of posting lists.
Efficiently encoding term co-occurrences in inverted indexes.
Efficiency optimizations for interpolating subqueries.
Structured learning of two-level dynamic rankings.
Collaborative online learning of user generated content.
Simultaneous clustering of multi-type relational data via symmetric nonnegative matrix tri-factorization.
Semi-supervised learning to rank with preference regularization.
Intent-aware query similarity.
A probabilistic method for inferring preferences from clicks.
Frequency-aware similarity measures: why Arnold Schwarzenegger is always a duplicate.
Keyword search over RDF graphs.
Ranking-based processing of SQL queries.
Tag clouds revisited.
Coreference aware web object retrieval.
Learning to aggregate vertical results into web search results.
Learning to rank user intent.
Finding images of difficult entities in the long tail.
Searching microblogs: coping with sparsity and document quality.
Reranking search results for sparse queries.
Interactive sense feedback for difficult queries.
Discovering missing click-through query language information for web search.
Query session detection as a cascade.
Query sampling for learning data fusion.
Multi-view random walk framework for search task discovery from click-through log.
A task level metric for measuring web search satisfaction and its application on improving relevance estimation.
Improving context-aware query classification via adaptive self-training.
Suggestion set utility maximization using session logs.
Relevance weighting using within-document term statistics.
Diversifying search results of controversial queries.
User browsing behavior-driven web crawling.
Discovering URLs through user feedback.
Assigning documents to master sites in distributed search.
Unsupervised transactional query classification based on webpage form understanding.
Finding relevant information of certain types from enterprise data.
S3K: seeking statement-supporting top-K witnesses.
Improving retrieval accuracy of difficult queries through generalizing negative document language models.
A quasi-synchronous dependence model for information retrieval.
Lower-bounding term frequency normalization.
Ontology-based data management.
Data, health, and algorithmics: computational challenges for biomedicine.
Creating user interfaces that entice people to manage better information.