Search

Jimeng Sun Phones & Addresses

  • Las Vegas, NV
  • Atlanta, GA
  • Cambridge, MA
  • Scarsdale, NY
  • White Plains, NY
  • Elmsford, NY
  • San Jose, CA
  • Pittsburgh, PA

Work

Company: Ibm tj watson research center Jan 2010 Position: Research staff member

Education

Degree: PhD School / High School: Carnegie Mellon University 2003 to 2007 Specialities: Computer Science

Skills

Data Mining • Machine Learning • Algorithms • Text Mining • Data Analysis • Analytics • Predictive Modeling • Predictive Analytics • Databases • Distributed Systems • Hadoop • Informatics • Natural Language Processing • Big Data • Medical Informatics • Visual Analytics • Business Analytics • Data Warehousing • Social Networking • Network Analysis • Health Analytics • Deep Learning

Interests

Data Mining • Machine Learning • Social Network Analysis • Databases

Industries

Research

Resumes

Resumes

Jimeng Sun Photo 1

Professor

View page
Location:
Atlanta, GA
Industry:
Research
Work:
IBM TJ Watson research center since Jan 2010
Research Staff Member

IBM TJ Watson research center Oct 2007 - Dec 2009
Research Staff Member

Carnegie Mellon University 2003 - 2007
PhD student

IBM TJ Watson Research Center May 2006 - Aug 2006
Summer Intern

PricewaterhouseCoopers May 2005 - Aug 2005
Summer Intern
Education:
Carnegie Mellon University 2003 - 2007
PhD, Computer Science
Hong Kong University of Science and Technology - School of Business and Management 1999 - 2003
BS, Computer Science
The Hong Kong University of Science and Technology 1999 - 2002
B.S, computer science
Skills:
Data Mining
Machine Learning
Algorithms
Text Mining
Data Analysis
Analytics
Predictive Modeling
Predictive Analytics
Databases
Distributed Systems
Hadoop
Informatics
Natural Language Processing
Big Data
Medical Informatics
Visual Analytics
Business Analytics
Data Warehousing
Social Networking
Network Analysis
Health Analytics
Deep Learning
Interests:
Data Mining
Machine Learning
Social Network Analysis
Databases

Publications

Us Patents

Preserving Privacy Of One-Dimensional Data Streams By Perturbing Data With Noise And Using Dynamic Autocorrelation

View page
US Patent:
7840516, Nov 23, 2010
Filed:
Feb 26, 2007
Appl. No.:
11/678808
Inventors:
Yuan-Chi Chang - New York NY, US
Feifei Li - Boston MA, US
Spyridon Papadimitriou - White Plains NY, US
George A. Mihaila - Yorktown Heights NY, US
Ioana Stanoi - San Jose CA, US
Jimeng Sun - Pittsburgh PA, US
Philip S. Yu - Chappaqua NY, US
Assignee:
International Business Machines Corporation - Armonk NY
International Classification:
G06F 17/00
US Classification:
706 47
Abstract:
A method, information processing system, and computer readable medium are provided for preserving privacy of one-dimensional nonstationary data streams. The method includes receiving a one-dimensional nonstationary data stream. A set of first-moment statistical values are calculated, for a given instant of sub-space of time, for the data. The first moment statistical values include a principal component for the sub-space of time. The data is perturbed with noise along the principal component in proportion to the first-moment of statistical values so that at least part of a set of second-moment statistical values for the data is perturbed by the noise only within a predetermined variance.

Preserving Privacy Of One-Dimensional Data Streams Using Dynamic Correlations

View page
US Patent:
7853545, Dec 14, 2010
Filed:
Feb 26, 2007
Appl. No.:
11/678786
Inventors:
Yuan-Chi Chang - New York NY, US
Feifei Li - Boston MA, US
Spyridon Papadimitriou - White Plains NY, US
George A. Mihaila - Yorktown Heights NY, US
Ioana Stanoi - San Jose CA, US
Jimeng Sun - Pittsburgh PA, US
Philip S. Yu - Chappaqua NY, US
Assignee:
International Business Machines Corporation - Armonk NY
International Classification:
G06F 17/00
US Classification:
706 47
Abstract:
Disclosed is a method, information processing system, and computer readable medium for preserving privacy of nonstationary data streams. The method includes receiving at least one nonstationary data stream with time dependent data. Calculating, for a given instant of sub-space of time, A set of first-moment statistical values is calculated, for a given instant of sub-space of time, for the data. The first moment statistical values include a principal component for the sub-space of time. The data is perturbed with noise along the principal component in proportion to the first-moment of statistical values so that at least part of a set of second-moment statistical values for the data is perturbed by the noise only within a predetermined variance.

Content-Based And Time-Evolving Social Network Analysis

View page
US Patent:
8204988, Jun 19, 2012
Filed:
Sep 2, 2009
Appl. No.:
12/552812
Inventors:
Ching-Yung Lin - Hawthorne NY, US
Spyridon Papadimitrion - Hawthorne NY, US
Jimeng Sun - Hawthorne NY, US
Assignee:
International Business Machines Corporation - Armonk NY
International Classification:
G06F 15/173
US Classification:
709224, 709220
Abstract:
System and method for modeling a content-based network. The method includes finding single mode clusters from among network (sender and recipient) and content dimensions represented as a tensor data structure. The method allows for derivation of useful cross-mode clusters (interpretable patterns) that reveal key relationships among user communities and keyword concepts for presentation to users in a meaningful and intuitive way. Additionally, the derivation of useful cross-mode clusters is facilitated by constructing a reduced low-dimensional representation of the content-based network. Moreover, the invention may be enhanced for modeling and analyzing the time evolution of social communication networks and the content related to such networks. To this end, a set of non-overlapping or possibly overlapping time-based windows is constructed and the analysis performed at each successive time interval.

System And Method For Composite Distance Metric Leveraging Multiple Expert Judgments

View page
US Patent:
8566268, Oct 22, 2013
Filed:
Mar 23, 2011
Appl. No.:
13/070084
Inventors:
Shahram Ebadollahi - White Plains NY, US
Jimeng Sun - White Plains NY, US
Fei Wang - Ossining NY, US
Assignee:
International Business Machines Corporation - Armonk NY
International Classification:
G06F 17/00
G06N 5/04
US Classification:
706 48
Abstract:
A system and method for a composite distance metric leveraging multiple expert judgments includes inputting a data distribution of multiple expert judgments stored on a computer readable storage medium. Base distance metrics are converted into neighborhoods for comparison, wherein each base distance metric represents an expert. The neighborhoods are combined to leverage the local discriminalities of all base distance metrics by applying at least one iterative process to output a composite distance metric.

Mining Temporal Patterns In Longitudinal Event Data Using Discrete Event Matrices And Sparse Coding

View page
US Patent:
8583586, Nov 12, 2013
Filed:
Jan 21, 2011
Appl. No.:
13/011632
Inventors:
Shahram Ebadollahi - White Plains NY, US
Jianying Hu - Bronx NY, US
Martin S. Kohn - East Hills NY, US
Noah Lee - New York NY, US
Robert K. Sorrentino - Rancho Palos Verdes CA, US
Jimeng Sun - White Plains NY, US
Fei Wang - San Jose CA, US
Assignee:
International Business Machines Corporation - Armonk NY
International Classification:
G06N 5/02
US Classification:
706 53, 706 20
Abstract:
Methods and systems for event pattern mining are shown that include representing longitudinal event data in a measurable geometric space as a temporal event matrix representation (TEMR) using spatial temporal shapes, wherein event data is organized into hierarchical categories of event type and performing temporal event pattern mining with a processor by locating visual event patterns among the spatial temporal shapes of said TEMR using a constraint sparse coding framework.

Systems And Methods For Simultaneous Summarization Of Data Cube Streams

View page
US Patent:
20080168375, Jul 10, 2008
Filed:
Jan 7, 2007
Appl. No.:
11/620679
Inventors:
Spyridon Papadimitriou - White Plains NY, US
Jimeng Sun - Pittsburgh PA, US
Philip S. Yu - Chappaqua NY, US
Assignee:
INTERNATIONAL BUSINESS MACHINES CORPORATION - Armonk NY
International Classification:
G06F 3/048
US Classification:
715772
Abstract:
In an exemplary embodiment, some of the main aspects of the present invention are the following: (i) Data model: We introduce tensor streams to deal with large collections of multi-aspect streams; and (ii) Algorithmic framework: We propose window-based tensor analysis (WTA) to effectively extract core patterns from tensor streams. The tensor representation is related to data cube in On-Line Analytical Processing (OLAP). However, our present invention focuses on constructing simple summaries for each window, rather than merely organizing the data to produce simple aggregates along each aspect or combination of aspects.

Analyzing Parallel Topics From Correlated Documents

View page
US Patent:
20110202484, Aug 18, 2011
Filed:
Feb 18, 2010
Appl. No.:
12/708053
Inventors:
Nikolaos Anerousis - Chappaqua NY, US
Abhijit Bose - Paramus NJ, US
Jimeng Sun - White Plains NY, US
Duo Zhang - Urbana IL, US
Assignee:
INTERNATIONAL BUSINESS MACHINES CORPORATION - Armonk NY
International Classification:
G06F 15/18
G06N 5/02
US Classification:
706 12, 706 52
Abstract:
Access is obtained to a parallel corpus including a problem corpus and a solution corpus. A first plurality of topics are mined from the problem corpus and a second plurality of topics are mined from the solution corpus. A transition probability from the first plurality of topics to the second plurality of topics is determined, to identify a most appropriate one of the topics from the solution corpus for a given one of the topics from the problem corpus.

System And Method For Predicting Near-Term Patient Trajectories

View page
US Patent:
20120041277, Feb 16, 2012
Filed:
Aug 12, 2010
Appl. No.:
12/855068
Inventors:
SHAHRAM EBADOLLAHI - Tarrytown NY, US
Jianying Hu - Bronx NY, US
Robert K. Sorrentino - Rancho Palos Verdes CA, US
Daby M. Sow - Croton On Hudson NY, US
Jimeng Sun - White Plains NY, US
Assignee:
INTERNATIONAL BUSINESS MACHINES CORPORATION - Armonk NY
International Classification:
A61B 5/00
US Classification:
600301
Abstract:
A system and method for predicting near term measurements of a patient includes a stream processor configured to summarize raw measurements from patients into signatures and construct optimal prediction models based on previously obtained signatures. A similar patient tracker is configured to monitor similar patient information for a query patient. The similar patient information is determined based on a similarity between the query patient and signatures of other patients. A model analyzer is configured to employ retrofitted optimal prediction models from similar patients to predict near term measurements of the query patient.
Jimeng Sun from Las Vegas, NV, age ~44 Get Report