Karim Abou-Moustafa

Research Scientist
Amazon.com Inc.


Vita |  Research |  Publications |  Awards |  Activities |  Contact 

View Karim Abou-Moustafa's Google Schoolar profile    View Karim Abou-Moustafa's ORCID profile    View Karim Abou-Moustafa's dplp profile    View Karim Abou-Moustafa's LinkedIn profile


[Vita]

As of January 2020, I'm a research scientist in the Alexa AI team at Amazon.com in Seattle, Washington. I earned my Ph.D. in Electrical and Computer Engineering from McGill University in 2012.


[Research]

I work on various aspects of statistical learning algorithms and their confluence with problems in high-dimensional data analysis and pattern recognition. Topics that I have worked on include linear dimensionality reduction, metric learning, manifold learning, kernel methods, and generative-discriminative models for time-series data.

Recently, I have been working on some aspects of learning theory, in particular, stability of learning algorithms and its ability to derive empirical, yet strong guarantees on the generalization performance of learning algorithms. This is intimately related to questions of model selection and error estimation in applied machine learning scenarios.

[Updates]

  • I'm moving to Amazon Alexa AI team in January 2020!
  • Two papers accepted; one for ALT 2019 and one for AAAI 2019. In a joint work with Csaba Szepesvári, we derive new high probability bounds for the concentration of the leave-one-out estimate, and the empirical estimate, using a weak but realizable notion of algorithmic stability. A long version (with all proofs) was accepted for ALT 2019, and a short version was accepted for AAAI 2019.

  • A preliminary result on using exponential Efron-Stein inequalities to derive high probability bounds for the k-folds cross validation estimate is on arXiv [arXiv:1706.05801]. A joint work with Csaba Szepesvári.


[Preprints]

  • Karim Abou-Moustafa and Csaba Szepesvári
    "An a Priori Exponential Tail Bound for k-Folds Cross-Validation",
    [arXiv:1706.05801], 2017.

[Refereed Publications]

  • Karim Abou-Moustafa and Csaba Szepesvári
    "An Exponential Efron-Stein Inequality for Lq Stable Learning Rules",
    Algorithmic Learning Theory (ALT), Proceedings of Machine Learning Research, Vol. 98, pp. 31-63, 2019.
    [Online Link] [BibTeX] [PDF]

  • Karim Abou-Moustafa and Csaba Szepesvári
    "An Exponential Tail bound for the Deleted Estimate",
    Thirty-Third AAAI Conference on Artificial Intelligence, 2019.
    [BibTeX] [PDF]

  • Karim Abou-Moustafa and Frank Ferrie
    "Local Generalized Quadratic Distance Metrics: Application to the k-nearest neighbors classifier",
    Journal of Advances in Data Analysis and Classification (ADAC), Vol. 12, No. 2, pp. 341-363, 2018.
    [Online Link] [BibTeX] [PDF]

  • Karim Abou-Moustafa and Dale Schuurmans
    "Generalization in Unsupervised Learning",
    European Conference on Machine Learning (ECML), 2015.
    [BibTeX] [PDF]

  • Karim Abou-Moustafa, Fernando De La Torre, and Frank Ferrie
    "Pareto Models for Multiclass Discriminative Linear Dimensionality Reduction",
    Pattern Recognition, Vol. 48, No. 5, pp. 1863-1877, 2015.
    [BibTeX] [PDF]

  • Karim Abou-Moustafa, Dale Schuurmans, and Frank Ferrie
    "Learning a Metric Space for Neighbourhood Topology Estimation",
    Asian Conf. on Machine Learning, JMLR W&CP 29: pp. 341-356, 2013.
    [BibTeX] [PDF]

  • Karim Abou-Moustafa, Frank Ferrie and Dale Schuurmans
    "Divergence Based Graph Estimation for Manifold Learning",
    IEEE Global Conf. on Signal and Information Processing, Austin, TX, Dec. 2013.
    [BibTeX] [PDF]

  • Karim Abou-Moustafa and Frank Ferrie,
    "A Note on Metric Properties for Some Divergence Measures: The Gaussian Case",
    Asian Conf. on Machine Learning, JMLR W&CP 25: pp. 1-15, 2012.
    [BibTeX] [PDF]

  • Karim Abou-Moustafa and Frank Ferrie,
    "Modified Divergence Measures for Gaussian Densities",
    LNCS 7626, Proc. of the IAPR Int. Workshop on Structural, Syntactic, Statistical Pattern Recognition (S+SSPR), pp. 426-436, Springer 2012.
    [BibTeX] [PDF]

  • Karim Abou-Moustafa and Frank Ferrie,
    "A Framework for Hypothesis Learning Over Sets of Vectors",
    Proc. of ACM's SIGKDD 9th Workshop on Mining and Learning with Graphs, 2011.
    [BibTeX] [PDF]

  • Karim Abou-Moustafa, Mohak Shah, Fernando De La Torre, and Frank Ferrie,
    "Relaxed Exponential Kernels for Unsupervised Learning",
    LNCS 6835, Pattern Recognition, Proc. of the 33rd DAGM Symposium, pp. 184-195, Springer, 2011.
    [BibTeX] [PDF]

  • Karim Abou-Moustafa, Fernando De La Torre, and Frank Ferrie,
    "Designing a Metric for the Difference Between Two Gaussian Densities",
    Advances in Intelligent and Soft Computing; J. Angeles, B. Boulet, J. Clark, J. Kovecses and K. Siddiqi (Eds.), Vol. 83, pp. 57 - 70, Springer, Dec. 2010.
    [BibTeX] [PDF]

  • Karim Abou-Moustafa, Fernando De La Torre, and Frank Ferrie,
    "Pareto Discriminant Analysis",
    IEEE Conf. on Computer Vision and Pattern Recognition (CVPR), pp. 3602 - 3609, 2010.
    [BibTeX] [PDF] [and here]

  • Karim Abou-Moustafa and Frank Ferrie,
    "Local Metric Learning on Manifolds with Applications to Query-based Operations",
    LNCS 5342, Proc. of the IAPR Int. Workshop on Structural, Syntactic, Statistical Pattern Recognition (S+SSPR), pp. 872 - 838, Springer, 2008.
    [BibTeX] [PDF]

  • Karim Abou-Moustafa and Frank Ferrie,
    "Fast and Regularized Local Metric for Query-based Operations",
    IEEE Proc. of the 19th Int. Conf. on Pattern Recognition (ICPR), 2008.
    [BibTeX] [PDF]

  • Karim Abou-Moustafa and Frank Ferrie,
    "The Minimum Volume Ellipsoid Metric",
    LNCS 4713, Pattern Recognition, Proc. of the 29th DAGM Symposium, pp. 335 - 344, Springer, 2007.
    [BibTeX] [PDF]

  • Karim Abou-Moustafa, Mohamed Cheriet, and Ching Suen,
    "Classification of Time-Series Data Using a Generative/Discriminative Hybrid",
    IEEE Proc. of the 9th Int. Workshop on Frontiers in Handwriting Recognition (IWFHR), pp. 51 - 56, 2004.
    [BibTeX] [PDF]

  • Karim Abou-Moustafa, Mohamed Cheriet, and Ching Suen,
    "A Generative-Discriminative Hybrid for Sequential Data Classification",
    IEEE Proc. of the Int. Conf. on Acoustics, Speech & Signal Processing (ICASSP), Vol. 5, pp. 805 - 808, 2004.
    [BibTeX] [PDF]

  • Karim Abou-Moustafa, Mohamed Cheriet, and Ching Suen,
    "On The Structure of Hidden Markov Models",
    Pattern Recognition Letters, Vol. 25, pp. 923 - 931, June 2004.
    [BibTeX] [PDF]

[Non-Refereed Publications]

  • Karim Abou-Moustafa,
    "What is The Distance Between Objects in a Data Set?",
    IEEE Pulse Magazine, Vol. 7, No. 2, pp. 41-47, March-April, 2016.
    [BibTeX] [link]

  • Karim Abou-Moustafa,
    "On Derivatives of Eigenvalues and Eigenvectors of the Generalized Eigenvalue Problem",
    McGill Tech. Report No. TR-CIM-10-09, 2009.
    [BibTeX] [PDF]

[Theses]

  • Karim T. Abou-Moustafa, "Metric Learning Revisited. New Approaches for Supervised and Unsupervised Metric Learning with Analysis and Algorithms", Ph.D. Thesis, McGill University, Montréal, QC, Canada, 2011.

  • Karim T. Abou-Moustafa, "A Generative-Discriminative Framework for Time-Series Data Classification", Masters Thesis, Concordia University, Montréal, QC, Canada, 2004.


[Awards]

  • FQRNT Postdoctoral Fellowship, 2011 - 2013.

    From "Le Fonds Quebecois de la Recherche sur la Nature et les Technologies" (FQRNT).

  • FQRNT - REPARTI Scholarship for International Training, 2009.

    From "Le Fonds Quebecois de la Recherche sur la Nature et les Technologies" (FQRNT), and "Le Regroupement strategique pour l'etude des Environnments Partages Intelligents" (REPARTI).

  • PRECARN Student Scholarship, 2004.

    From The Institute of Robotics and Intelligent Systems (IRIS) and PRECARN Incorporated.


[Activities]


[Contact Information]

Mailing Address

Amazon.com Inc.
300 Pine street
Seattle, WA 98181, U.S.A