Skip to main content
  1. Data Science Blog/

Github Repos for Data Science

·2837 words·14 mins· loading · ·
Data Science Version Control GitHub Data Science Resources Data Science Resources GitHub
Share with :

Github Repos for Data Science

Github-Repos-for-DataScience
#

Sno.Repo NameRepo DescriptionLanguageStarredFork
1LinkA curated list of awesome transformer models.50435
2Link:memo: An awesome Data Science repository to learn and apply for real world problems.213505461
3LinkDetailed and tailored guide for undergraduate students or anybody want to dig deep into the field of AI with solid foundation.6440900
4LinkA collaborative catalog of NLP resources for Indic languages43264
5LinkThe code from the Machine Learning Bookcamp book and a free course based on the bookJupyter Notebook60421562
6LinkPublication-ready NN-architecture schematics.JavaScript3770478
7LinkThis is a very early attempt at having chatGPT work within a telegram botPython1639247
8LinkCode and data associated with the book “Statistics for Data Scientists: 50 Essential Concepts”R1018633
9LinkSource Code for ‘Text Analytics with Python,’ 2nd Edition by Dipanjan SarkarJupyter Notebook7270
10Link365 Days Computer Vision Learning Linkedin Post395129
11Link500 AI Machine learning Deep learning Computer vision NLP Projects with code126073747
12LinkComputer Vision Papers of the week164
13LinkVarious vrittis associated with the ashtadhyayiPython85
14LinkPython library to aid with your Hindi NLP tasksPython22
15LinkPython3827
16LinkA library for training and deploying machine learning models on Amazon SageMakerPython1856985
17Link200+ detailed flashcards useful for reviewing topics in machine learning, computer vision, and computer science.1631135
18LinkA curated list of community detection research papers with implementations.Python2145356
19LinkFast Python Collaborative Filtering for Implicit Feedback DatasetsPython3177599
20LinkState-of-the-art deep learning model for analyzing sentiment, emotion, sarcasm etc.Python1449317
21Link🌸 Run 100B+ language models at home, BitTorrent-style. Fine-tuning and inference up to 10x faster than offloadingPython4807175
22LinkFramework for fast prototyping of Graph Neural NetworksPython3715
23LinkBookNLP, a natural language processing pipeline for booksPython70074
24LinkThe Carpentries websiteHTML62122
25LinkInstructor Training161271
26LinkA curated list of awesome Deep Learning tutorials, projects and communities.210305852
27LinkThe standard data-centric AI package for data quality and machine learning with messy, real-world data and labels.Python6057500
28LinkA project to deploy an online app that predicts the win probability for each NBA game every day. Demonstrates end-to-end Machine Learning deployment.Jupyter Notebook10412
29LinkLearn deep learning with tensorflow2.0, keras and python through this comprehensive deep learning tutorial series. Learn deep learning from scratch. Deep learning series for beginners. Tensorflow tutorials, tensorflow 2.0 tutorial. deep learning tutorial python.Jupyter Notebook6431685
30Link🦘 Explore multimedia datasets at scaleJupyter Notebook92041
31LinkPython library for converting Python calculations into rendered latex.CSS5211398
32LinkAllows to scale the ChatGPT API to multiple simultaneous sessions with infinite contextual and adaptive memory powered by GPT and Redis datastore.Python36339
33LinkJupyter Notebook979617732
34Link🧠 Material for the Deep Learning Study Group38552
35LinkExplanation to key concepts in ML4297355
36Link✍️ A carefully curated list of NLP paper summaries1453249
37LinkPython1612
38LinkFish Weight Prediction DeploymentPython10
39LinkJupyter Notebook10
40LinkMalaria Detection DeployedPureBasic10
41LinkNLPJupyter Notebook20
42Linkcode for deep learning coursesJupyter Notebook877282
43LinkEpidemic Modeling for EveryoneJupyter Notebook26172
44LinkTo map publicly available datasets related to General Assembly (Lok Sabha) elections in India.Jupyter Notebook137114
45LinkFree MLOps course from DataTalks.ClubJupyter Notebook71751416
46LinkEasily train or fine-tune SOTA computer vision models with one open source training library. The home of Yolo-NAS.Python2721235
47LinkDemocratizing Deep-Learning for Drug Discovery, Quantum Chemistry, Materials Science and BiologyPython43671499
48LinkList of Computer Science courses with video lectures.567258028
49LinkContains relevant notebooks for the hands-on NLP workshop for the Analytics India Magazine Plugin Conference -2020 EditionJupyter Notebook7045
50LinkExtensive tutorials for the Advanced NLP Workshop in Open Data Science Conference Europe 2020. We will leverage machine learning, deep learning and deep transfer learning to learn and solve popular tasks using NLP including NER, Classification, Recommendation \ Information Retrieval, Summarization, Classification, Language Translation, Q&A and Topic Models.Jupyter Notebook13065
51LinkData science Python notebooks: Deep learning (TensorFlow, Theano, Caffe, Keras), scikit-learn, Kaggle, big data (Spark, Hadoop MapReduce, HDFS), matplotlib, pandas, NumPy, SciPy, Python essentials, AWS, and various command lines.Python251697588
52LinkA tool for refurbishing and modernizing Python codebasesPython225044
53LinkMaterials for Mathematical Tools for Neuroscience course at Harvard (Neurobio 212)Jupyter Notebook41055
54LinkMlOps End 2 EndJupyter Notebook1311
55Link📚 Papers & tech blogs by companies sharing their work on data science & machine learning in production.242813381
56Link🪐 End-to-end NLP workflows from prototype to productionPython1106444
57Link💫 Industrial-strength Natural Language Processing (NLP) in PythonPython263184134
58LinkCode release for “Dropout Reduces Underfitting”Python29016
59LinkFacebook AI Research Sequence-to-Sequence Toolkit written in Python.Python262665833
60LinkLibrary for fast text representation and classification.HTML246854608
61LinkHiPlot makes understanding high dimensional data easyTypeScript2485125
62LinkInference code for LLaMA modelsPython230643680
63LinkThe fastai book, published as Jupyter NotebooksJupyter Notebook185757081
64LinkFree online textbook of Jupyter notebooks for fast.ai Computational Linear Algebra courseJupyter Notebook94422413
65LinkList of Data Science Cheatsheets to rule the world122113437
66LinkA very simple framework for state-of-the-art Natural Language Processing (NLP)Python128522027
67LinkfreeCodeCamp.org’s open-source codebase and curriculum. Learn to code for free.TypeScript36843032357
68LinkLearn how to responsibly develop, deploy and maintain production machine learning applications.Jupyter Notebook333425462
69LinkQuick tool to draw fully connected neural network architectures425
70LinkGoogle ResearchJupyter Notebook296647307
71LinkState of the Art Language models and Classifier for Sanskrit language (ancient indian language)Jupyter Notebook6320
72LinkPlotting Assignment 1 for Exploratory Data AnalysisR10
73LinkA curated list of awesome embedding models tutorials, projects and communities.Jupyter Notebook1629243
74Link[ICML 2023] Code for our paper “Compositional Exemplars for In-context Learning”.Python504
75Link🤗 Diffusers: State-of-the-art diffusion models for image and audio generation in PyTorchPython154073082
76Link✨Fast Coreference Resolution in spaCy with Neural NetworksC2698470
77Link🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.Python10337120879
78LinkAn NLP workshop about concrete solutions to real problemsJupyter Notebook1078453
79Link⚡ Building applications with LLMs through composability ⚡Python462505424
80LinkBadges for your personal developer branding, profile, and projects.SCSS81331193
81LinkInstruction Tuning with GPT-4HTML2805198
82LinkSeamlessly integrate powerful language models like ChatGPT into scikit-learn for enhanced text analysis tasks.Python1789139
83LinkOpen3D: A Modern Library for 3D Data ProcessingC++89991987
84LinkDnotebook is a Jupyter-like library for javaScript environment. It allows you to create and share pages that contain live code, text and visualizations.TypeScript13910
85LinkAn unnecessarily tiny implementation of GPT-2 in NumPy.Python2392301
86Link:octocat: Machine Learning for Cyber Security59641626
87LinkA generic, simple and fast implementation of Deepmind’s AlphaZero algorithm.Julia1132119
88LinkA curated list of awesome Machine Learning frameworks, libraries and software.Python5906114052
89LinkCore functionality for the MLJ machine learning frameworkJulia14039
90LinkGeneral Assembly’s Data Science course in Washington, DCJupyter Notebook187212
91LinkMetaSeg: Packaged version of the Segment Anything repositoryPython64941
92LinkEssential Cheat Sheets for deep learning and machine learning researchers https://medium.com/@kailashahirwar/essential-cheat-sheets-for-machine-learning-and-deep-learning-researchers-efb6a8ebd2e5146193457
93LinkWeb interface for browsing, search and filtering recent arxiv submissionsPython48461319
94LinkA Python framework for creating maintainable and modular data science code.Python8421796
95LinkDrench yourself in Deep Learning, Reinforcement Learning, Machine Learning, Computer Vision, and NLP by learning from these exciting lectures!!HTML112072832
96LinkPython67100
97Link😎 Awesome list of tools and projects with the awesome LangChain framework2834145
98Link🧑‍🏫 59 Implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, xl, switch, feedback, vit, …), optimizers (adam, adabelief, …), gans(cyclegan, stylegan2, …), 🎮 reinforcement learning (ppo, dqn), capsnet, distillation, … 🧠Jupyter Notebook240472580
99LinkLet us control diffusion models!Python204721899
100LinkImplementation of DALL-E 2, OpenAI’s updated text-to-image synthesis neural network, in PytorchPython9799934
101LinkGUI-based software for training, evaluating and applying deep neural nets for image classificationPython8218
102LinkPRegEx - Programmable Regular ExpressionsPython71821
103LinkMatplotlib Jupyter IntegrationTypeScript1434216
104LinkCode for the Behavior Retrieval PaperPython91
105LinkExamples of Data Science projects and Artificial Intelligence use-casesJupyter Notebook344267
106Link10 Weeks, 20 Lessons, Data Science for All!Jupyter Notebook196393872
107Link12 weeks, 26 lessons, 52 quizzes, classic Machine Learning for allJupyter Notebook4922710169
108LinkTransformers at any scalePython166792
109LinkLab files for AI-102 - AI EngineerC#342452
110LinkGitHub User Guide for MCTs3825
111LinkSoftware and Data Carpentry instructor training course materialHTML20
112LinkLightwood is Legos for Machine Learning.Python36982
113LinkBuild Web Apps in Jupyter Notebook with Python onlyPython3117188
114LinkPython67
115Linkgpt4all: an ecosystem of open-source chatbots trained on a massive collections of clean assistant data including code, stories and dialogueC++457864869
116LinkEasy-to-use JavaScript library for most common data analysis tasks.TypeScript1218
117LinkOfficial PyTorch implementation of GroupViT: Semantic Segmentation Emerges from Text Supervision, CVPR 2022.Python59547
118LinkA comprehensive machine learning repository containing 30+ notebooks on different concepts, algorithms and techniques.Jupyter Notebook4139654
119LinkOverview of Modern Deep Learning Techniques Applied to Natural Language ProcessingCSS1294198
120Link🌊 Online machine learning in PythonPython4262474
121LinkExamples and guides for using the OpenAI APIJupyter Notebook385185764
122LinkA C library for parsing/normalizing street addresses around the world. Powered by statistical NLP and open geo data.C3718392
123LinkPython bindings to libpostal for fast international address parsing/normalizationC67880
124LinkNeuralProphet: A simple forecasting packagePython2977419
125LinkJupyter Notebook7989
126LinkThe full dataset behind paperswithcode.com26727
127LinkHindi POS Tags and keywords using TNT model. Created Date: 28 Sept 2018Python2210
128LinkPython12549
129LinkThe fastest ⚡️ way to build data pipelines. Develop iteratively, deploy anywhere. ☁️Python3088210
130LinkThis is used for identifying whether a given text has sarcasm in it or not.Java10
131Link“Probabilistic Machine Learning” - a book series by Kevin MurphyJupyter Notebook3935484
132LinkUsing tensorboardX (tensorboard for pytorch) e.g. ploting more than one graph in the same chat etc.Python50
133LinkAn open-source, low-code machine learning library in PythonJupyter Notebook73731604
134LinkTensors and Dynamic neural networks in Python with strong GPU accelerationPython6766918541
135LinkBuild Low Code Automated Tensorflow explainable models in just 3 lines of code. Library created by: Hasan Rafiq - https://www.linkedin.com/in/sam04/Python17737
136LinkCode for 30DayChartChallengeR3411
137LinkDeep Web Extractor (DWX): Deep Web Extractor system is using statistical machine learning models for crawling and data discovery from the Deep Web (i.e., massive and quality portion of World Wide Web) to build knowledge based databases.HTML41
138Link💬 Open source machine learning framework to automate text- and voice-based conversations: NLU, dialogue management, connect to Slack, Facebook, and more - Create chatbots and voice assistantsPython165104359
139LinkDEPRECATED: We recommend using Rasa X https://rasa.com/docs/rasa-x/ for managing NLU dataJavaScript467183
140LinkContainers for machine learningPython4838286
141LinkExamples and tutorials on using SOTA computer vision models and techniques. Learn everything from old-school ResNet, through YOLO and object-detection transformers like DETR, to the latest models like Grounding DINO and SAM.Jupyter Notebook2403338
142LinkAn end to end Interactive Interface for correcting mistakes in OCR output.C++4546
143LinkSent2Vec encoder and training code from the paper “Skip-Thought Vectors”Python2050555
144LinkPyTorch code for Learning Deep Time-index Models for Time Series Forecasting (ICML 2023)Python25743
145LinkMerlion: A Machine Learning Framework for Time Series IntelligencePython2991258
146LinkThe SAS Scripting Wrapper for Analytics Transfer (SWAT) package is the Python client to SAS Cloud Analytic Services (CAS). It allows users to execute CAS actions and process the results all from Python.Python13454
147LinkDatasets for deep learning with satellite & aerial imagery24633
148LinkAlgorithms for outlier, adversarial and drift detectionPython1838180
149LinkAn MLOps framework to package, deploy, monitor and manage thousands of production machine learning modelsHTML3754758
150LinkDeep Learning book the covers the principles of deep learning, motivation, explanations, state of the art papers for the various tasks and architectures: CNNs, object detection, semantic segmentation, generative models, denoising, super resolution, style transfer and style manipulation, inpaintig, self supervised learning, vision transformers, OCR, and multi modal. Hope that it will be useful to some of you 🙂9120
151LinkThis shows how to fine-tune Bert language model and use PyTorch-transformers for text classififcationJupyter Notebook6335
152LinkA Machine Learning project to translate Sanskrit text to EnglishJupyter Notebook3722
153LinkData and code for “DocPrompting: Generating Code by Retrieving the Docs” @ICLR 2023Python16510
154LinkModel to predict the sentiment of Hindi sentences developed this model during my 2nd-year Internship @ algo8.aiJupyter Notebook81
155LinkCode Sample of Book “Effective Python: 59 Specific Ways to Write Better Pyton” by Brett SlatkinPython1362213
156LinkThis repository is a curated collection of links to various courses and resources about Artificial Intelligence (AI)Python2855224
157LinkCSCI-544 Final ProjectPython96
158LinkStableLM: Stability AI Language ModelsJupyter Notebook14581878
159LinkAn awesome & curated list of best LLMOps tools for developersShell90583
160LinkTensorFlow GNN is a library to build Graph Neural Networks on the TensorFlow platform.Python986136
161LinkRich is a Python library for rich text and beautiful formatting in the terminal.Python435541569
162LinkJupyter Notebook22
163LinkMy attempt at researching Quantum Mechanics & Quantum Computing when I was a junior.Jupyter Notebook11655
164LinkAbout Code release for “Autoformer: Decomposition Transformers with Auto-Correlation for Long-Term Series Forecasting” (NeurIPS 2021), https://arxiv.org/abs/2106.13008Jupyter Notebook1132286
165Link🤖 Python examples of popular machine learning algorithms with interactive Jupyter demos and math being explainedJupyter Notebook213773912
166LinkJupyter Notebook4041
167LinkA comprehensive set of fairness metrics for datasets and machine learning models, explanations for these metrics, and algorithms to mitigate bias in datasets and models.Python2046687
168LinkJupyter Notebook102340
169LinkA python library for user-friendly forecasting and anomaly detection on time series.Python5968673
170LinkExtrapolating knowledge graphs from unstructured text using GPT-3 🕵️‍♂️JavaScript3502289
171LinkOfficial repo for paper “LeTI: Learning to Generate from Textual Interactions.”Python506
172LinkA Guide for Feature Engineering and Feature Selection, with implementations and examples in Python.Jupyter Notebook1118371
173LinkPython152332094
174LinkResoruce to help you to prepare for your comming data science interviews40965
175LinkPython867479
176LinkThe GitHub repository for the paper “Informer” accepted by AAAI 2021.Python3764851
177LinkScalable identity resolution, entity resolution, data mastering and deduplication using MLJava74185
Dr. Hari Thapliyaal's avatar

Dr. Hari Thapliyaal

Dr. Hari Thapliyal is a seasoned professional and prolific blogger with a multifaceted background that spans the realms of Data Science, Project Management, and Advait-Vedanta Philosophy. Holding a Doctorate in AI/NLP from SSBM (Geneva, Switzerland), Hari has earned Master's degrees in Computers, Business Management, Data Science, and Economics, reflecting his dedication to continuous learning and a diverse skill set. With over three decades of experience in management and leadership, Hari has proven expertise in training, consulting, and coaching within the technology sector. His extensive 16+ years in all phases of software product development are complemented by a decade-long focus on course design, training, coaching, and consulting in Project Management. In the dynamic field of Data Science, Hari stands out with more than three years of hands-on experience in software development, training course development, training, and mentoring professionals. His areas of specialization include Data Science, AI, Computer Vision, NLP, complex machine learning algorithms, statistical modeling, pattern identification, and extraction of valuable insights. Hari's professional journey showcases his diverse experience in planning and executing multiple types of projects. He excels in driving stakeholders to identify and resolve business problems, consistently delivering excellent results. Beyond the professional sphere, Hari finds solace in long meditation, often seeking secluded places or immersing himself in the embrace of nature.

Comments:

Share with :

Related

What is a Digital Twin?
·805 words·4 mins· loading
Industry Applications Technology Trends & Future Computer Vision (CV) Digital Twin Internet of Things (IoT) Manufacturing Technology Artificial Intelligence (AI) Graphics
What is a digital twin? # A digital twin is a virtual representation of a real-world entity or …
Frequencies in Time and Space: Understanding Nyquist Theorem & its Applications
·4103 words·20 mins· loading
Data Analysis & Visualization Computer Vision (CV) Mathematics Signal Processing Space Exploration Statistics
Applications of Nyquists theorem # Can the Nyquist-Shannon sampling theorem applies to light …
The Real Story of Nyquist, Shannon, and the Science of Sampling
·1146 words·6 mins· loading
Technology Trends & Future Interdisciplinary Topics Signal Processing Remove Statistics Technology Concepts
The Story of Nyquist, Shannon, and the Science of Sampling # In the early days of the 20th century, …
BitNet b1.58-2B4T: Revolutionary Binary Neural Network for Efficient AI
·2637 words·13 mins· loading
AI/ML Models Artificial Intelligence (AI) AI Hardware & Infrastructure Neural Network Architectures AI Model Optimization Language Models (LLMs) Business Concepts Data Privacy Remove
Archive Paper Link BitNet b1.58-2B4T: The Future of Efficient AI Processing # A History of 1 bit …
Ollama Setup and Running Models
·1753 words·9 mins· loading
AI and NLP Ollama Models Ollama Large Language Models Local Models Cost Effective AI Models
Ollama: Running Large Language Models Locally # The landscape of Artificial Intelligence (AI) and …