{"id":29610980,"url":"https://github.com/lukaszkn/data-software-engineering-interview-questions","last_synced_at":"2025-07-20T20:32:13.384Z","repository":{"id":303362752,"uuid":"1014942788","full_name":"lukaszkn/data-software-engineering-interview-questions","owner":"lukaszkn","description":"Data and Software engineering interview questions","archived":false,"fork":false,"pushed_at":"2025-07-07T07:17:36.000Z","size":1079,"stargazers_count":0,"open_issues_count":0,"forks_count":0,"subscribers_count":0,"default_branch":"main","last_synced_at":"2025-07-07T08:31:53.551Z","etag":null,"topics":["data","engineering","interview-questions","python"],"latest_commit_sha":null,"homepage":"","language":null,"has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":null,"status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/lukaszkn.png","metadata":{"files":{"readme":"README.md","changelog":null,"contributing":null,"funding":null,"license":null,"code_of_conduct":null,"threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":null,"support":null,"governance":null,"roadmap":null,"authors":null,"dei":null,"publiccode":null,"codemeta":null,"zenodo":null}},"created_at":"2025-07-06T17:56:04.000Z","updated_at":"2025-07-07T07:17:40.000Z","dependencies_parsed_at":"2025-07-07T08:33:14.840Z","dependency_job_id":"a04f52c8-af3b-49f9-b66c-4b74669dbc24","html_url":"https://github.com/lukaszkn/data-software-engineering-interview-questions","commit_stats":null,"previous_names":["lukaszkn/data-software-engineering-interview-questions"],"tags_count":0,"template":false,"template_full_name":null,"purl":"pkg:github/lukaszkn/data-software-engineering-interview-questions","repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/lukaszkn%2Fdata-software-engineering-interview-questions","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/lukaszkn%2Fdata-software-engineering-inte
rview-questions/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/lukaszkn%2Fdata-software-engineering-interview-questions/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/lukaszkn%2Fdata-software-engineering-interview-questions/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/lukaszkn","download_url":"https://codeload.github.com/lukaszkn/data-software-engineering-interview-questions/tar.gz/refs/heads/main","sbom_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/lukaszkn%2Fdata-software-engineering-interview-questions/sbom","host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":266193098,"owners_count":23890722,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2022-07-04T15:15:14.044Z","host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":["data","engineering","interview-questions","python"],"created_at":"2025-07-20T20:30:52.169Z","updated_at":"2025-07-20T20:32:13.355Z","avatar_url":"https://github.com/lukaszkn.png","language":null,"funding_links":[],"categories":[],"sub_categories":[],"readme":"# 4121 Data and Software engineering interview questions\n| Questions | Description |\n| --- | --- |\n| [Amazon Neptune](content/amazon_neptune.md) | A fast, fully managed database service powering graph use cases such as identity graphs, knowledge graphs, and fraud detection. 
|\n| [Ansible](content/ansible.md) | An open-source automation tool primarily used for configuration management, application deployment and orchestration |\n| [Apache Airflow](content/apache_airflow.md) | Apache Airflow |\n| [Apache Flink](content/apache_flink.md) | Apache Flink |\n| [Apache Flume](content/apache_flume.md) | Apache Flume |\n| [Apache HBase](content/apache_hbase.md) | Apache HBase |\n| [Apache Hive](content/apache_hive.md) | Apache Hive |\n| [Apache Kafka](content/apache_kafka.md) | Apache Kafka |\n| [Apache Spark](content/apache_spark.md) | Apache Spark |\n| [Apache Superset](content/apache_superset.md) | Apache Superset |\n| [AWS](content/aws.md) | AWS |\n| [AWS Glue](content/aws_glue.md) | A serverless data integration service that makes it easy for analytics users to discover, prepare, move, and integrate data from multiple sources |\n| [AWS Lambda](content/aws_lambda.md) | AWS Lambda |\n| [Azure](content/azure.md) | Azure |\n| [Azure Databricks](content/azure_databricks.md) | Azure Databricks |\n| [Azure Purview](content/azure_purview.md) | A unified data governance solution that helps organizations discover, manage, and govern their data estate across on-premises, multi-cloud, and SaaS environments |\n| [Big Data Engineering](content/bigdata.md) | Big Data engineering concepts and tools. |\n| [Data pipelines](content/data_pipelines.md) | Data pipelines basics |\n| [Data Warehousing](content/dwha.md) | Data Warehousing Architecture |\n| [Databricks Machine Learning](content/databricks_machine_learning.md) | Databricks Machine Learning |\n| [dbt](content/dbt.md) | dbt |\n| [Delta Lake](content/delta_lake.md) | An open-source storage layer that adds ACID transactions and schema enforcement to data lakes |\n| [Elasticsearch](content/elasticsearch.md) | A search engine based on Apache Lucene, a free and open-source search engine. 
It provides a distributed, multitenant-capable full-text search engine with an HTTP web interface and schema-free JSON documents. |\n| [FastAPI](content/fastapi.md) | A high-performance web framework for building HTTP-based service APIs in Python |\n| [General](content/general.md) | General programming concepts, design patterns |\n| [General Data Engineer interview](content/general_interview.md) | General, behavioral, communication, collaboration, problem solving from data engineering perspective |\n| [Google Cloud Platform](content/gcp.md) | Google Cloud Platform |\n| [Grafana](content/grafana.md) | A multi-platform open source analytics and interactive visualization web application. |\n| [Hadoop](content/hadoop.md) | Hadoop |\n| [Jenkins](content/jenkins.md) | An open source automation server. It helps automate the parts of software development related to building, testing, and deploying |\n| [Jetpack Compose](content/jetpack_compose.md) | Basics |\n| [Kotlin Basics](content/kotlin.md) | Basic syntax, functions, variables, classes, conditional expressions, loops, ranges, collections, nullable values |\n| [Machine learning](content/machine_learning.md) | Basic concepts |\n| [MongoDB](content/mongodb.md) | MongoDB |\n| [Pandas](content/pandas.md) | A software library written for Python for data manipulation and analysis |\n| [Polars](content/polars.md) | Polars |\n| [Power BI](content/power_bi.md) | A business analytics and data visualization tool |\n| [Power BI DAX](content/power_bi_dax.md) | Power BI DAX |\n| [PySpark](content/pyspark.md) | PySpark |\n| [Python](content/python.md) | The basics, interpreter, numbers, text, lists, sets, dictionaries, control flow, loops, functions |\n| [Python Advanced](content/pythonadvanced.md) | Functions, annotations, coding style, reading and writing files, classes, iterators, standard library |\n| [Python How-To](content/pythonhowto.md) | How-to's |\n| [RxSwift](content/rxswift.md) | Basics of RxSwift |\n| 
[Scala](content/scala_de.md) | Scala for data engineering |\n| [Scala Essential](content/scala.md) | Essential Scala programming concepts |\n| [Snowflake](content/snowflake.md) | A cloud data platform that at its core features a columnar-stored data warehouse |\n| [SQL](content/sql.md) | SQL |\n| [SQL How to](content/sqlhowto.md) | SQL tips \u0026 tricks |\n| [Swift Advanced](content/swiftadvanced.md) | Properties, subscripts, concurrency, type casting, nested types, extensions, protocols, generics, Combine framework |\n| [Swift Basics](content/swift.md) | The basics, string and characters, collection types, control flow, functions, closures, enumerations, structures and classes, properties, methods |\n| [Swift UI Advanced](content/swiftuiadvanced.md) | Advanced topics and how-to's |\n| [Swift UI Basics](content/swiftui.md) | Walk through the building blocks of a SwiftUI |\n| [Tableau](content/tableau.md) | Tableau |\n| [Terraform](content/terraform.md) | An infrastructure as code tool that lets you build, change, and version infrastructure safely and efficiently |\n\n\n[All questions](content/_all.md)\n","project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Flukaszkn%2Fdata-software-engineering-interview-questions","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Flukaszkn%2Fdata-software-engineering-interview-questions","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Flukaszkn%2Fdata-software-engineering-interview-questions/lists"}