{"id":27818105,"url":"https://microsoft.github.io/msmarco/","last_synced_at":"2025-05-01T15:40:39.931Z","repository":{"id":39800024,"uuid":"233923667","full_name":"microsoft/msmarco","owner":"microsoft","description":"website for MS Marco","archived":false,"fork":false,"pushed_at":"2025-03-26T20:44:18.000Z","size":7611,"stargazers_count":29,"open_issues_count":0,"forks_count":16,"subscribers_count":9,"default_branch":"master","last_synced_at":"2025-04-30T12:48:04.026Z","etag":null,"topics":[],"latest_commit_sha":null,"homepage":"https://microsoft.github.io/msmarco/.","language":"JavaScript","has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":"cc-by-4.0","status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/microsoft.png","metadata":{"files":{"readme":"README.md","changelog":null,"contributing":null,"funding":null,"license":"LICENSE","code_of_conduct":"CODE_OF_CONDUCT.md","threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":"SECURITY.md","support":null,"governance":null,"roadmap":null,"authors":null,"dei":null}},"created_at":"2020-01-14T19:55:15.000Z","updated_at":"2025-04-04T02:02:59.000Z","dependencies_parsed_at":"2024-04-11T03:44:27.880Z","dependency_job_id":null,"html_url":"https://github.com/microsoft/msmarco","commit_stats":null,"previous_names":[],"tags_count":0,"template":false,"template_full_name":null,"repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/microsoft%2Fmsmarco","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/microsoft%2Fmsmarco/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/microsoft%2Fmsmarco/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/microsoft%2Fmsmarco/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/microsoft","download_url":"https://codeload.github.com/microsoft/msmarco/tar.gz/refs/heads/m
aster","host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":251901561,"owners_count":21662405,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2022-07-04T15:15:14.044Z","host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":[],"created_at":"2025-05-01T15:40:36.858Z","updated_at":"2025-05-01T15:40:39.923Z","avatar_url":"https://github.com/microsoft.png","language":"JavaScript","readme":"\r\n## Terms and Conditions\r\n\r\nThe MS MARCO and ORCAS datasets are intended for non-commercial research purposes only to promote advancement in the field of artificial intelligence and related areas, and are made available free of charge without extending any license or other intellectual property rights.\r\nThe datasets are provided \"as is\" without warranty and usage of the data has risks since we may not own the underlying rights in the documents.\r\nWe are not liable for any damages related to use of the dataset.\r\nFeedback is voluntarily given and can be used as we see fit.\r\nBy using any of these datasets you are automatically agreeing to abide by these terms and conditions.\r\nUpon violation of any of these terms, your rights to use the dataset will end automatically.\r\n\r\nPlease contact us at ms-marco@microsoft.com if you own any of the documents made available but do not want them in this dataset.\r\nWe will remove the data accordingly.\r\nIf you have questions about use of the dataset or any research outputs in your products or services, we encourage you to undertake your own independent legal review.\r\nFor other questions, please feel free to contact us.\r\n\r\n## Contributing\r\n\r\nThis project welcomes 
contributions and suggestions.  Most contributions require you to agree to a\r\nContributor License Agreement (CLA) declaring that you have the right to, and actually do, grant us\r\nthe rights to use your contribution. For details, visit https://cla.opensource.microsoft.com.\r\n\r\nWhen you submit a pull request, a CLA bot will automatically determine whether you need to provide\r\na CLA and decorate the PR appropriately (e.g., status check, comment). Simply follow the instructions\r\nprovided by the bot. You will only need to do this once across all repos using our CLA.\r\n\r\nThis project has adopted the [Microsoft Open Source Code of Conduct](https://opensource.microsoft.com/codeofconduct/).\r\nFor more information see the [Code of Conduct FAQ](https://opensource.microsoft.com/codeofconduct/faq/) or\r\ncontact [opencode@microsoft.com](mailto:opencode@microsoft.com) with any additional questions or comments.\r\n\r\n## Legal Notices\r\n\r\nMicrosoft and any contributors grant you a license to the Microsoft documentation and other content\r\nin this repository under the [Creative Commons Attribution 4.0 International Public License](https://creativecommons.org/licenses/by/4.0/legalcode),\r\nsee the [LICENSE](LICENSE) file, and grant you a license to any code in the repository under the [MIT License](https://opensource.org/licenses/MIT), see the\r\n[LICENSE-CODE](LICENSE-CODE) file.\r\n\r\nMicrosoft, Windows, Microsoft Azure and/or other Microsoft products and services referenced in the documentation\r\nmay be either trademarks or registered trademarks of Microsoft in the United States and/or other countries.\r\nThe licenses for this project do not grant you rights to use any Microsoft names, logos, or trademarks.\r\nMicrosoft's general trademark guidelines can be found at \u003chttp://go.microsoft.com/fwlink/?LinkID=254653\u003e.\r\n\r\nPrivacy information can be found at \u003chttps://privacy.microsoft.com/en-us/\u003e.\r\n\r\nMicrosoft and any contributors 
reserve all other rights, whether under their respective copyrights, patents,\r\nor trademarks, whether by implication, estoppel or otherwise.\r\n","funding_links":[],"categories":["Anthropomorphic-Taxonomy","Benchmarks \u0026 Datasets","📊 Datasets for Deep Research","Datasets and Benchmarks"],"sub_categories":["Typical Intelligence Quotient (IQ)-General Intelligence evaluation benchmarks","Domain-Specific Benchmarks","🛠️ Agent Frameworks","Evaluation"],"project_url":"https://awesome.ecosyste.ms/api/v1/projects/microsoft.github.io%2Fmsmarco%2F","html_url":"https://awesome.ecosyste.ms/projects/microsoft.github.io%2Fmsmarco%2F","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/microsoft.github.io%2Fmsmarco%2F/lists"}