{"id":24539724,"url":"https://github.com/jayfunc/Doc2Vec-Server","last_synced_at":"2025-10-03T15:31:27.735Z","repository":{"id":202685673,"uuid":"707916501","full_name":"jayllfilebyte/doc2vec_server","owner":"jayllfilebyte","description":"A Python server to be invoked doc2vec method, which uses TensorFlow and BERT model.","archived":false,"fork":false,"pushed_at":"2023-10-21T03:43:24.000Z","size":17,"stargazers_count":0,"open_issues_count":0,"forks_count":0,"subscribers_count":1,"default_branch":"main","last_synced_at":"2025-01-13T02:38:46.901Z","etag":null,"topics":["bert","doc2vec","python","tensorflow"],"latest_commit_sha":null,"homepage":"","language":"Python","has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":"gpl-3.0","status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/jayllfilebyte.png","metadata":{"files":{"readme":"README.md","changelog":null,"contributing":null,"funding":null,"license":"LICENSE","code_of_conduct":null,"threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":null,"support":null,"governance":null,"roadmap":null,"authors":null,"dei":null,"publiccode":null,"codemeta":null}},"created_at":"2023-10-21T01:16:35.000Z","updated_at":"2023-10-21T02:01:08.000Z","dependencies_parsed_at":null,"dependency_job_id":"d634af9a-76f3-4c7a-81f0-eeb3a0389488","html_url":"https://github.com/jayllfilebyte/doc2vec_server","commit_stats":null,"previous_names":["founchoo/doc2vec_server","zhefangbyte/doc2vec_server","jayllfilebyte/doc2vec_server"],"tags_count":0,"template":false,"template_full_name":null,"repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/jayllfilebyte%2Fdoc2vec_server","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/jayllfilebyte%2Fdoc2vec_server/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/jayllfilebyte%2Fdoc2vec_server/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/jayllfilebyte%2Fdoc2vec_server/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/jayllfilebyte","download_url":"https://codeload.github.com/jayllfilebyte/doc2vec_server/tar.gz/refs/heads/main","host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":235152338,"owners_count":18944168,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2022-07-04T15:15:14.044Z","host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":["bert","doc2vec","python","tensorflow"],"created_at":"2025-01-22T17:15:58.462Z","updated_at":"2025-10-03T15:31:27.430Z","avatar_url":"https://github.com/jayllfilebyte.png","language":"Python","readme":"# Doc2Vec Server\nA Python back-end server supporting to be invoked doc2vec method by other language via HTTP request.\n\n# Doc2Vec method\nI take the reference [here](https://juejin.cn/s/bert%20%E6%96%87%E6%9C%AC%E7%89%B9%E5%BE%81%E6%8F%90%E5%8F%96)\n\n```\nimport torch\nfrom transformers import BertTokenizer, BertModel\n\n# 加载BERT模型和tokenizer\ntokenizer = BertTokenizer.from_pretrained('bert-base-uncased')\nmodel = BertModel.from_pretrained('bert-base-uncased')\n\n# 输入文本\ntext = \"Hello, how are you?\"\n\n# 将文本转换为BERT需要的输入格式\ninput_ids = torch.tensor(tokenizer.encode(text, add_special_tokens=True)).unsqueeze(0) \n\n# 使用BERT模型提取文本特征\noutputs = model(input_ids)\nword_embeddings = outputs.last_hidden_state  # 每个单词的词向量\nsentence_embedding = outputs.pooler_output  # 整个句子的句子向量\n```\n\n# Python server\n[Here](https://pythonbasics.org/webserver/) is the example.\n\n```\n# Python 3 server example\nfrom http.server import BaseHTTPRequestHandler, HTTPServer\nimport time\n\nhostName = \"localhost\"\nserverPort = 8080\n\nclass MyServer(BaseHTTPRequestHandler):\n    def do_GET(self):\n        self.send_response(200)\n        self.send_header(\"Content-type\", \"text/html\")\n        self.end_headers()\n        self.wfile.write(bytes(\"\u003chtml\u003e\u003chead\u003e\u003ctitle\u003ehttps://pythonbasics.org\u003c/title\u003e\u003c/head\u003e\", \"utf-8\"))\n        self.wfile.write(bytes(\"\u003cp\u003eRequest: %s\u003c/p\u003e\" % self.path, \"utf-8\"))\n        self.wfile.write(bytes(\"\u003cbody\u003e\", \"utf-8\"))\n        self.wfile.write(bytes(\"\u003cp\u003eThis is an example web server.\u003c/p\u003e\", \"utf-8\"))\n        self.wfile.write(bytes(\"\u003c/body\u003e\u003c/html\u003e\", \"utf-8\"))\n\nif __name__ == \"__main__\":        \n    webServer = HTTPServer((hostName, serverPort), MyServer)\n    print(\"Server started http://%s:%s\" % (hostName, serverPort))\n\n    try:\n        webServer.serve_forever()\n    except KeyboardInterrupt:\n        pass\n\n    webServer.server_close()\n    print(\"Server stopped.\")\n```\n","funding_links":[],"categories":[],"sub_categories":[],"project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fjayfunc%2FDoc2Vec-Server","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Fjayfunc%2FDoc2Vec-Server","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fjayfunc%2FDoc2Vec-Server/lists"}