{"id":20276914,"url":"https://github.com/guptaachin/us-visa-data-analysis","last_synced_at":"2026-02-25T05:32:31.120Z","repository":{"id":106631954,"uuid":"147231870","full_name":"guptaachin/US-VISA-Data-Analysis","owner":"guptaachin","description":"Analyzing the different parameters that make a successful US-VISA Application.","archived":false,"fork":false,"pushed_at":"2018-09-10T19:52:07.000Z","size":23027,"stargazers_count":0,"open_issues_count":0,"forks_count":0,"subscribers_count":0,"default_branch":"master","last_synced_at":"2025-10-25T22:34:52.624Z","etag":null,"topics":["dataanalytics","documentation","tableau","visualization"],"latest_commit_sha":null,"homepage":null,"language":null,"has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":null,"status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/guptaachin.png","metadata":{"files":{"readme":"README.md","changelog":null,"contributing":null,"funding":null,"license":null,"code_of_conduct":null,"threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":null,"support":null,"governance":null,"roadmap":null,"authors":null,"dei":null,"publiccode":null,"codemeta":null}},"created_at":"2018-09-03T16:56:21.000Z","updated_at":"2018-09-10T20:15:39.000Z","dependencies_parsed_at":"2023-03-23T11:33:53.826Z","dependency_job_id":null,"html_url":"https://github.com/guptaachin/US-VISA-Data-Analysis","commit_stats":null,"previous_names":[],"tags_count":0,"template":false,"template_full_name":null,"purl":"pkg:github/guptaachin/US-VISA-Data-Analysis","repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/guptaachin%2FUS-VISA-Data-Analysis","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/guptaachin%2FUS-VISA-Data-Analysis/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/guptaachin%2FUS-VISA-Data-Analysis/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/guptaachin%2FUS-VISA-Data-Analysis/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/guptaachin","download_url":"https://codeload.github.com/guptaachin/US-VISA-Data-Analysis/tar.gz/refs/heads/master","sbom_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/guptaachin%2FUS-VISA-Data-Analysis/sbom","scorecard":null,"host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":286080680,"owners_count":29811543,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2026-02-25T03:30:18.102Z","status":"ssl_error","status_checked_at":"2026-02-25T03:30:17.799Z","response_time":61,"last_error":"SSL_read: unexpected eof while reading","robots_txt_status":"success","robots_txt_updated_at":"2025-07-24T06:49:26.215Z","robots_txt_url":"https://github.com/robots.txt","online":false,"can_crawl_api":true,"host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":["dataanalytics","documentation","tableau","visualization"],"created_at":"2024-11-14T13:16:12.353Z","updated_at":"2026-02-25T05:32:31.100Z","avatar_url":"https://github.com/guptaachin.png","language":null,"funding_links":[],"categories":[],"sub_categories":[],"readme":"# US-VISA-Data-Analysis\nQuick look up : [The Tableau dashboard](https://public.tableau.com/shared/32BSS87B3?:display_count=yes), [SPAP](https://github.com/gauscian/US-VISA-Data-Analysis/blob/master/%5BSPAP%5DWhat-maximizes-the-chances-of-a-US-VISA-being-Certified.png)\n## Why am I creating this repo?\nThis mini project is dedicated to proposing an [organized way](https://www.coursera.org/learn/analytics-tableau) of working through a Data Science problem. I see Data Analysis as the most important part of a data science project. If you do not know your data well, you can never produce conclusive results. In addition to this it is particularly impossible to model a data effectively if you do know how your fields impact the target attribute. \u003cbr\u003e Further, according to my observation the only stand alone reason for the cloud around the definition of Data Science is actually the absence of a defined paradigm for solving the Data Science problem. I think there isn't any. This method I am about to introduce will however help a beginner to at least approach a problem with some clarity in mind and be confident of the end product.\u003cbr\u003e\n\n \u003e For me Data Science is an art of first being passionate about your problem, understanding the problem statement, collecting the relevant data and analyze it to solve the problem at hand.\n\n I think this is pretty much I try to do with each one of my data science project. Let me try to break down my definition of data science. \u003cbr\u003e\n\n 1. Being passionate about your problem - This allows you to put in the required amount of mental resources to actually think about the problem and formulate a problem statement. i.e a Specific, Measurable, Attainable, Relevant, Time bound **(SMART goal)**. For instance, if you aspire to come to USA to have a taste of real innovation, you might wonder about the most successful route of getting your VISA approved.\n 2. After you are down with your problem statement, you should have a **dependent variable**, the variable you want to make conclusions about. Here it is Case Status.\n 3. Followed by this you might want to think about the **independent variables**, the variables that would affect this dependent variable.\n 4. Further you can go ahead and think about the ways (or specific **visualizations**) you can make to be clear about the relation ships about your IVs and DV.\n 5. Going ahead down the line, use a tool to create the planned visualizations and digest the overall theme of your data. It is important to be sure what your data says about your DV. This will help you to get a feel of fields you would want to use when you use ML algorithms to model your data.\n 6. Finally now that you are clear about your final list of IVs and their relationship with the DV, you can go ahead and model the problem with any of the **machine learning algorithms**.\n\n The most important point - Since this is just a paradigm, it takes a lot of practice to master the skill. So be patient and be curious. \u003cbr\u003e\n Be sure to have all the information in a mind map (SPAP). So that you can re iterate over it and continuously evolve it.\n\n## Introducing the motivation\nWhen a US company wants to hire someone from outside of the United States for a technical\nposition, they have to file an application to the United States government to get a green card or visa\nfor the foreign applicant. These applications allow the US government to track who is entering and\nleaving the country for work-related reasons, and ensure that immigrants are neither being taken\nadvantage of nor causing adverse effects for U.S. workers. To ensure equity for US and non-US\nworkers, companies have to state how much they are planning on paying the employee every time\nthey submit a visa or green card application. They also have to state the average amount an\nemployee with similar skills and background typically gets paid for the same position, a figure\ncalled “the prevailing wage.” This publically available data provides a unique view into what types\nof salaries you might encounter for different data–related jobs in the US.\u003cbr\u003e\n\n*Skewed nature of the data* - makes the predictive modelling around case status impractical.\u003cbr\u003e\n![here](tableau-exports/Skewed-data-wrt-case-status.png)\n\n\n\n\n## Source\nThe original data was compiled by the [US Department of Labor’s Office of Foreign Labor\nCertification](http://www.foreignlaborcert.doleta.gov/performancedata.cfm).\n\n## Analysis\n\n### SPAP - (Structured Analysis Plan)\n[Here](https://github.com/gauscian/US-VISA-Data-Analysis/blob/master/%5BSPAP%5DWhat-maximizes-the-chances-of-a-US-VISA-being-Certified.png) is the SPAP for the above problem. This plan was prepared after a brief look into the data and imbibing the overall intuition of the data.\n#### Description of the SPAP :\n    1.  Layer 1 : SMART GOAL of the analysis. Since the data I analyze is particularly skewed with respect to the Case Status field I only use the SMART goal for thinking about the possible Independent variables.\n    2.  Layer 2 : Dependent variable (Case status in our case).\n    3.  Layer 3 : Independent varibles hypothesized to be associated with the Case Status variable.\n    4.  Layer 4 : Describes the plan for the vizualizations to be created.\n    5.  Layer 5 : Is the reference to the conclusions drawn as mentioned in this report.\n\n\n### My Analysis of the problem.\n\nList of related resources.\n1. [The Tableau dashboard](https://public.tableau.com/shared/32BSS87B3?:display_count=yes)\n2. [SPAP](https://github.com/gauscian/US-VISA-Data-Analysis/blob/master/%5BSPAP%5DWhat-maximizes-the-chances-of-a-US-VISA-being-Certified.png)\n\nThe last layer of the SPAP defines the numbers I use here to conclude on my findings about the respective Independent variables.\u003cbr\u003e\n\n### Here are visualizations I created to digest the data\n\n1. Affect of Job Title Sub Group\n![here](tableau-exports/DoesJTBAffectCaseStatus.png)\n2. Distribution of VISA applications across States in USA\n![here](tableau-exports/DistributionOfVISAappsAcrossStates.png)\n3. Analyzing the relation between Case status and the VISA received date\n![here](tableau-exports/ReceviedDateForH1B.png)\n4. Affect of VISA type\n![here](tableau-exports/VISAtype.png)\n5. Concluding analysis on underpaying which clearly show the US's labour attitude in protecting its labour class.\n![here](tableau-exports/PaidPrev.png)","project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fguptaachin%2Fus-visa-data-analysis","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Fguptaachin%2Fus-visa-data-analysis","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fguptaachin%2Fus-visa-data-analysis/lists"}