Web Scraping Engineer - On-Site (U.S. Citizen - Active Secret Clearance)
Arlington VA, VA
Job Type
Full-time
Description

NOTE:   This is a full-time employment opportunity (No C2C, subcontractor or 1099 engagements, please.). The candidate MUST be a U.S. Citizen and have an active secret clearance.  This is an on-site opportunity - candidate must be in local to the Arlington VA area. 


Daily Responsibilities 

  • Position supports the development and maintenance of the agency’s web scraping infrastructure. The position is responsible for extracting data from various websites and APIs, ensuring data quality and accuracy, and optimizing the scraping process for efficiency. Duties include:
  • Develop and maintain web scraping scripts and tools to extract data from websites and APIs.
  • Collaborate with cross-functional teams to understand data requirements and implement scraping solutions accordingly.
  • Monitor and troubleshoot scraping processes to ensure data quality and accuracy.
  • Optimize scraping scripts for performance and efficiency, considering factors such as speed, scalability, and resource utilization.
  • Stay up to date with the latest web scraping techniques, tools, and best practices.
  • Conduct data analysis and validation to ensure the integrity of scraped data.
  • Collaborate with data engineering and data science teams to integrate scraped data into our data pipelines and systems.
  • Document and communicate technical solutions, processes, and best practices to team members.
Requirements
  • Must be a U.S. Citizen (no dual status) as mandated by our government client.
  • Must have an active secret clearance.
  • Must be local to the Arlington VA metro area, no convenience travel.
  • 3+ years of professional experience in web scraping or a similar role.
  • Proficiency in Python and Java and experience with web scraping libraries such as Beautiful Soup, Scrapy, or Selenium.
  • Knowledge of AI/machine learning techniques for data extraction and classification.
  • Understanding of HTML, CSS, and JavaScript to navigate and interact with websites.
  • Experience working with APIs and handling different data formats (JSON, XML, etc.).
  • Familiarity with database systems and SQL for data storage and retrieval.
  • Familiarity of data cleaning and preprocessing techniques to ensure data quality.
  • Strong problem-solving skills and ability to troubleshoot and debug scraping issues.
  • Excellent communication and collaboration skills to work effectively in a team environment.
  • Attention to detail and ability to handle large volumes of data efficiently.
  • BS/BA degree in Computer Science, Information Sciences, or related IT discipline. Additional years of related professional experience can be substituted for a BS/BA degree.


Additional Qualifications (a PLUS):

  • Experience with cloud platforms for scalable web scraping infrastructure.
  • Familiarity with data visualization tools and techniques.
  • Understanding of legal and ethical considerations related to web scraping.

  

Compensation decisions depend on a wide range of factors, including but not limited to skill sets, experience and training, security clearances, licensure and certifications, location, and other business and organizational needs.


Salary Description
95,000-105,000