Find Jobs
Hire Freelancers

Implement python code to generate term-docid pairs for a small startup project.

$10-30 CAD

Lukket
Slået op 7 måneder siden

$10-30 CAD

Betales ved levering
I am looking for a freelancer who can help implement python code for my small startup project. The project involves generating term-docid pairs for a dataset of less than 1000 documents. Skills and experience required for this job include: - Intermediate level python coding expertise - Knowledge of working with small datasets - Ability to generate term-docid pairs efficiently The code does not need to be extensively commented for future reference. Sub Project 1: Use reuters, the module that while there are still more documents to be processed, accepts a document as a list of tokens (omit punctuation) and outputs term-docID pairs. Instead of appending new term-docID pairings to a list, make sure you directly append the docID to the postings list for the term. You may use a hash table. No boxes required. (a) compare timing of this SPIMI inspired procedure with the naive indexer (for 10000 term-docID pairings). (b) compile an inverted index for Reuters21578 without using any compression techniques docID hint: Use the NEWID values from the Reuters corpus to make your retrieval comparable. Subproject II: Convert your indexer into a probabilistic search engine 1. using the assumptions made about independence of terms and documents etc. and 2. using the BM25 formula 3. rank the documents your index returns and 4. for a given query, return a ranked list of results. Notes: experiment with different values for the parameters k1 and b. Test queries: 1. design four test queries: (a) a single keyword query. Compare results for the same queries of Subproject I with the results for your Nave indexer (b) a multiple keyword query for Subproject I returning documents containing all the keywords (AND) for unranked retrieval (c) a multiple keywords query returning documents containing at least one keyword (OR), where documents are ordered by how many keywords they contain) (d) a query consisting of several keywords for ranking with BM25 2. run your four test queries to showcase your code and comment on the results in your report Deliverables: well documented sample runs for your queries on the information needs: (a) Democrats’ welfare and healthcare reform policies (b) Drug company bankruptcies (c) George Bush 4. any additional testing or aborted design ideas that show off particular aspects of your project.
Projekt-ID: 37395082

Om projektet

5 forslag
Projekt på afstand
Aktiv 5 måneder siden

Leder du efter muligheder for at tjene penge?

Fordele ved budafgivning på Freelancer

Fastsæt dit budget og din tidsramme
Bliv betalt for dit arbejde
Oprids dit forslag
Det er gratis at skrive sig op og byde på jobs
5 freelancere byder i gennemsnit $24 CAD på dette job
Brug Avatar.
Hi there, I am Hamza and I am excited to hear that you are looking for someone to help implement code for your small startup project. With my extensive experience in python coding and data processing specifically related to generating term-docid pairs, I believe I am the perfect fit for this job. I have intermediate level python coding expertise as well as knowledge of working with small datasets which will be beneficial in efficiently generating term-docid pairs for your project. Additionally, I am confident that my skillset can be used to create an effective probabilistic search engine from your indexer so that you can get fast results from your searches without having to manually append new term-docid pairings to a list - this is exactly what SPIMI inspired procedure does for Reuters21578 dataset (see Subproject 1).
$50 CAD på 1 dag
4,9 (83 anmeldelser)
5,6
5,6
Brug Avatar.
Hello there, I’m a python expert and I have a great experience with python libraries that your needs for your project. I can handle this project as required. Please contact me for more details. Best Regards !
$20 CAD på 2 dage
4,9 (18 anmeldelser)
4,1
4,1
Brug Avatar.
hI,i can deliver you the work within hours as i have a strong background in python and can efficiently generate term-docid pairs for your project. My experience in python coding enables me to develop an efficient code that can be used to generate term-docid pairs for your dataset within hours. Additionally, my expertise in working with small datasets enables me to efficiently generate term-docid pairs for your project without any issues. I understand the need for collaboration and effective communication when it comes to software projects. As such, I am committed to engaging in open dialogue and actively seeking feedback from clients during the project development process. This ensures that my work aligns with the client's vision and exceeds their expectations. I look forward to hearing from you soon!
$20 CAD på 2 dage
4,9 (11 anmeldelser)
3,6
3,6
Brug Avatar.
I understand that you are looking for someone to help implement python code for your small startup project involving generating term-docid pairs for a dataset of less than 1000 documents. I have extensive experience in python coding, specifically intermediate level Python expertise, knowledge of working with small datasets and ability to generate term-docid pairs efficiently. I believe my skillset is ideal for this job as I can use my experience in software architecture, website management and other relevant areas to create an optimum solution for your needs. Additionally my expertise includes developing probabilistic search engines using the BM25 formula which can be used to rank documents returned from an indexer and for a given query return a ranked list of results. Furthermore I can provide well documented sample runs for your queries on the information needs (Democrat's welfare and healthcare reform policies Drug company bankruptcies George Bush) any additional testing or aborted design ideas that show off particular aspects of your project
$20 CAD på 7 dage
0,0 (0 anmeldelser)
0,0
0,0
Brug Avatar.
Hey, I have read your entire job description and am confident that I can do this job of yours. I have a lot of experience in this field. I am an expert in Python, SQL and Software Architecture. I can complete this job of yours in a cost-effective and timely manner. Please feel free to contact me for this project
$11 CAD på 1 dag
0,0 (0 anmeldelser)
0,0
0,0

Om klienten

Flag for CANADA
Montreal, Canada
4,8
1
Betalingsmetode verificeret
Medlem siden aug. 27, 2022

Klientverificering

Tak! Vi har sendt dig en e-mail med et link, så du kan modtage din kredit.
Noget gik galt, da vi forsøgte at sende din mail. Prøv venligst igen.
Registrerede brugere Oprettede jobs i alt
Freelancer ® is a registered Trademark of Freelancer Technology Pty Limited (ACN 142 189 759)
Copyright © 2024 Freelancer Technology Pty Limited (ACN 142 189 759)
Indlæser forhåndsvisning
Geolokalisering er tilladt.
Din session er udløbet, og du er blevet logget ud. Log venligst ind igen.