@@ -4,7 +4,7 @@ Google Summer of Code 2019
4
4
==========================
5
5
6
6
AboutCode is participating in the Google Summer of Code in 2019 as a mentoring org. This page
7
- contain all the information for students and anyone else interested in helping.
7
+ contains all the information for students and anyone else interested in helping.
8
8
9
9
AboutCode is a family of FOSS projects to uncover data ... about software code:
10
10
@@ -13,7 +13,7 @@ AboutCode is a family of FOSS projects to uncover data ... about software code:
13
13
- what is its license? copyright?
14
14
- is the code secure, maintained, well coded?
15
15
16
- All these are questions that are important to answer: there are million
16
+ All these are questions that are important to answer: there are millions
17
17
of free and open source software components available on the web for reuse.
18
18
19
19
Knowing where a software package comes from, what is its license and if it is
@@ -420,11 +420,11 @@ DeltaCode projects
420
420
DeltaCode is a tool to compare and report scan differences.
421
421
When comparing files, it only uses exact comparison.
422
422
The goal of this project is to improve the usefulness of the delta by also
423
- finding files that are mostly the same (e.g. quasi or nrea duplicates) vs. files
423
+ finding files that are mostly the same (e.g. quasi or near duplicates) vs. files
424
424
that are completely different. Then the DeltaCode comparison core should be
425
425
updated accordingly to detect and report material changes to scans (such as
426
- new, update or removed licenses, origins and packages) when changes are also
427
- meterial in the code files (e.g. such that small changes may be ignored)
426
+ new, updated or removed licenses, origins and packages) when changes are also
427
+ material in the code files (e.g. such that small changes may be ignored)
428
428
429
429
430
430
- **Level **
@@ -454,7 +454,7 @@ TraceCode projects
454
454
455
455
TraceCode does system call tracing only today. The primary goal of this project
456
456
is to create a tool that provides the same results as the strace-based tracing
457
- but would be using using ELF symbols, DWARF debug symbols, signatures or string
457
+ but would be using ELF symbols, DWARF debug symbols, signatures or string
458
458
matching to determine when and how a source code file is built in a binary
459
459
using only a static analysis. The primary target should be Linux executables,
460
460
though the code should be designed to be extensible to Windows PE and macOS Dylib and exes.
@@ -593,13 +593,13 @@ The features and TODO for this updated server would be:
593
593
- Aggregate more and new packages vulnerabilities feeds,
594
594
595
595
- Automating correlation: add smart relationship detection to infer new
596
- relatiosnhips between available packages and vulnerabilities from mining the
596
+ relationships between available packages and vulnerabilities from mining the
597
597
graph of existing relations.
598
598
599
599
- Create a ScanCode plugin to report vulnerabilities with detected packages
600
600
using this data.
601
601
602
- - Integrate API lookup on the server withe the AboutCode Manager UI
602
+ - Integrate API lookup on the server with the AboutCode Manager UI
603
603
604
604
- Create a UI and model for community curation of vulnerability to package
605
605
mappings, correlations and enhancements.
@@ -657,7 +657,7 @@ There are several areas to research and prototype such as:
657
657
- A data structure to match efficiently a batch of fix-width checksums (e.g.
658
658
SHA1) against a large index of such checksums, where each checksum points to
659
659
one or more files or packages. A possible direction is to use finite state
660
- transducers, specialized B-tree indexes, blomm -like filters. Since when a
660
+ transducers, specialized B-tree indexes, bloom -like filters. Since when a
661
661
codebase is being matched there can be millions of lookups to do, the batch
662
662
matching is preferred.
663
663
@@ -677,7 +677,7 @@ There are several areas to research and prototype such as:
677
677
678
678
- Feature hashing research: we deal with many "features" and hashing to limit
679
679
the number and size of the each features seems to be a valuable thing. The goal
680
- is to research the validaty of feature hashing with short hashes (15, 16 and
680
+ is to research the validity of feature hashing with short hashes (15, 16 and
681
681
32 bits) and evaluate if this leads to acceptable false-positive and loss of
682
682
accuracy in the context of the data structures mentioned above.
683
683
@@ -708,5 +708,5 @@ This is a green field project.
708
708
Mentoring
709
709
---------
710
710
711
- We welcome new mentors to help with the program and require some good unerstanding of the project
711
+ We welcome new mentors to help with the program and require some good understanding of the project
712
712
codebase and domain to join as a mentor. Contact the team on Gitter.
0 commit comments